(1) GENERAL INFORMATION: 



(i) APPLICANT: Charles Kunsch 
Steven C. Barash
Michael R. Fannon 

(ii) TITLE OF INVENTION: Staphylococcus aureus Polynucleotides and 
Sequences 

(iii) NUMBER OF SEQUENCES: 5255 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Human Genome Sciences, Inc. 

(B) STREET: 14203 Shady Grove Road 

(C) CITY: Rockville 

(D) STATE: Maryland 

(E) COUNTRY: USA 

(F) ZIP: 20850 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.50 inch, 1.4Mb storage 

(B) COMPUTER: HP Vectra 486/33 

(C) OPERATING SYSTEM: MSDOS version 6.2 

(D) SOFTWARE: ASCII Text 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/781,986

(B) FILING DATE: January 3, 1997 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Hymel, Lin J. 

(B) REGISTRATION NUMBER: 45,414 

(C) REFERENCE/DOCKET NUMBER: PB248D1 

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: (301) 25^6015 

(B) TELEFAX: (301) 309-8439 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5895 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

TCCATTATGA AGTCACAAGT ACTATAAGCT GCGATGTTAC CAATGTTTTT TAAAATCCCA 60 

GTAATAAAAT CAAAAAATAA GTTAAATAAT GTATTCATTT TAAGTCCTCC TTARAAAGa 120 

aaataGGTAA TAATGTAATA GCTTCTATTA TGATGCCTAA TTGAATGAAT TGGGCAAATG 180 

GCTCTTTGAT GATAAGTGTG ATAATGAAAA GGGTTAAACT AACAATAATC GCATAATATT 240

TTTTTCGTTT AATAAGTCGC ACAGGAATGG GCTTCTTTTT AGTTGCTGCA GGAGCATATA 300 

CTGAGATTAC ACCTAAAGAA ATAACTGTTA AAATAATCAT AATTAAAAAG TTAATATGAA 360

AATTTACTAT TACTAAAGGT AAAAGTATAA ATAGTATAAT ACTTTCTACA TAACACCAAA 420
AAGAAGAAGG TGCATGTGCa CCATGTGCAT GtCTTCTTAT TAAATAAAAT GTTAAATTCG 480

TAATTAACGT AAACAGAAAA ATGTTTAAAA TATAGEAAT AGTATACATA ACAATTAATT 540

TACCTATATT TTTAGCTAAG ACCTGCATCC CTAATCGTAC TTGCAAAAAT TGAATATGAT 600

CTAAGTTATT TCTCTTTTGA AGATACGTGG CAAACTGGTC AATTTTATTA TCAAAATAAT 660

TCAATTTTAC ACCACTCTCC TCACTGTCAT TATACGATTT AGTACAATCT TTTATCATTA 720 

TATTGCCTAA CTGTAGGAAA TAAATACTTA ACTGTTAAAT GTAATTTGTA TTTAATATTT 780

TAACATAAAA AAATTTACAG TTAAGAATAA AAAACGACTA GTTAAGAAAA ATTGGAAAAT 840

AAATGCTTTT AGCATGTTTT AATATAACTA GATCACAGAG ATGTGATGGA AAATAGTTGA 900

TGAGTTGTTT AATTTTAAGA ATTTTTATCT TAATTAAGGA AGGAGTGATT TCAATGGCAC 960 

AAGATATCAT TTCAACAATC GGTGACTTAG TAAAATGGAT TATCGACACA GTGAACAAAT 1020

TCACTAAAAA ATAAGATGAA TAATTAATTA CTTTCATTGT AAATTTGTTA TCTTCGTATA 1080

GTACTAAAAG TATGAGTTAT TAAGCCATCC CAACTTAATA ACSTGTAAA ATTAGCAAGT 1140

GAGTAACATT TGCTAGTAGA GTTAGTTTCC TTGGACTCAG TGCTATGTAT TTTTCTTAAT 1200 

TATCATTACA GATAATTATT TCTAGCATGT AAGCTATCGT AAACAACATC GATTTATCAT 1260

TATTTGATAA ATAAAATTTT TTTCATAATT AATAACATCC CCAAAAATAG ATTGAAAAAA 1320

TAACTGTAAA ACATTCCCTT AATAATAAGT ATGGTCGTGA GCCCCTCCCA AGCTCGCGGC 1380 

CTTTTTTGTA ATGAAGAAGG GATGAGTTAA TCATCATTAT GAGACCCGCC GTTAAAATAT 1440

ATGAATAAGT CTAATGTTGG AAAAGGTCAA AAAATTAATC AATTTAATTA AGAAAATCAT 1500 

TCATTTGCAA AGGGCGAAAT GGGTXTTAC TGAGTTATCT ATTATAAAAA AATAAACATA 1560

GACTTATGAA AAATCTCTCA TAAATCTATG TTTAGTCATG aCATGTGTTA AATATTATTT 1620

CGGGCGCTTC TTATTTATAC AAATCTAATT TAATACTTTT AAATACAGGT ATATTTTCgC 1680

GTTGCTGTTC TACTTCATTT AAGTTTAAAT CTACAGTCAA AATATCTGCG GATTCATTTA 1740

ATTCTCCAAC TAAATCTCCA TTTGGGTTTA TAACTATCGA ATGACCAGCA TATTCTGTGT 1800 

TACCATCGAA TCCAGTGCTA TTAGTTCCAA TGACAAACAT ATTATTTTCA ATTGCACGTG 1860

CCTTTAGTAA TGAATGCCAA TGTTGAAGAC GTGACATAGG CCATTGCGCC ACATAAAATG 1920

CAATTTTAGC ACCACTACGA GCAGGATATC TTAATAATTC TGGAAAACGT AAATCATAAC 1980 

AGATAAGTTG GGTCACATAA GTACCGTCAG ACAATTGAAA GGGTTCAGCT ACGTATTCGC 2040

CAGCGGTTAA AAATTCATGC TCTCTTAACA TAGGAACTAA ATGAACTTTG TCGTATTCaT 2100 

TAATCAGCTG GCCACTTTTA TTCACACTAA AKCTGTATT AAATATTTGA TTGTTTCTAA 2160

TGTTAGAAAC TGACCCAGCT ACGATATCGA CTTTATATTT TTCAGCTAAA TGTTTAATAA 2220

ATGAAAAACT TTGTCCTAGA TTATTATCTG CTTTTTCATT TAAATGCTCT AAATCATAGC 2280 
CATTATTCCA CATTTCAGGT AAAACGACTA CATCTACTTC AGCATTCATA TTTTTTTGA 2340
ACCATTGCGT TATTTGAGTT TCATTTTTAG AACTATCTCC AAAAACAATC GGTAATTGAT 2400

AAATTTGGAC TTTCATAACA TCACATCCTT GATAGATCTT ATATATAACT TACTAAAAGT 2460

TATGTTGAAA CGCAAAAAAC GAGCACAAGA CATAAAATCA AAGTCCTAGG CTCTACAAAG 2520

TTATATTGAC AGTAGTTGAT GGGGCCCCAA CATAGAGAAA TTGGAACACC AATTTCTACA 2580

GACAATGCAA GTTGGGGTGG GCTCTAACAT AAAGAAATAC TTTTTCTTTA GAAATTAGTA 2640

TTTCTTATAC ATGAGTTTTA CTCATGTATT CCTATTCTTA AGTGCACATT AGCAGCGGCT 2700

AATGTGTAAG AACTACTACA TAATGAATAA CTAATGATTC TTTATCATTT CTGTCCCATT 2760

CCTAACAATA TATTGATTAT TTTTTTATTA CGAAACGATC TTCCACTGGA TTAAATGTTT 2820 

TTTCGCCAGC AGCTTCACGA ATATCACCAA ATGGCATTTG AGCAATAAGT TTCCAACTTT 2880

TAGGAATATT AAATTCATTT GAAGTCATCT CATCAACAAG TGGATTATAG TGTTGTAATG 2940

AAGCACCTAT GCCTTTAGTA GCTAATGCAG TCCAAATTGC AAATTGATGC ATGGCATTTG 3000 

TTTGAGTTGA CCATATTGCA AAATTATCAT AGTAGTTTGG CATTTGTTCT TGTAAACCAC 3060

TTACAACATC TTGATCTTCA TAAAACAAAA TTGTACCGTA TGAATGTTTG AAGTTATCAA 3120 

TTTTTTGTTC AGTTGGCTCG 2AATCACGAT TCTCTCCCAT GACTTCTTTT AAAATTGCTT 3180 

TTGTGTTATC CCAAAATTTA TTATTGTTGT CATTTAACAA GAGAACAATT CTAGTTGATT 3240

GAGAATTAAA TGATGAAGGA ACATGTTTAA CTGCATGTGC AATCATTGAT TCTAATTCGT 3300

CATCGCTAAT TGATATCGAA TCTTTCAAAT TATATATTGA ACGTCTTCT TCCATTGCAT 3360 

TGTCAAAAGT CATTGCTTTT TTATCTTTTT TAAATAAGCC CATAATTATT GCTCCTTCTT 3420

TAGTAAAGAA TACTTAATAG ACTAAGTATA AAATTTATAC TCGTACTTGT AAAGCAATAT 3480

TTACGAAAAT TTCAAGAATA TTAATATTCA TTTTCAAATT CCAAATATAA ATGCATTTTC 3540

AACGCATATT TATTATACTT AGATTAATAC TTACATGAAA AAGGGAGGTG TCTCGTGAAA 3600

TGTCATATCA TTGGTTTAAG AAAATGTTAC TTTCAACAAG TATTTTAATT TTAAGTAGTA 3660

GTAGTTTAGG GCTTGCAACG CACACAGTTG AAGCAAAGGA TAACTTAAAT GGAGAAAAAC 3720 

CAACTACTAA TTTGAATCAT AAT AT AAC IT CACCATCAGT AAATAGTGAA ATGAATAATA 3780

ATGAGACTGG GACACCTCAC GAATCAAATC AAACGGGTAA TGAAGGAACA GGTTCGAATA 3840

GTCGTGATGC TAATCCTGAT TCGAATAATG TGAAGCCAGA CTCAAACAAC CAAAACCCAA 3900 

GTACAGATTC AAAACCAGAC CCAAATAACC AAAACTCAAG TCCGAATCCT AAACAGATC 3960

CAGATAACCC GAAACCAAAA CCGGATCCAA AACCAGACCC AGATAAACCA AAGCCAAATC 4020

CGGATCCAAA ACCAGATCCA GATAACCCGA AACCAAATCC AGATCCAAAA CCAGACCCAG 4080
ATAAACCAAA GCCAAATCCG GATCCAAAAC CAGATCCAGA TAAACCAAAG CCAAATCCGA 4140

ATCCAAAACC AGACCCTAAT AAGCCAAATC CTAACCCGTC ACCAGATCCC GATCAACCTG 4200 

GGGATTCCAA TCATTCTGGT GGCTCGAAAA ATGGGGGGAC ATGGAACCCA AATGCTTCAG 4260

ATGGATCTAA TCAAGGTCAA TGGCAACCAA ATGGGAATCA AGGAAACTCA CAAAATCCTA 4320

CTGGTAATGA TTTTGTATCC CAACGATTTT TAGCCTGGC AAATGGGGCT TACAAGTATA 4380

ATCCGTATAT TTTAAATCAA ATTAATAAGT TGGGCAAAGA TTATGGAGAA GTTACTGATG 4440

AAGACATTTA TAATATTATT CGAAAACAAa ATTTCAGCGG AAATGCATAT TTAAATGGAT 4500

TACAACAGCA ATCGAATTAC TTTAGATTCC aATATTTCAA TCCATTGAAA TCAGAAAGGT 4560

ACTATCGTAA TTTAGATGAA CAAGTACTCG CATTAATTAC TGGTGAAATT GGATCAATGC 4620

CAGATTTGAA AAAGCCCGAA GATAAGCCGG ATTCAAAACA ACGCTCATTT GAACCGCATG 4680

AAAAAGACGA TTTTACAGTA GTTAAAAAAC AAGAAGATAA TAAGAAAAGT GCGTCAACTG 4740

CATATAGTAA AAGTTGGCTA GCAATTGTAT GTTCTATGAT GGTGGTATTT TCAATCATGC 4800

TATTCTTATT TGTAAAGCGA AATAAAAAGA AAAATAAAAA CGAATCACAG CGACGATAAT 4860

CCGTGTGTGA TTCGTTTTTT TTATTATGGA ATAAAAATGT GATATATAAA ATTCGCTTGT 4920

TCCGTGGCTT TTTTCAAAGC CTCAGGATTA AGTAATTGGA ATSAACGAC AAATCCGTTT 4980

TGTAACATAT GGATAATAAT TGGAACAGCA AGCCGTTTTG TCCAAACATA TGCTAATGAA 5040

AAAATGACAC CCATACCAAA ATAAACTGGA ATAAATTTGA AATCATTATG TGCTAATGCA 5100

AATATTAATG AACTTACTGT TGTAGCAATA ATAAATGCCA CGATACGATT ACCTTTAATC 5160 

GCATTAAATA ATTCTCCAAA GATTACTTTT CTGAATACAT ATTCTTCTAA TAAAGGACCA 5220

ATAATAGATA CAAAGAAGAT AAATATAGGT ATTTTTCGAG CAATAATAAT TAGCTTTTCT 5280

GTATTAGGAC TTACTTGTTG TCCACCATAA ATTTGCGTTA ATACAATGCT CACTACCATT 5340

TGATAAATCA TTACCAATGC AAATCCAAGC AATGCCCATG GAATGATATA TTTTTTAGGT 5400

TCTTTAACTT CTAATTCTAA TTTTGTTGGA TTTTTAATTT TTAAATTAAT TAAAATAATC 5460

GTCGTGGCGG CGATTAAAAA TAGAACAAGT TGTATGTAAA TGACTGCTTT AGTCAGTTCT 5520 

ATGCCACTAT ATTGTACAAA TGGTAATTTT TTTACAATGA GAAGCGGTAA AAATTGAGAC 5580

AATATATAAA TAATAACAGT TAGCAATGAT GCCCATAATC tTGTCATAAT TTTCCTCCAA 5640

ATATTTGTTT ATAATTTATT TTATCGTAAA TAACTTGAAG TTACAAAACT TAATTAAAAG 5700 

GTTATGACTT GAAATTTTGA CCAAATTTGA TTATTATAAA TGTATGTTAG CACTCTTTAA 5760

TGTTAAGTGC TAAACTTTAG GTTTTTTAAG GAGGAACAAT CATGCTAAAA CCAATTGGAA 5820

ATCGTGTGAT TATTGAGAAA AAAGAACAAG AACAAACAAC TAAAAGTGGn ATTGTTTAAC 5880

TGATAGTGCT AAAGA 5895
(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6796 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

TTTGAAAAAA CAAGGTACGA TTGGTTTAAT ACATATATG AGAACCGATT CTACACGTAT 60 

TTCaGATACT GCCAAAGTTG AAGCAAAACA GTATATAACT GATAAATACG GTGAATCTTA 120 

CACTTCTAAA CGTAAAGCAT CAGGGAAACA AGGTGACCaA GATGCCCATG AGGCTATTAG 180

ACCTTCAAGT ACTATGCGTA CGCCAGATGA TATGAAGTCA TTTTTGACGA AAGACCffiTA 240

CCGATTATAC AAATTAATTT GGGAACGATT TGTTGCTAGT CAAATGGCTC CAGCAATACT 300 

TGATACAGTC TCATTAGACA TAACACAAGG TGACATTAAA TTTAGAGCGA ATGGTCAAAC 360 

AATCAAGTTT AAAGGATTTA TGACACTTTA TGTAGAAACT AAAGATGATA GTGATAGCGA 420

AAAGGAAAAT AAACTGCCTA AATTAGAGCA AGGTGATAAA GTCACAGCAA CTCAAATTGA 480

ACCAGCTCAA CACTATACAC AACCACCTCC AAGATATACT GAGGCGAGAT TAGTAAAAAC 540

ACTAGAAGAA TTGAAAATTG GGCGACCATC AACTTATGCA CCGACAATAG ATACGATTCA 600

AAAGCGTAAC TATGTCAAAT TAGAAAGTAA GCGTTTTGT CCTACTGAGT TGGGAGAAAT 660 

AGTTCATGAA CAAGTGAAAG AATACTTCCC AGAGATTATT GATGTGGAAT TCACAGTGAA 720

TATGGAAACG TTACTTGATA AGATTGCAGA AGGCGACATT ACATGGAGGA AAGTAATCGA 780

CGGTTTCTTT AGTAGCTTTA AACAAGATGT TGAACGTGCT GAAGAAGAGA TGGAAAAGAT 840

TGAAATCAAA GATGAGCCAG CCGGTGAAGA CTGTGAAATT TGTGGTTCTC CTATGGTTAT 900

AAAAATGGGA CGCTATGGTA AGTTCATGGC TTGCTCAAAC TTCCCGGATT GTCGTAATAC 960

AAAAGCGATA GTTAAGTCTA TTGGTGTTAA ATGTCCAAAA TGTAATGaTG GTGACGTCGT 1020

AGAAAGAAAA TCTAAAAAGA ATCGTGTCTT TTATGGATGT TCGAAATATC CTGAATGCGA 1080

CTTTATCTCT TGGGATAAGC CGATTGGAAG AGATTGTCCA AAATGTAACC AATATCTTGT 1140

TGAAAATAAA AAAGGCAAGA CAACACAAGT AATATGTTCA AATTGCGATT ATAAAGAGGC 1200

AGCGCAGAAA TAATATTTTT ATTTCCTAGA TACATTTTAA GATTGTAAA TAGAATCATT 1260

AGTGAATCTT ATTTTAAAGA TAGTAAAGGA TTAATCTAAA TAAGTGCGGA TAATATAAAC 1320 

ATAACAACAT AATTAAmAGA CATAAATGAC aATAAAAGGA GTATAGAAAT GACTCAAACT 1380

GTAAATGTAA TAGGTGCTGG TCTTGCCGGT TCAGAAGCGG CATATCAATT AGCTGAAAGA 1440






GGAATTAAAG TTAATCTAAT AGAGATGAGA CCTGTTAAAC AAACACCAGC GCACCATACT 1500

GATAAATTTG CGGAACTTGT ATGTTCCAAT TCATTACGCG GAAATGCTTT AACTAATGGT 1560

GTGGGTGTTT TAAAAGAAGA AATGAGAAGA TTGAATTCTA TAATTATTGA AGCGGCTGAT 1620 

AAGGCACGAG TTCCAGCTGG TGGTGCATfA GCAGTTGATA GACACGATTT TTCAGGTTAT 1680

ATTACTGAAA CACTTAAAAA TCATGAAAAT ATCACAGTTA TTAATGAAGA AATTAATGCC 1740

ATTCCAGATG GATACACAAT TATCGCAACA GGACCACTTA CTACAGAAAC CCTTGCGCAA 1800

GAAATAGTGG ACATTACTGG TAAAGATCAA CTTTATTTCT ATGATGCGGC TGECCAATT 1860

ATTGAAAAAG AATCTATTGA TATGGATAAA GTTTACTTAA AGTCCCGTTA TGATAAAGGT 1920

GAAGCTGCAT ATTTAAACTG TCCTATGACT GAGGATGAAT TTAATCGCTT TTATGATGCA 1980

GTATTAGAAG CTGAAGTTGC GCCTGTAAAT TCATTTGAAA AAGAAAAATA TTTCGAGGGT 2040

TGTATGCCTT TTGAAGTAAT GGCAGAACGC GGACGCAAGA CATTACTATT TGGACCAATG 2100 

AAACCAGTAG GATTAGAAGA TCCAAAGACT GGGAAACGTC CTTATGCGGT GGTTCAATTA 2160 

AGACAAGATG ACGCTGCTGG TACACTCTAC AATATTGTTG GCTTCCAAAC GCATTTAAAA 2220 

TGGGGAGCTC AAAAAGAAGT CATTAAATTA ATTC(AGGCT TAGAAAATGT TGATATTGTT 2280

AGATATGGTG TGATGCATAG AAATACCTTC ATTAATTCAC CGGACGTATT AAACGAGAAA 2340

TATGAATTGA TTTCACAACC AAACATACAG TTTGCGGGAC AAATGACTGG TGTTGAAGGT 2400

TATGTAGAAA GCGCAgcTAG CGGCTTAGTT GCAGGTATCA ATCTTGCGCA TAAAATATTA 2460

GGCAAGGGTG AGGTAGTATT TCCGAGAGAA ACAATGATTG GAAGTATGGC TTACTATATT 2520

TCTCATGCTA AAAACAATAA GAATTTCCAA CCTATGAATG CTAACTTCGG GTTATTACCA 2580 

TCTTTAGAAA CTAGAATTAA AGATAAAAAA GAACGCTATG AAGCACAAGC TAATAGAGCT 2640

TTGGATTACT TAGAAAATTT CAAAAAAACT TTATAAAATA GTTAGAAAGA CTAGATATGC 2700

TATTCATTCT TAAGTCATCA ACGAGTAAGT AATGACTTTC TAAATGGAAA ATACTTATCC 2760

TAGTCTTTTT AATTTTGGAA TTGTTACGTA TTTCTGACAA TTTAGAATTC GCATTCAAAA 2820 

AATATCTAAA TAAATAACAC GCAATAAGTT GATTGATGTA AfiTGTAAGA GAATGTTTTA 2880

AATAAACTTT ATTTAAAAGG CAATGAAATA ATAAATGGCA AGGCTATTAA TAAAGACTTT 2940

TAGTAATTAA TTTAAAAAAG AGGTATTCTA ATTAACAGGT TTTCCGATTA GTTACAATTA 3000 

TTTAATTCTC AAAAGATTTA GAATTGATTA TCAAATTACT GTAAGCCCTT TGCTGTATAT 3060

GCTACAATTC TTATTGATGG AGGGTAAATG TATTGAATCA TATTCAAGAT GCGTTTTTAA 3120 

ATACATTGAA AGTTGAACGG AATTTTTCGG AACACACATT GAAATCATAT CAAGATGACT 3180

TAATTCAGTT TAATCAATTT TTAGAACAAG AACATTTAGA GTTGAATACT TTTGAATACA 3240

GAGATGCTAG AAATTATTTG AGCTATTTAT ATTCAAATCA TTTGAAAAGA ACATCTGTTT 3300 
CTCGTAAAAT CTCAACGTTA AGAACTTTCT ATGAATATTG GATGACGCTT GATGAGAACA 3360

TTATTAATCC ATTTGTTCAA TTAGTACATC CGAAAAAAGA AAAATATCTT CCGCAATTCT 3420

TTTACGAAGA AGAAATGGAA GCGTTATTCA AAACTGTAGA AGAGGACACT TCAAAAAATT 3480

TACGGGATCG AGTTATTCTT GAATTGTTGT ATGCTACAGG CATCCGTGTT TCGGAATTAG 3540

TAAATATTAA AAAACAAGAT ATAGATTTTT ACGCGAATGG TGTTACCGTA TTAGGAAAAG 3600

GGAGCAAAGA GCGCTTTGTA CCGTTTGGTG CTTATTGTAG ACAAAGCATC GAAAATTATT 3660

TAGAACATTT CAAACCAATT CAGTCATGCA ATCATGATTT TCTTATTGTA AATATGAAGG 3720

GTGAAGCAAT CACTGAACGC GGTGTACGAT ATGTTTTAAA TGATATTGTT AAACGAACAG 3780

CAGGCGTAAG TGaGATTCAT CCCCACAAGC TCAGACATAC ATTTGCAACG CATTTATTGA 3840

ATCAAGGTGC AGACCTAAGA ACAGTACAAT (ETTATTAGG TCATGTTAAT TTGTCAACAA 3900 

CTGGTAAATA TACACACGTA TCTAACCAAC AATTAAGAAA AGTGTATCTA AATGCACATC 3960

CTCGAGCGAA AAAGGAGAAT GAAACATGAG TAATACAACA TTACATGCAA CAACAATTTA 4020

TGCTGTAAGA CATAATGGGA AAGCAGCTAT GGCTGGAGAT GGGCAAGTAA CGCTTGBCA 4080

ACAAGTCATC ATGAAACAAA CGGCAAGAAA AGTGCGACGT TTATATGAAG GTAAAGTGTT 4140

AGCTGGTTTC GCAGGTAGTG TAGCAGATGC GTTTACGTTA TTTGAAAAAT TCGAAACAAA 4200

ATTACAACAG TTTAGTGGTA ACTTAGAAAG AGCTGCTGTT GAATTGGCAC AAGAATGGCG 4260

AGGCGATAAA CAATTACGTC AATTAGAAGC TATGCTAATT GTAATGGATA AAGATGCTAT 4320

TTTAGTTGTC AGTGGAACTG GCGAAGTTAT TGCTCCAGAT GATGACCTTA TCGCTATTGG 4380

ATCAGGAGGC AACTACGCAT TAAGCGCAGG ACGTGCATTG AAACGCCATG CATCGCATTT 4440

GTCTGCTGAA GAAATGGCAT ATGAGAGCTT GAAAGTAGG GCTGATATTT GTGTCTTTAC 4500 

CAACGATAAT ATTGTTGTCG AAACACTATA ATAATCAGAG CACGATAAAT AATTACGAGC 4560

AATTAATTTT AGTTAAAAGA CGGAGGAATG AAATTAATGG ATACAGCTGG AATAAGATTA 4620

ACTCCAAAAG AAATCGTATC TAAATTAAAT GAATACATCG TTGGACAAAA TGATGCTAAA 4680

CGTAAAGTGG CAATTGCCCT ACGTAATCGA TACAGAAGAA GTTTATTAGA TGAGGAATCA 4740

AAGCAAGAAA TTTCACCTAA AAATATTTTG ATGATTGGAC CAACCGGCGT TGGTAAAACT 4800

GAAATTGCAA GAAGAATGGC CAAAGTTGTC GGCGCGCCAT TTATAAAAGT AGAAGCTACT 4860

AAATTTACTG AGGTAGGTTA TGTAGGACGA GATGTTGAAA GTATGGTTAG AGATCTTGTT 4920

GATGTTTCAG TAAGATTAGT CAAGGCGCAG AAAAAATCAT TGGTACAAGA TGAAGCAACA 4980

GCTAAGGCCA ATGAAAAACT TGTTAAGTTA TTAGTTCCAA GTATGAAAAA GAAAGCGTCT 5040

CAAACGAATA ATCCTTTAGA GTCACTTTTC GGAGGTGCAA TTCCAATTT CGGACAAAAT 5100 
AACGAAGATG AAGAAGAACC ACCTACTGAG GAAATTAAAA CAAAACGTTC TGAAATTAAG 5160

AGACAGCTAG AAGAAGGCAA ACTTGAAAAA GAAAAGGTAA GAATTAAAGT CGAACAAGAT 5220

CCTGGTGCTT TAGGTATGCT AGGTACAAAT CAAAATCAGC AAATGCAAGA GATGATGAAT 5280

CAATTAATGC CTAAAAAGAA AGTTGAGCGA GAAGTTGCTG TTGAGACGGC AAGGAAAATC 5340

TTAGCTGATA GTTATGCGGA TGAACTAATT GATCAAGAAA GCGCTAACCA AGAAGCGCTT 5400

GAATTAGCAG AACAAATGGG TATCATCTTT ATAGATGAAA TCGACAAAGT TGCGACGAAT 5460

AATCATAATA GTGGTCAAGA TGTCTCAiGA CAAGGTGTTC AAAGAGATAT TTTACCTATA 5520

CTTGAAGGTA GCGTTATTCA AACCAAATAT GGTACTGTGA ATACTGAACA TATGCTGTTT 5580

ATAGGTGCTG GAGCTTTCCA TGTATCTAAG CCGAGTGACT TGATACCAGA ATTGCAAGGT 5640

CGTTTTCCGA TTAGAGTTGA ACTTGATAGT TTATCGGTAG AAGATTTTGT AAHATTTTG 5700

ACAGAACCAA AATTGTCATT AATTAAACAA TATGAAGCAT TGCTTCAAAC AGAAGAAGTT 5760

ACTGTAAACT TTACCGATGA AGCAATTACT CGCTTAGCTG AGATTGCTTA TCAAGTAAAT 5820

CAAGATACAG ACAACATTGG TGCACGTCGA CTTCATACAA TTTTAGAAAA GATGCTAGAA 5880

GATTTATCAT TCGAAGCACC AAGTATGCCG AATGCAGTTG TAGATATTAC CCCACAATAT 5940

GTTGATGATA AATTAAAATC AATTTCAACA AATAAAGATT TAAGTGCATT TATTCTATAA 6000 

AAATATACAA AAGGAGAAAA ATTCATGAGC TTATTATCTA AAACGAGAGA GTTAAACACG 6060

TTACTTCAAA AACACAAAGG TATTGCGGTT GATTTAAAG ATGTAGCACA AACGATTAGT 6120

AGCGTAACTG TAACAAATGT ATTTATTGTA TCGCGTCGAG GTAAAATTTT AGGATCGAGT 6180

CTAAATGAAT TATTAAAAAG TCAAAGAATT ATTCAAATGT TGGAAGAAAG ACATATTCCA 6240

AGTGAATATA CAGAACGATT AATGGAAGTT AAACAAACAG AATCAAATAT TGATATCGAC 6300 

AATGTATTAA CAGTATTCCC ACCTGAAAAC AGAGAATTAT TCATAGATAG TCGTACAACT 6360 

ATCTTCCCAA TTTTAGGTGG AGGGGAAAGA TTAGGTACAT TAGTACTTGG TCnAGTACAT 6420

GATGATTTTA ATGaAAATGA TTTGGTACTA GGTGAATATG CTGCTACAGT TATTGGTATG 6480

GAAaTCTTAC GTGAGAA3CA TAGTGAAGTA GAAAnAGAAG CGCGCGATAA AGCTGCTATT 6540

ACAATGGCAA TTAATTCATT ATCTTATTCT GAAAAAGAAG CGATTGAACA TATCTTTGAA 6600

GAACTTGGCG GTACGGAAGG CCTATTAATC GCATCAAAAG TTGCAGATAG AGTTGGTATT 6660 

ACTAGATCTG TAATTGTAAA TGCACTACGT AAATTAGAAA GECTGGTGT AATTGAATCA 6720

CGTTCTTTAG GAATGAAAGG TACTTTCATT AAAGTTAAAA AAGAAAAATT CTTAGATGAA 6780

TTAGAAAAAA GTAAAT 6796
(2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2073 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

ATCCTAAAAT TnAAAATTAT CACGCCTTTT GaACAGCTTT GTAACCaTCt GGACGATCAT 60

kAAATTCCaA TGTAAATCCT GGTTTAAaGT TGATCTTTAA CCTTATTTAA AyCACCAATT 120 

GTACGTATAT TATGTTGTTT AGCAAAATCA CGTTTTACAG CTAAAGCATA CGTATTGTTA 180 

TACTTCATTG GTTTTAACAT AGTCATTTGA TATTTCTTTT CAAGACTTTG CTTAGCTTGT 240 

TCATAAACTT TTTTCTCTTC TTTTGACTTC AATGGTTCTT TTGTTAATTC ACCTAAAACT 300 

GTTCCAGTAA ATTCTAAATA CCCATCTATA TCGTCAGATT TTAAAGCATT AAATAAAAAT 360

GCTGTTTTGC CCATACCATC TTTCACTTCT ACAGTATTTT TGGTCTCTTC TTCTATTAAA 420

ATTTTATACA TATTTGTAAT AATCGATGGC TCGGAGCCAA GCTTTCCAG TAACGTAATT 480

TTATCACCTT TTTGTGCAAA CATAGGAATA GCGATAGCCA GTATAATAAT CATCACTATA 540

ACACCTAACG TCATCAATAA CTTTTTATAA GATAATTTAG CCATAAATCT TAAAATTAAA 600 

TCAAATATAA TTGCCAATAA GGCTGCTGGA ATTGCACCTA ATAATATCAA CGATGCATTG 660

TTACGGTCTA TACCTAATAA AATTAAATCT CCTAGTCCGC CTGCACCAAT TAATGCTGCT 720

AGTGTTGCTG TACCTATAAT TAATACCATA GCCGTTCTTA CACCAGCCAT TATAACAGGC 780

ATTGCTATCG GAAGTTCGAC TTTAGTTAAA CGTCTAAATG GTTTCATACC TATACCTTTA 840

GCCGCTTCAA TGAGTGATGG ATCAACTTCT TTAATTCCAG TATACGTATT CCTTAAAATT 900 

GGTAACAACG CATACACTAC AAGTGCAATA ATTGCTGGCA CACGACCGAT ACCAAATAAA 960

GGAATCATTA AACCTAATAA TGCCAACGAT GGTATGGTTT GAAGAATTGC CGCAATATTC 1020 

ATTACGATTT CAGATATCGT TTTAGTCTTC GTTAATAAAA TACCTAATGG TACCGSATA 1080

GCAGTTGCAA TCAATAATGC GATAAATGAT ATTTGAATAT GTTCTATCAT TGTCGAAAAG 1140

AGTTGCCCCT TACGTTCACT CAATATGTCg AAAAAGTTAG TCATGTTGAG CTACCTCCTT 1200

TTTCTGGGAC AAATATTTGA AGATATCTTT CCTATCAATA ACATATTGAC CTACGCTATC 1260

TTCTTGCATG ACAATGACAC GCTCGCTCTC TGATAAAAGT TGATACAATA CTTCAATTGG 1320

TTGATTGTCA TAAACAATTG GATAAGCGCT CATAGATGTA ACCTCATCGA TTGGTTTCAT 1380

AATATCCAAG TCACGGATAA TTGCGTTCTC TTCAACACAT GGCGCATCAT CTTCTAAATG 1440

ACTACCCATA AATTGTTTAA CAAATTCACT TTGAGGATA TTTTTAAATC CTTCTGGTGT 1500

GTCAATTTGT TCAATATGCC CTTCATTCAA AAGACAAATC TTATCACCAA GTTTCATCGC 1560
CTCTTGAATA TCATGTGTAA CAAATATGAT TGTCTTCTTA ATTTTAGTTT GTAATTCAAT 1620

TAAATCATCT TGAAGTTTTT CTCGGCTGAT TGGGTCTAAT GCACTAAACG GTTCATCCAT 1680

TAAAATAACT GGTGGATCAG CTGCTAACGC ACGTATAACT CCTACACGTT GTCGTTGCCC 1740

CCCTGACAAT TCATCAGGTT TTCTGTTTTT ATATTTTTCA GGTTCTAATC CAACCATTTC 1800 

AAGTAATTCA TCTACTCTTT TATCTATATC TTTTTCTTTC CACTTTTTCA TTTGTGGCAC 1860

TTGTGCAAtA TTTTCTTTGa wTGTCaTATG TGGGAATAAT GCAATCTGCT GcAATACGTA 1920 

TCCAATATCC CAACkCATTT CGTATACTGG ATAATCACTT ATTGGTTTAT CTTTAAAATA 1980

AATATAACCT TCACTTAAGT GAATGAGTCG ATTAATCATT TTTAATGTCG TAGTTTTTCC 2040

ACAACCTGAA GGTCCAATTA GCACAAAAAA TTC 2073
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ACTATTCTAG CTTCATCAGT TATCATATAT TCTTTGAAAC ACTTGTAAGA AAATATAATG 60 

AGTATTTACT ACATAATGAT ATTTCAAATT AGAAAAAAGG AAGTTATGAT TTAATGGCCT 120 

TGAGCCTATC ATAACTTCCT TTTATCATTT TATTGTTGTG TTGATGTTTC GATAACGTGG 180 

TACATCTTAT CAAACATCAA TTCGAAACCA TGCACCATGG CATCATGATA TTCTTTTTTC 240

TTTTGCTTGT ATTCTAAATT AGTAAATCGT CTTTCTTTTT CAACTAATGA ACGATAATAA 300 

AATAGCATTT GGGTGCCACC TGTTTCACGT TCAAAAAATT CTACCTCAAT GACATCTTGC 360 

GTTTCACTTA GTCCAGGCAT ACCGSTAGTC ATCTTAACGT ATTCATCCAT AACTAAAGAT 420 

TCATAAATGC CTTCAATCAC ATTTACTTTG CCATTACGTT GTTGATCTAC AATACGATAT 480

TTACCGCCTT CTTTAACGTC CGCTTCAATC TCTTTATTCG TTCTGGCTGA TGTCATAAAC 540

CATTGTTTCA ACAAATCTTT CTTTGTCCAA GCTTCGTATA CTAACTCTGG AGAAAATTTA 600

TAAAGCTTTT CAATTTCAAC TTCGACATGT TCATTCTCTA CATTAAATTT TGCCACTGTT 660 

GTCCACCCAC TTTCGCTCTT ACTTTTATTT TAACGTATTT TTGCTCAGTT CCAAACATAG 720 

ATGATCATCA TTTTTAAAAG ATTAGCGTTA TACGGTGAGT ACAACATGAT CTGTTAATAT 780

AACAAGCCAC CTTACTTGGC TACATCGATA TATTGTTAAG CATTAATGTT TCATTTCTTG 840

ACTAGTGTTC TTTTTTAGCT TTGGAAAATT AAATAAAATC GCAATAAGTC CGCATACACC 900 

TAATAATATA GGATAAATGC TGTATGGGAA TAACATTAAC GGTGAAATAC CAGCTACACC 960 
AGCCGCTGaA ATGACTTGCG GGCTATATGG TffiTAAACCT TGGAAGCAGC CTCCAAATAT 1020

ATCAAGAATA CTTGCTGATT TCCTTGAATC TACATCATAT TCATCTGCAA TATTTTTAGC 1080 

TAAAGGACCT GACATAATAA TAGAGATGGT GTTGTTTGCC GTGGCAATAT CTGCGACACT 1140 

TACCAAACTA GCAATTCCTA ATTCTGCGCC ACGCTTTGAT TTCACTTTAG AGCGAACffiA 1200 

TTGCAACAAC CATTCAATAC CACCATTGTG TTGAATAATA CCGACTAAAC CACCAATTAG 1260

CAACGCAATC ATAGCAATAT CTTCCATGCT TATAATACCT TTGGACACTG CATCTAGTAG 1320 

CCCCATCCAA CCGAATGAAC CATCTATGAG ACCAATGATT CCGGCTAATA ATGTTCCGCC 1380 

AATCAATACG ATAATGACAT TTACACCTAA TAATGCTAAT ACCAATACTA AGATATACGG 1440

TACAACTTTA ATTAGATTAT AATCATAGTt TTTAGCATGA TTTAAAGAAA TGCCATTCGT 1500 

TAAGAAATAC AGAATAATAA TCGTTAAAAT AGCACCTGGC AATACAATTT TAAAGTTTAC 1560

TCTGAATTTA TCTTTCATTT TCGTATGTTG TGTTCTAACC GCAGCAATTG TTGTATCTGA 1620

AATCATTGAT AGATTATCGC CGAACATTGC ACCTCCAACA ACTGTAGCCa tTGctAGCGC 1680

AGTTGGTACA TCTGTCGCTT GTGCAAATCC GTAACCAACA GGTGCTATTG CAGCAACAGT 1740

TCCTACAGAC GTCCCCATAG ATATAGAAAC AAACATACAA ATCACAAACA ATCCTACAAT 1800

AATTAAATTT TCTGGGATTA ATGATAGTCC TAAATTAACT GTCGACTTTA CGCCACCCAT 1860

TTTTTCAGCT GTATTTGAAA ATGCACCTGC TAAAATAAAA ATCAACATCA TTAAAACAAT 1920 

GTTTGAATGG CCTGCACCTT TCGTGAAGAC CTCAACTTTT TTAGCAAATG ATTCTTTTCG 1980 

ATTCATTAAT AACGCCACAA TTACCGTTAT CGTAATTGCA ACATTTAATG GCATTGAAGT 2040

AAAATCACCT GTGATAATAC CTACGCCTAA AAACAACGCC ACAAATAATA ACAAGGGGAA 2100 

TAATGCCCAA GCATTGCTCT TTTTATGTAC TTCCATCCTT TTTACCTGCT TTCCAATTAA 2160 

AAATACCTCT TTCTCACAAA CGATGAAGAA AGAGGTTTTC ATGTGCTTA CCTGCTTATC 2220 

TTCAAACCAT TACGGTTACT GGAATTGGCA CATTCGAGAT GTTGCCGAGG CTTCATAGGG 2280

CCAGTCCCTC CACCTCTCTA GATAAGTGAT GCTTATTTAC GTTTACGTTA CAAGATAATC 2340

CTTAGTACGT CAATCATAAA TTAATCAGGA GTCGTATAAT ATTTTTCATA AACAATCATT 2400 

GCTACTGTAA TAATAATCAA AACAATAATG CTAATAACAA GTAAAAGCCA CCATTTAAGC 2460

ATTAATGCAA TAAAAATGAA CACGATAGAC ACACTTACTA ATATTAATGA TATGACTTTA 2520

AATTGCTGAA CACGTTGCTT GGAGATGACT TTCAACTGTT TGTTTGATAG ACGCGTATTT 2580

TTTATACTGA TTCCCAGTAT ATTTTCTAJT ATTTGAACCA ATACGATACT TATTGCAAAT 2640

ATAATAATTG GTAAAACATC ATAGCTCCCT ATAGTTAATG TATAAATTAC AAATCCAATG 2700

TAAAGTAACC CTGAGACAAA GGATAAAAAG TATGCGACGT ATTTGTTAAA CTTAATGATA 2760






TGCTTTTTAA CGTTTTGATG TGTAAACCAT ACATTCGAAA CGATCGCAAC TGOTCAAAT 2820

AATGTGAATA CTATATATAA TGGTAATTTT TGTTCAGGAA AAACAGTCGC TATTCCAAAA 2880

GCTAATGCTA AAATCAAAAA TAATATAGCT CTAGATACTA TTAATGCCAT AATAACAACC 2940

CCTTTGTTTA ATATCGAGTT TGCAAATTTA CGTTTATCAG CGTTTCTATG ATCAGTACTT 3000

CTACGGGTAG CGTTTCTATG TAATTTACAT CATCTTAACA TATAAATACT TCGCTATTTA 3060 

ATTGAAAACA TATCCTATTA TTCTTTGTCC GTTCTGACGT TTAATATCTA GCCTTAGGCA 3120 

TTTCACTTGT TAATGAATTT AACTTTCTTC CACTAACCGT CCCTAAACCC AATCCCGCAA 3180

CAGTTTTTAA CTTTTTCGTT GTTGTCCTGA CATCCTATT AAGAAAGTTT ATTCTGCTTA 3240 

AAACTTATAA TCCACACCCT GAGCAAACGC TCCTTATGAC AGAGTATTAA AATAAGCCGA 3300 

TAAAGATACA CACCTTTACC GACTATTTAA AATACACTTC ACCAATTCAT TTTAATTTAA 3360

TGGATTGAAG TAACTAAATT AATATTATGT TGTTCAATTA AAAGCTTCAT ACAAACCTAA 3420

TCTATTTGCA CTCCACCGCT AACACCGAAC ACTTGTCCGG TTGTATAACT TGATTCTTCT 3480

GACGCTAATA GCACATAAGT TCCACATAAC TCAACAGGTT GACCTGCACG ACCTAAAGGT 3540

GTTTTTTGAC CAAATGTTGG GATTTTACTT TGAGGTTGTC CACCAGAAAT TTGTAATGGT 3600 

GACCAGAATG GACCAGGCGC TACACAGTTC ACTCTAATTC CTTTTGGTCC TAATTCTTCT 3660

GAAAAACTTT TAGTTAATGA AATAATTGCT GCTTTTGAAG CGGCATAATC ATGAAGAATA 3720 

GGACTAGGAT TATAACCTTG TACAGATGAT GTCGTTGTAA TTGACGCACC CGGTTTTAAA 3780

TATTCCAATG CTTTTTGAAC TGTCCAAAAT AGCGGATAGA CATCGTTTC AAATGTTTCT 3840

GTAAATGCCT CAGTTGTAAA TCCATGAATA TCATCATGAT ACTGTTGATG TCCAGCAACT 3900

AAAGTAACAT TATCTAAGCC ACCTAATTGT TGATATGCTT GTTCAACAAG GTCATAGTTG 3960

AACTGTTCAT CTCTTATATC ACCAGGAATT AACACTGCCT TTTGACCACT TTCTTCAATC 4020

ACTTGGCGTA CTTCTTGTGC ATCTTGTTCT TCACTCGGAA GATAGTTAAT CGCTACATCT 4080

GCACCTTCTT TAGCATACGC AATTGCTGCT GCACGCCCTA TTGCTGAGTC ACCACCTGTG 4140

ACTAATATTT TATAGCCTTG TAAGCGTTGA TGACCTTGGT AAGACGTTTC GCCACAATCG 4200

GGTGCTGGCG TCATTTCAGA TTGTJAACCC GGTACCTCTT GTTCTTGTTT TTCATAATCC 4260

GTTGTTTTAA ATTTTGTTCT AGGATCTTGA GCTGCCATTT TTTTACATCT CCTTATTCGC 4320

TTAATGGTTA TTATTTACCC AATCTTCCTA GGAACTTAAT CATGATTACA CTAAAAATTA 4380

CTTTCTTCTT TATAAAAACA AGCTCGAATT ATTCATGCAA TAGTCTCTTT ACAAATTCAA 4440

CAAAATACTC AGGTACTTTT TCCAGAATCC TTTCATCCGG TTTATATTGA GGATGATGTA 4500

AATCATATTC ACTATGAGAA CCAATTAACG CAAATACACT TGGAAAATGT TGACTATAAC 4560

CTGAAAAATC TTCTCCAATC GTAAGCGGCT GTTCCATCAT TCCCACCTTA TATCCAACAT 4620
GTTGGGCTAC TGCAATTGCT TTATGCGTCA ATGCCTCATC ATTCATCACA GCGCCAGGTA 4680

AATGCGTATA ATTTAAATTA ATTTTCATAT TATATGCTTG AGCCAATCCG TCCGCAATAT 4740

CTTGTAATCG TGTTTCTACA AGCTTTCGTA CCACAGGATC AAAACTACGC ACTGTGCCTT 4800 

GTACATACGC ATGATCAGCA ATGACATTCC AffiTATTACC ACATGATATT TGTCCAATTG 4860

TTACTACCGC TTCATCAAAC GCAGATAGAT TTCTACTAAC TATGGATTGA ATACTATTAA 4920

TCAATTGCGC CAACACAATA ACTGGATCGT TGCATTGTTC TGGcTTTGCA GCATGACCAC 4980

CCACGCCTTT AATATGAAAC TCAAAACGAT CTACTGCTGA TGTAATTGCC CCTGTTTGA 5040

TTGCAAATGT ACCTACCGAA CGCGATGGGT CATTATGAAA ACCCAATACT GCTTGTACAT 5100

CTTTTAATGC ATGTGTTTCA ATAATTTTAA AAGCGCCATG TCCTAGTTCT TCTGCTGATT 5160 

GAAAAATGAA TTTAACACGC CCAGTAAGAG TGCCCTCAAT TTCTTTTAAT TTTACAGCTG 5220 

TAGCCAAAAT ACTAGCCATG TGAATATCAT GACCACACGC ATGCATAACA CCTTCATTTT 5280

CAGATTTAAA ATCCTGTTCA ACAAGCTCTT GAATCGGTAA AGCATCAATG TCAGCCCGAA 5340

CAGCTATACA ACTCAGACCT TGTCCCACTT CAGCAACAAG CCCAGTCGCA AGTGGTAAGT 5400

CTAATATTCT AATATGATGT TCTGTTAAAA TATCTTTAAT TTTTTGTGTA GTCTTAAATT 5460

CTTTATCGGA TAGTTCTGGA AATTGATGAA AATACCTTCT CCAGGTAACA GCTTGATCTT 5520 

TTAATCCCAT CGGTCATTCC CCTTCCTTAA GTCAATGATA TGTTGTCTAC CCTACGATGA 5580 

TCATCTTTGA CTATTAAACG ATGATTTCAC AACAATGTAC TCTTGTTAAT TGCTTTCGTT 5640

AATGATAGAC AGTTGTTTAA TAATATCGTA ACACTGTTGT CAAACTATTC TAACTTTTAT 5700

AATTGAGACT CTATACAAAA ACGTGTTCTC GAATATACTT GTTTTTACAA ACCACAAAAA 5760

GCTCTAAACA TTAGTTTAAA CCAATGCTTA GAGCTTTCTA ATTATTTTAT GCTTTAAAAG 5820 

ATACTGTGTT ATCTACGATG ACCTTACCGT CTTTAATAAC TTTTTCTGCG TGATTGATAC 5880

CAAAATGATA TGGAATATAT TCATGATTTG GTGCATCCCA AATTACTAAA TTAGCCTTAT 5940

CACCTGTGTT AATTGTACCC GCGTTAATGT CTATTGCTTT AGCAGCATTG ACCGTAACAG 6000 

CATTCCAAAC TTCATTAGGT GATAGCTTTA ATTTCAAGGC TGCAATGCC ATAACAAGTT 6060

GTAAGTTGTT TGTGACACTA CTACCAGGGT TATAATCAGT TGCTAATGCA ATCGCACCGT 6120 

TATTGTCAAG CATGCCTCTT GCATCTGCAT AATCTTCTTT ACCTAAATAG AACGTCGTTG 6180 

CAGGTAAGAG GACAGCTACA GTATCACTAT TTCGCAACTT TTCTTTTCCT TTATCACTAG 6240

AAGCTACTAA GTGGTCTGCT GATATTGCTT GTTCATCAAT TGCTAATTCC AGTCCGCCTA 6300 

ACGGATCAAT TTCATCCGCA TGTATTTTCA CTTTAAAACC TGCTTCTTTG GCTTTTTGCA 6360 

TATAATGTTG CGATTGTTCT ATTGTAAATA CACCTGTTTC ACAGAAAATA TCCGCAAAGT 6420 
CTGCATATTG TTTTACTTCC GGAAGTAACE CAATCATTTC TTCTAAAAAT GCCTCATTTG 6480

AACTTGCCTC TTTAGGTACA GCATGAGGCC CTAGGAAAGT ATGTTTCATG TCTAAATCAT 6540

ATTTCTCAGC TAAACGATTA GACACTTTCA ATTGCTTCAG TTCATTTTCT CTATCTAATC 6600

CATAACCACT CTTACTTTCA ACTGCAAGCA CGCCGTGTTT AATCATAGTA AGCftATCAT 6660

GCTCTGCTTT TTTAAACAAG TCATCTTCGG ATGTTTCTCT AGTAGCATTA ACGGTAGATA 6720

ATATGCCACC ACCCATTTCT AATATTTCAA GGTAAGACTT ACCTTGACGT TTTAATGACA 6780

TCTCATGTTC TCGAGATCCA CCAAATGTTA AATGGGTATG TGCATCTACT AATGCTGGGG 6840

ACACTACCTT CCCACTAGCA TCAATCGTCT CAGTCGCATC GTAGTCATCT GTATGTGTTC 6900

CAGCATATAC AATTTTGCCA TCTTTAATGA CAACTGTACC ATTTTTCACA ACATTTAATT 6960 

CATCTAATTC CTTACCCTTC AAAGGTTTAT CTGTTGATCT CGGTAAAATT AATTCTGCTA 7020

TATGATTAAT TATTAAATCA TTCATTACTT ATCACCGCT TTATCAATCA TTGGAATATG 7080

AATACCTTTT TCTTTAGCTG TTTGAATAGC GATGTCATAG CCAGCATCAA CATGTCGGGC 7140

AACACCCATA CCTGGGTCAG TCGTCAATAC ACGTTCCAAT CTTCTTTCAG CACGCTCTGA 7200 

TCCATCTGCT ACAACAACCA TACCCGCATG AAGTGAATAT CCCATGCCAA CACCGCCACC 7260

GTGATGGAAT GAAATCCATG AACCACCTGC AGCTGTGTTA ATGAGTGCAT TCAATACAGC 7320

CCAATCACCA ACCGCGTCAC TACCATCTTT CATACTTTCT GTTTCACGGT TAGGACTAGC 7380

AACTGAACCA GCATCTAAAT GGTCTCGTCC AATAACAATT GGTGCTGAAA TTTCACCGTC 7440

ACGTACAAGA CGATTTAAAG CTAAGCCCAT TTTCGCTCTT TCTCCATAGC CTAACCAAGC 7500

AATACGTGAT GGTAGTCCTT GATATGAAAT TTTTTCTTCA GCTAAATCAA GCCATCTTAA 7560

TAACTTTTCA TTTTCTGGGA AAAGTTTGCG CATTTCTTCA TCCGCACGCT CGATATCTTT 7620

TGGATCACCA CTCAACGCAG CAAAGCGGAA TGGCCCTTTA CCTCACAGA ATAATGGTCT 7680

AATGTAAGCT GGTACAAAGC CTGGGAAGTC AAAAGCATTT TTCACTCCGT TATTGAAGGC 7740

TACTTGACGA ATATTGTTAC CATAATCAAA TGCTACAGCG CCACGTTTTT GGAATTCAAG 7800

CATTAATTCA ACATGCTTTG CCATTGAAGC TTGTGACAGT TCAACATATT TTTTCGGATC 7860

TTTTTCACGC AATACTTTCG CTTCTTCTAC AGAGTATCCT TGTGGCACAT ATCCATTTAG 7920

CGGATCATGT GCACTTGTTT GGTCAGTAAT AATGTCAATT TTAAATCCTT TTTCTAGAAT 7980

CGCTTGATGG ATGTCTACAG CATTTCCAAC TAACCCGATT GATAATCCTT CTCCACGTTC 8040

TTTCGCCTCT TCTGCTAATT TTAAECTTC ATCTAAATCA GCTGTTTTAA CATCACAGTA 8100

TTTCGTATCA ATTCGCTTAT CAACACGTGT TTCATCAACA TCCACGCAAA TTGCTACCCC 8160 

ATGATTCATA GTAATTGCTA ACGGTTGCGC ACCACCCATA CCACCTAAAC CTGCTGTCAG 8220

TGTAACAGTG CCTGCTAAAT CTCCATTAAA GTGTTGATTA CCTAGCTCGG CAAATGTCTC 8280
ATAAGTACCT TGCACAATAC CTTGAGAACC AATATATATC CAACTACCGG CTGTCATCTG 8340

TCCATACATG ATTAAACCTT TTTTATCTAA TTCATTAAAA TGATCCCAGT TTGCCCATTC 8400

AGGCACTAAT ACTGAATTTG AAATTAATAC ACGTGGCGCT TCTTCATGTG TTTTAAATAC 8460

AGCAACTGGC TTTCCTGATT GTACTAACAT TGTCTCATCT GATTCTAATT CTCGTAACGT 8520 

TTTCTCTATT GCTTCAAAAG CTTCCCAATT ACGTGCTGCT TTTCCAATAC CACCATAAAC 8580

AACTAAATCT TCTGGTCTTT CAGCAACTTC TGGGTCTAAA TTGTTGTATA ACATTCTAAG 8640

TACTGCTTCT TGTTCCCAAC CTTTACACTC A?T AC T C AAA CCTTTTTTTG CTTGAATTTT 8700 

TCTCATAAAA TTCGCTCCTG TTCTTTTAAG AAGTTAATTC CACTAAATTT AAAACGCTTA 8760

CATTATTATC TTCAATATTC ATTATAGTAT GTTAAAATAT AGCCAACAAA TATAAATAAA 8820

CTAATTATCC ATAGCTTGAA TCTATAAATA AAAGGAGCAA AACACATGAA AATTATTATG 8880

TTAGAATACT TCTTGGCTAT CGTGAAATAT AATAGTTTTA CTAAAGCTGC ACAATTTTTA 8940

CATATTAGCC AGCCATCTTT AACTGCTACG ATTAAAAAAA TGGAAGCAGA TTTAGGTTAT 9000 

GACTTATTTA CACGTTCAAC AAAAGACATC AAGATTACCG AAAAAGGAAT ACAGTTTTAT 9060

CGTTATGCGA GCGAATTAGT TCAACAATAT CGATCCACGA TGGAAAAAAT GTATGATTTA 9120 

AGCGTTACAT CAGAACCAAG GATAAAAATT GGGACTCTTG AATCTACGAA TCAATGGATT 9180 

GCGAATTTAA TTCGAAAGCA CCATTCCGAC TACCCTGAAC AGCAATATCG TTTATATGAA 9240

ATACATGATA AACATCAATC TATAGAGCAA TTACTGAATT TTAATATTCA TTTAGCTATA 9300

ACAAATGAAA AAATAACCCA CGAAGATATA AGATCCATTC CTTTATATGA GGAATCTTAC 9360 

ATTTTATTAG CACCCAAGGA AACATTTAAA AATCAAAATT GGGTAGATGT TGAAAATTTG 9420

CCACTCATAT TACCAAACAA AAATTCTCAA GTGCGCAAAC ACTTAGATGA CTATTTTAAT 9480

AGAAGAAATA TTCGTCCAAA TGTCGTTGTA GAAACAGATC GATTCGAATC AGCAGTTGGA 9540

TTTGTTCATC TCGGCTTAGG TTACGCTATC ATTCCGAGAT TTTATTACCA ATCATTTCAC 9600 

ACGTCTAATT TAGAATATAA AAAAATTCGT CCAAACTTAG GCCGAAAAAT TTATATCAAT 9660

TACCATAAAA AACGCAAACA CTCCGAACAA GTACATACAT TCGTACAACA ATGCCAAGAT 9720

TATTTATATG GACTTTTAGA GGCTCTTTAA CTTAAGTTAT TAGAGCCTCT TATGCAGTTG 9780 

CTCAGTCAAC TGTATACCTT TTGCCTTTAA CTTAAGTTAT TAGAGCCTCT TATGCAGTTG 9840

CTCAGTCAAC TGTATACCTT TTGCCTTTAA CTTAAGTTAT TAGAGCCCT TATGCAGTTG 9900

CTCAGTCAAC TGTATACCTT TTTCCTTTAA CTTAAGTTAT TAGAGCCTCT TATGCAGTTG 9960 

CTCAGTCAAC TGTATACCTT TTGCCTTTAA CTTAAGTTAT TAGTGCCTCT TATGTAGTTG 10020 

CGTAGTCAaC TGTaTACCTT TTGCCTTTAA CTTAAGTTAT TAGAGCCTCT TATGCAGTTG 10080 
CGCAGATCAT CGTATAAAAA TTAATGACGT CATTTCAAAA ATCGATACAA AAATAATTTA 10140

TTATAAAAAT TCTAAGAAAG AAGTGAAGCA GATGTTAAAA TCTATTAATC ATATATGCTT 10200

TTCAGTCAGA AATTTAAACG ATTCAATACA TTTTTATAGA GATATTTTAC TTGGGAAATT 10260 

GCTATTGACT GGTAAAAAAA CTGCTTATTT TGAGCTTGCA GGCCTATGGA TTGCTTTAAA 10320 

TGAAGAAAAA GATATACCAC GTAATGAAAT TCACTTTTCA TATACACATA TAGCTTTCAC 10380

TATAGATGAC AGCGAATTTA AATATTGGCA TCAGAGGTTA AAAGATAATA ACGTGAATAT 10440

TTTAGAAGGA AGAGTTAGAG ATATTAGAGA TAGACAATCA ATTTACTTTA CCGaCCTGA 10500

TGGTCATAAG CTAGAATTAC ATACTGGCAC ACTTGAGAAC AGATTAAATT ATTATAAAGA 10560

GGCTAAACCA CATATGACAT TTTACAAATA AGGTGTCATT ATAAAAAGGC CTCTTGAACT 10620

CCGTTAAAAT TTTAATTAAT TATTATATAA TAAGAGAACT TTTCAAACAA TACAGTTGTT 10680

ATTTTTGCTA TTTCAACAAA CATAAATAAG CAGTAAGATG ACTACAACTT AAGAGTCTTC 10740

TTACTGCAAT TATTTTTCAA ATATATCAAC GTTAATATAA CTTCTATTAA GAAATACTCA 10800

CATTCTGCCC TGCAATGCAA ATCTCGTCAC ATATAAATAT TTTTAATTAT TTTAAAAAAT 10860 

GATGCACTAA ATTAGCAACG AGCTTAGCAG TTCTATGTC AGCGTCATAT GTTGGATTCA 10920

TCTCAGCAAT ACTAACTGAA GACACCTTAT CACTTGGAAT AATACGTTTT GCTAATTCAA 10980

GAACAGTATG TGGATACAAA CCTAACACTG CCGGCGCACT TACCCCAGGC GCAAACGCAC 11040 

TATCAATGAC ATCCATACAA ATCGTAAACA TAATGACATC ATGTTCATGT ACAAAACGTT 11100

CAATCATATC TTTAATTGTT GGTGATACGT GACTCAATAA TTCATCTGCA AAGACATAAT 11160 

CAATCTTTTT CTCTTTAGCA TAATCAAATA AACTTTGCGT ATTACCACCT TGAGCAATAC 11220 

CAAGCACTAA ATAATCTGTG TTTTCATCTT CTTCTAAAAT TTGTCTAAAG CTCGTTCCAG 11280 

ATGTAGATTG TTGTTCAGCA CGTGTATCAA AATGCGCATC AATATTTATC ACACCAATAG 11340

ATTGTGTTGG ATAGACTTTA CGTGTTGCTA AATATTGAGC ATACGCAATA TCATGTCCAC 11400

CACCTAATAA AAATGTTTGT CTATGATTAG CAATTGACTT CGCTGCAAGC ATAGCAAATT 11460

CTTTTTGAGT ATCAATTAAT TCCTCATGAT CATGATAAAC ATTCCGTAA TCGACTAAAG 11520 

TTcACATTGA TTCAAATCCG GCAAACCTGC AAATGCTTGT TTAATCGCAT CTGGTCCTTC 11580 

TTTTGCACCA ATGCGCCCCT TGTTTAAAGC AACACCTTTG TCAACAGCAT AGCCTAATAT 11640

ACCGACCCCT GATGGCATAC TACTCTTTTC CAGCTTAGAC AAATCTTCAA ATGTTACTGT 11700 

TTGAAAATGT CTAAATTTTT TCGGGTCTGT TTCACTATCT AACCTTCCAG TCCATAAATT 11760

TGGTTCACCT TGCTTGTACA CAGCATTTCC CCCTCTTATT TATGTGGCTT ATTAACAATT 11820 

AAAGTATAAC GTATAGGAAA TTTTGAATTC AATTCATAGT TAAATCCGTA TCTTAAAAAT 11880

ACTTATCTAC ATTACTTTTA CCCCTATTTT CTATGTAATA ACGAATACTT AGCTGATTTA 11940 
TGTTAATAAA ATACGTCAAG ACTATTACAT TTTCATTAAT ATTGACATAG ACAATTTATC 12000

TCTCGGCTTG TAATATGTAT AATTGTTACT AAAAGATATT TTGCTTGTTA CCTAATGGAG 12060 

GTTACATATA ATGAAGAACA ATAAAATTTC TGGTTTTCAA TGGGCAATGA CGATTTTCGT 12120

CTTCTTTGTC ATTACAATGG CGTTATCCAT TATGCTCAGA GATTTCCAGT CTATAATTGG 12180 

TGTCAAACAC TTTATATTTG AAGTTACAGA TCTAGCACCA TTAATTGCTG CAATCATTTG 12240

TATACTCGTT TTCAAATATA AAAAGGTCCA ACTTGCAGGT TTAAAATTCT CAATCAGCCT 12300

GAAAGTAATT GAACGTCTAT TGCTAGCTTT AATTTTACCT TTAATTATTC TAATTATTGG 12360 

TATGTACAGC TTTAATACAT TTGCAGATAG CTTTATTTTA TTACAATCAA CAGGCTTATC 12420

AGTACCTATT ACACACATTC TGATTGGACA TATTCTGATG GCGTTCGTAG TAGAATTCGG 12480

ATTCCGTTCA TACTTACAAA ATATTGTTGA AA:GAAAATG AACACGTTCT TTGCAAGTAT 12540

TGTTGTTGGT TTGATGTATT CAGTTTTCTC AGCAAATACA ACTTATGGTA CAGAATTTGC 12600 

TGCTTATAAC TTCCTTTATA CATTCTCATT CTCTATGATT CTTGGTGAAT TAATTAGAGC 12660 

GACTAAAGGA CGTACAATTT ATATTGCAAC GACATTCCAT GCTTCAATGA CATTCGGST 12720 

TATTTTCTTG TTTAGCGAAG AAATCGGCGA TCTATTTTCA ATCAAAGTCA TCGCCATTTC 12780 

AACAGCAATC GTTGCAGTAG GATACATTGG TTTAAGCTTA ATTATCCGAG GTATTGCATA 12840

TTTAACAACA AGACGAAACC TTGAAGAACT TGAGCCTAAT AATTATTTAG ACCATGTCAA 12900

TGACGATGAA GAAACTAATC ATACTGAGGC TGAAAAATCT TCTTCAAATA TTAAAGATGC 12960

TGAAAAAACA GGTGTAGCTA CTGCATCAAC GGTTGGTGTT GCTAAAAATG ATACTGAAAA 13020

TACAGTGGCT GACGAACCAA GCATTCATGA AGGTACTGAA AAAACAGAAC CTCAACATCA 13080

CATAGGTAAT CAAACTGAAT CTAATCATGA TGAAGATCAt GACATCACTT CGGAGTCAGT 13140

AGAATCAGCm GaATCAGTTA AACAAGCACC ACmAAGTGAC gATTTaACAA ACGATTCAAA 13200

TGAAGATGAA ATAGAGCAAT CATTAnAAGA ACCTGCGACT TATAAAGAAG ACAGACGTnC 13260

ATCAGTTGTA ATTGATGCAG AAAAACATAT CGAAAAAGCT GAAGAnCAAT CTTCAGATAA 13320

A 13321 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
ATGTGTTGTA AACTTTTATG TTGAAAAAGC TACTTATCTC AATGAAAACA AGTAGCATTT 60

AATAAATTAA TTAGTATACA GCTAGTTTTT CTAATTGTTC TTTAACTTGA ATTAAGTTTG 120

ACCGTATTAG AGAGGCAGAT TGATCCATCG TTTGAATTGC TTGTCCTTCA TTTTCGTTCA 180

AGCCATTACA AACAACTTCA AACTGTTGTG CCATTTGATC AAGACGCGCA TGAGCTTGTG 240

TGTTTAAAAT AAACATATCG TCATAATGTG ATGGCGAATA GATAATTCGT CGTTGTATAC 300 

AAACGTATAA AAACCTTGTC ATATCAACGG TTTTGGCATT TTTAAECTC TGTGTTTTCC 360

ACGCATGTTT GCCCTTATTT AAATAATTTG CCCTTTTTTC GCCCCGAAAA AAAAACACAA 420 

AAAAATAACC ACACTCCTAA ATTAATAGGT GGTGTGGTTT TGTTGATTGT AGGGGTATAA 480

AAATAACCGC ATTATTAAAG ATACGGTTAC TCTGTTATCT GTAAATATAA TAGTAGTTTA 540

AAAATTAGTC GTTATTGTTA GTTCTTTTTT TATCTGATTC TTTTGTTTCT CCTAATTCGC 600 

AAACAGGACT CCACATAAAA ATCAACTCCT TTATATACCA TAATGATACT ATATTTTCTA 660

GTTTATTTCA ATTTTTCAGT TTTTAAAAAT GAGTTTCTGT TTTTATTTAT ACGCTTTTCT 720 

GTTTTCTTTT TAAATTTTAT CTTTTTGTfA TTCCATTCAT TGTAAAATTC TATTAAATTA 780 

ACATAAAATT TTTCATGCCC TATTTTATTT GTTGATGAGA TATCAATGTA AAGACTCAAT 840

ATTGTTTTTA AATAGATTTG ATGCAACGAC TGATAAACCG TATTACTATC TGCTATGTTA 900

TTGGTAAAAT GCATAGAAAA ATATTCTAAT TTATTCATGC AATATATATG GGTTCATTA 960

TACTTCTTAA TGAGTGTATT TATACCTTGC AATACGTCAT TACTTTTAAT AACAATTTCT 1020 

TTTTCACCTG TCGAAAAAGT CCACTGTTTA TCTCCTATAT TTTCTTTAAT TGTTTTCTTG 1080 

TTGTCAAATT CTAAAATTAT AGCCCGTAAA CACTCTTCTT TATAATTCTC GTTCTTGAAA 1140

GTACGAAGCA AAATTTTTAT AAATTCGGTA TTGGTGACTT TTTTATAAGT GTGATATTTT 1200 

GCAATCTCTT TATCAGTAAA GACTGTTCTT AGTTCGTGAT TATCAAAACT TAAATTCATC 1260

TTATTCTCTA ATTCATTAAT TTTATCTTGC AAACCAACAT TTTCTAAAAT TTTCTTGTTT 1320 

ATCTCCCCTA TATCAAAACT CCTTTTCGAA ATTAATTTG AAAACTCGTC TGCCATTTCA 1380 

ACAGCCTTTT CTTTCCTTTT ATACCTTTTG TTAAATTTAT GAACCACCGT TGCAGCATAA 1440

TACGATATCC CACCAGATAA AATAGATGaT ATTATCGGTA TGTATATATC ACCTTTCATA 1500 

TTTCCACCTC TTTTAACACA ATTAAGTATT ATGATACACA ACTTGCGCAA AAAGATGTAG 1560

ACAGAACATA ATGGCGAACA AAAACAACCA CCCAGTAACT AGTATGGGTG GCGTAgACTA 1620

TAACAACTCT ATGTTATCAA GATATATGTA TCGAGTGATG GCAAGGAAGA AGTCTCCTGC 1680

GGGACCAACA GTCAGATATA TGGCCTCTGC CGGGCTATAT AGTTCACTCC TACTATATAA 1740

AAGTAAGTAT AACATAAAAA GCACCCCGTA AACTGTTATA CGGGAATGCT AAAGTCATAT 1800

ATACTACGGG GAGTAGTATG AAAACTATGC TCTCTATCGT AAGAAAAAAC ACCCAGTGAC 1860
ATGCTTGGGT GAACAAGGAT AGATGTAAAT AGTTGATGCA TGTGTAcACA TCATAACAAA 1920

AAACTAGCCC GAAGcTAGCT ATAACATAAA AAAATAGGCA AGACCGAAG TACCTGCCAG 1980

TTACGCACAT TTAAATCTTG AGAGTAATGT TAAAAAGTGT ATAGGAATAT TAACATCCAT 2040

CCAAATAGTT ATTTAATAAC TGTAAGATTC CCTATAATTA ATGTAGCaAA ATTTTTATTC 2100 

TAAGTAAATA CTAAATCGTG CTAAACTTAC CAAAACTACT TATTCTATTA CCTGCCTTGT 2160 

CTACCTCTCC TGTCGCTATA TAACGACGTT GTCCACTATT AGCAATATAA GTAATCCATC 2220 

TATAGCCATT GATGCAATAT GCGCCGTCAT ATTTAATTGT TGCGTTATTA GGTAATACAC 2280

CTGTAATTCT TGAATTAGTT GAATAGCCGT CCCTTACGTT ATTACCTTTA ACATTGGCAA 2340

CTGTGTAATA ACCAGTCTCT TTTTTATACG GTACATTGTT TTTATCAAGT GTATAACCTG 2400 

CTGGCACTGG TGGATTTTTT TGGTTTTTAG CTGATGTTTT AACATTACCA GCTACCAAAC 2460

CACCTATAGG CTTACCATGA ATCGCACCGG CTATTAATTT AGAATACAAG TCATAGTTTT 2520 

TCTTAATCCA ATCCATATCA TTTTTATTAG TAATAAAACC TAATTCAGAT AAACGATAGT 2580

TTATATTTAT TTCTGCTGAT ACATTAACGT TTAGTAAATC ATTACGAGGT GTTACACCTC 2640

TTATTTGTCC TAAGTTATTT TTAATAACAT CTTGTATACT TTTATCAATA GTATCTGCAT 2700 

TGAATTGACT TGAAATAATA ACATGCCCAC CACTTGCACT TTCTCCTGCT GCGTCTAAAT 2760

GAATCTCTAG AACAATGTCA TACCCATGTG ATTTAACCCA ATATAAGCCA TAATCTTTAT 2820

TATTTCCTAC ATTAACACCG TAAGCAGTAT CTTGATACAT ATCTTGTGAT TGACTTGAGC 2880

CACCATATAA TGCAACTTCG TGACCTGCAT GTCTTAAATA CTTAGCGATA TTTGGTGTTA 2940

TATATTTACG GATAAAATCA CGTTCATTTG TTCCGTTTCC GACTGCTCCA GGATCGTTAT 3000 

AACCATGACC GGCTACAAGC ATAATTTTTT TAGGTTTAAT TACTGCTTGC TTTTTGGCAG 3060

TTGCTTGCTT AATAACGCTT TTAGCTTTAT CTCCAACACT TACTTTATCT GGGAAATTTA 3120 

ATCTAATAAA ATACATTGGG TCATCGTAAT AATGAACATG TCTTGTAACG GTTTCGGAC 3180 

CCCAACCAGG TTGCGCAACG CCATTTGTCC AACCTTTACC ATTCCAATTT TGGCCAAACG 3240

ATGTGAAAGT GTTTAGATTA GCGCTCTCAA CAATTTCAAC ATGTCCaGct CCGCCACCAT 3300 

ACTTTGACGG GAAAACGACA ATGTCCAACT TTTGCGGTAA AAAGCTATCA TAGTTTTTAA 3360 

TTATTTGCCC GTATTTTTCA ATCCTTGCTT TATTATCAAA TGGAATATTA TAAGCGTATA 3420 

AACCTTGTAA CcTTTCGCCT GTTGCTATCA TAAAAAACAT ATTTGCGTAA TCGTAACACT 3480

GAAATCCATA AAACAAATCA GGATTGAACT GCTTCCCTAA TGAATTATCA AACCATTTTT 3540

CTGCTTGGTT TTTTGTTATC AACATTGGTC AACACCTAC CTAAATCATT TGTGTCGTTC 3600 

ATATTCGTAG GTGTCATTAC TTCTTTAATT GGCGCTTGCC CTGTTGCTTT TCTATACTTG 3660
TTTTCAGCTT TATATTTCTT TAGCTTTTGA TTTGCCCATT TACCTTCTTG AGATGTTGGA 3720

TTATCTTTAT ATGTAGTATA TAAAGCAACA ACTGTTAAGA TAATCGATGA AACACTTTCT 3780
TCATCTACTG GTATCGGACT TATACCTTTA TTCGCTAAAA ACTGATTGAC TAATGCTAAG 3840

ATCAATACGA TGTATCTTGT TATTACTTTT GCATCCATTT GTTTGCTCCT TTTATCCAAA 3900 
ATAAAAAGCC AGTGCCGAAG CACTGACTCT TAACTATTAC TTACACTTAC TAAACCAGAA 3960

ACACGACCAA AAGCTATATC CTAAAATTCC CTTAAGCATG GTAATCACCT CCTTTAAATG 4020 
CCAAAAATAG TTTTTAACAA GGCTATAACA AATGTACTTA GAATCGTCCC TATTAATCCT 4080

AGAATCCACA TCTTGATGTC TCTAATATTT TTAGCATTTT TCTCTTTATT TTTTTCATCT 4140

TCTTCTTTGT CACGCCTTAG TTCTTCGAAA TTTCTATCTA ACTTGCCATA AATTTTTTCT 4200 
TGCGTTCTCA GACTGTCTTC TATTCTGTCG AATTTTTCAA ACATAGTCTT ATCATTTTCT 4260

TCTAATCGCG TTAAACGCCA ATCTTGTTCG TGTCGTTTGG TAAATCCAAA CATTACACCA 4320

CCCACTTTAT TCAAATTAAA AAGCCATAAG ATTATAACCT ATGACTCTAG ATTTTCTGGA 4380 
TACTTTTCTC CTGTAATAAT TGCATATTCC TCTTTATCTA TAACTTCCAT ATCTACATAC 4440 
CACGCTATAT CTTCTTTACT ATATTCTTTC AATTGATACC ATGTTTTAAT ATCTTCGAAT 4500

GTTGGTGAAA TTAATTTAAG CATTTTCAGT CTCTCCTTTA ACCTCTTCTA ATTTTTTATT 4560

AAGTGTCACA AGTTGTTTTG CCATTAGEC ATTTTGCTTA TTAACTTGCA TCGATAACTT 4620
TGTACTTTGA ACAACTTGTT TCTGCATACT AGCAACCATT TTTCGTAAGA TGTCATCAGA 4680

AGCGACTGTG TTTTGTTCTT CACTGTCAAT CTGTTGATGC AAGTCATCTT TTTCTTCTGA 4740

ATAATCTTCG TTAAAAACTA TTTCCCCATT TGAATATTTA AAGGCTTTAG GTEAAAAAC 4800
TTGAGAGAAA TTTTCTGGTA AATTTTCAAT ATCAATACCT TCTTCAAAGC CACCAATGAT 4860

AGCGTATGAA ATTATCTCAT TACGCTTGTT AACTAATATT TGCATTATTT TCTCACTCCT 4920

ATAATTTTGT TAATTGTCCC TCTATTTGCG TTCGCACCAG AGCCTCTTTG ACTTCCTAAG 4980

TCGAAATAGA CATCGTTTGA TATAGTTAAA GATGTACGAC TAGATTTAGT TAATCCAAAC 5040
TCATAAACAC CTCCACCATT TCCATCACCA TCTGGAAGAT TTGAGGGATT CAATGAAATC 5100
TTTCCTCCTC CAAAAGGACT GCCAAACTCT GTAAAGTCAC CACCTGGAAA AGTCCCATAA 5160 

AAAATTAATA AAATAAATTG GTCTAAACTC TCATTAAGT ACAATGTAGA GCCCACACCA 5220 
TTTGCTGTTC CATCAAAAAT AACCGAATAC CTTTTATTAA ACTTGTCATC TGCGTATAAT 5280
TTAGCGTTAC TTTCGGCCAT ATTAGCTTTT GATTGGGCAC TTTGAACAGT TTCAAAAGGT 5340

GTATTGTAAT CATTAATAGC TAATTCTGAC CACTCAGACC ATGAACCCGC TTCTTTTCTT 5400
TTAACAAATA CTTTATTTGT ACCGTTCGGT CGATAAGTCA TACGCTTGTA ATCTGAAGTT 5460

ACTACTAAAT ATTCGACAGT ACCGTTAGTA CTAACACCTC TTGGATAATT TATAGCTTGC 5520
GAAACATAAA TAAATTGGGT TGAATCACCT ATTCTTTGTT CTGGATTATT AAAATCAAAT 5580

CCAGTAATCT GCATTATCTT ACCATCATCT TTAGTAATCT TAGCTTTTTG CCAATTTGAA 5640

GTAGAACCAC TTGTGACTAA ACCACCACTA TTCACTGACT GCTTGAAGGC TTCATGTTTC 5700 

TCATCCATAT ATCGCTTTTG CTCATCGAAT GTTCTTGAAT ATGCTTGCGC TTTATTTTCC 5760 

AAATCAGATA TATGGCTATT AGCAAGTTGC TTTAATTCAT CETACTTGA AGATTTTGCT 5820

ATTTGAATAT CTGATAGACC TTTTTCTTTA GCTTTTTCAA TCAGACTCGC ATAATCTTCA 5880

CCATTTTTTA TAGCCTCGTC CATTGCTTTC GCACGATCCA TAATAGTTTT TTCTAATTCC 5940

TGAAATTCAA CAATATAGTG TAATTTTGTT TCAGATGGAA TCATGCTAAA CAAACTTTTT 6000

TCAACGTTAA ATGTGATAGT TCTCTCGACA ACTACCACGT CTGAATTACC TAATTCTGCA 6060

ACCGAAACTT GAGCTTGATA ACTTCCATCT CGTTTAATTA CATCATTAGG TAATTGAAAT 6120 

TTTAAAATAC CTTTAAATGG ATCTAATATT TCTAGTGGAG CAACTACCAT GACTCCTTTA 6180

CCTCGAATCG CTATTCGTGC kTTCATATTT tCTTCACTCA ATAATAACGG TTGATTATTT 6240

TTAGTGATAT TAAAAAGAAG AACAGAAGAA TCACTCTCTC CTGTTCTAAA AGTTATATCT 6300

AGATTTGAAA TATTTCCATA ATGCGCTGTG TTTTCTAAAT TTATAGCTAC AGATTTCTCT 6360

AAATTACTCA TTAACTTATA ATTCTCCCTT CGTGTAAAGT CCATGGCCCT GAACTTGTTT 6420

TACTATCATA ATTTTTCAAT AGTATCTCAG CAGATGCTGT AACACTATTA CGAACTAGCC 6480

TATGAACAAA GCCACCTGTG TTTGAAGCTT CTACATATAA GTTCCAACCA GCTACCCCTT 6540

TACGTTCAGT TGGAAAATCT GTAAAACGTT TTGTATCATC CGTAGTTAAA TAAAACGACA 6600

TGCCTACTAT GTTAATATCT GACATTTTTG TGATGAATGA AGGTACTCTC TCCCATTTAC 6660 

CACTATTTTT AGGCACATAA TTCCAGTCCG AAATGTCTCC AGTTCTTCCA GAAAGCACCC 6720

TTTCAAAAGT CATCATATTC CTTGCATAAC TATTACGCGT CAATATCTGA ATTACATCAC 6780 

CGCCAGTTTG TGGTGGCTTA ACTTCCAAGA JCCAACCTGC ATCACGCCAT TCTCTTGGTA 6840

ATGGGAAATC ATCGATTTGA ACTGTATGAT CAGTGTATAA ATAGTAAAGA CCTGGCTCTG 6900

TTAACATCCC AAGATTCTTA AGTTTATCAG GCCTCATTGG TAAAGGTTTA ACTCTACCAC 6960 

CTGTGTCACT CaTGATAAAA GGAACGCCTC TTGAGTGAAG TATTTCTAAA ATACCTETT 7020

GCCCAATCAT GAAAATACGA TGTGTTCTAT TTCCaTCACC ACCGACAGTA ACACCTAGCA 7080

TCAAAGCTTT TTTACCACTA TCTTTGTCAT AGTATATTTG CAAACCTTtC TgCTTCCGCA 7140

AATTCGCCAG GAAATGAATC tAgTGTTCCA CCATAGTCAG CATTAACCTG ATACGCTTCT 7200

TCTCCTGTTT CTAAATCGAA AGCCGTTAAA TAGTTTCTAT TATTTGGATT ACTGTCTCCT 7260

GTATACCAAT ACAAGTATTT TTCATCAAAA GTCACACCCT GCATTGGTTG GGTTTCGTTT 7320
GTTAGTCTCA TAGGGATACT GATTTTATGC AAAACTTTAT CAATATTTTT ATCAACATCG 7380

TCTAAACTTC TTATCTCTAT ATAAnTCATT GAGTTTTCA GTTCCCACTG ACTTCTAGGT 7440

CTCTCaATTC TGTATAGAAT TTTATTTTCT TTTTCATTTA TGACAGGGGT GATGTAGGGT 7500

TTTTCTGGGT GTCCTGTAAA TACATCTTGC ATACCATACT TGCCATAGCT AATTTCCACA 7560 

TTAGGCGTAT ACTTGAAACG AACTAATGTA TTCTCATTAT TACCATTTAA GATAAAACTA 7620

TAAATCCATA ACTCATcATC AATATATCTA TAACCGTTAT GTGTACCATG ACCCCCACCT 7680

ACAATCAATG AGCTGTCTAT AAATTGACCA TTAGGTCTTA GACGACTTAG CATATAGCCA 7740

TTATTTCTAG CTTGTGTCAT GTATACTATG CCTGTTCTAT TATCAAACCA GAAGGATTGC 7800 

ATTACTGCAT TTGTAAgAGG TGCAAGTTCT GTCACAAATA AAAATTCTTG CTTATCAGGT 7860 

TCAAAACGAT ACTCGATATC AAGAATTTCT TGTTTGGTCT TATTTAATTC TCTTATAGTT 7920

TCCTCTTTAT TAATTTGAGT TTTGGTTTCC CAATCGTCTA AATGTTCTTT TAATGTGTCA 7980

AAGGTTTCGC CGTTTACATT AACTCGAGCT TGAACAATCT CATTAGACT GTTATTACGT 8040

GGTGCCACAA CAAGTGCGTT AATTTGACTT TGTAAAGATT TGTTTACTGC TGCTTGCGAT 8100 

CTACCATTAT AATAAATTTG CTCAGCGAAG TGTTGAATTG TTTTAGCTyT CTGATGCAAC 8160 

TTAAACTCTG TTGTCAAGCC AAGCGCAAAT TGCTCTATTC TTTGTAAGTT TTGTATTTCC 8220 

TTAGCTCTAT AATCTCGACC TGCTAAAGCT CCCAAATCCT TTATTAAATA CAAATTTTCC 8280

ATAATGCACC TTCCTTTCTA ATAAAATAGC ACTGTACCAA GTTTCCCACT ATCGTCAACT 8340 

GTTATTTTCC ACAATTTACC GTTTGGGGAT TTCTGTACAA TGCTATTTTG AATAATTgcC 8400

TGctTCGCCT ATTTTTAAAT TATCTAATfT ATTTkTATCA TTTACCGAAA TGATACCGTC 8460

TTGAGGCAAT CCATCAATAn CACTACTGCC TGCATAAGGT ATCCCATTTA TAGCTTTCCA 8520 

ATGTGTAGCT GGAAAGTACT GTTTATCGT 8549



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

AGGCGTGTAG TGACTTACGG nTAGGAAACT ATGTATCCGA ATGATTTATT GfiACCAAAA 60 

AGGCATTAAA GTCCATTGAA ATATCnGGTA GCGmGTTGGT ACgTGGACGT GGGGGCCCTA 120

GATGTATGAG TCAACCATTA TTCAGAGAGG ACATTTAACG TAATAAATTA TAGAmACGAG 180

GGTGAAAATA ATGACAGAAA TTCAAAAACC GTATGATTTA AAAGGCAGAT CATTATTAAA 240
AGAAAGTGAT TTTACCAAAG CAGAATTCGA AGGACTTATT GATTTTGCAA TTACATTAAA 300

AGAGTATAAG AAAAACGGTA TTAAGCATCA CTACTTATCT GGAAAAAATA TTGCACTACT 360

ATTCGAAAAG AATTCGACGA GAACGCGTGC TGCGTTTACA GTTGCGTCTA TTGATTTAGG 420

TGCGCATCCA GAATTTTTAG GAAAAAATGA TATTJAATTA GGCAAAAAAG AATCTGTAGA 480

GGATACTGCG AAAGTATTAG GTAGAATGTT CGATGGTATT GAATTCCGTG GTTTTTCACA 540

ACAAGCTGTT GAAGATTTAG CGAAGTTCTC TGGTGTACCG GTGTGGAATG GATTAACAGA 600 

CGATTGGCAT CCTACACAAA TGTTAGCTGA TTTTATGACA ATAAAAGAGA ATTTTGGATA 660 

TCTAGAAGGA ATAAACTTAA CTTACGTTGG AGATGGACGT AATAATATTG CGCATTCATT 720 

AATGGTAGCA GGTGCTATGT TAGGTGTTAA TGTAAGAATT TGTACACCTA AATCATTAAA 780

TCCAAAAGAG GCATATGTTG ATATTGcAAA rGAAAAaGCG AGTCAaTATG GTGGTyCAGT 840 

CATGATTACG GATAATATTG CAGArcCAGT TGAAAaTwCm GATGCTATAT ATmCAGATGT 900

TTGGGTATCG ATGGGTGAAG AAAGTGAATT TGAACAcGTA TTAATTTATT AAAAGACTAT 960

CAAGTGAATC AACAGATGTT TGATTTAACA GGTAAAGATT CAACGATATT CTTACATTGT 1020 

TTACCAGCAT TCCATGATAC AAATACACTT TATGGACAAG AATTTATGA AAAATATGGA 1080

TTAGCTGAAA TGGAAGTTAC AGACCAAATC TTTAGAAGTG AACATTCAAA AGTGTTTGAT 1140

CAAGCTGAAA ATAGAATGCA TACAATTAAG GCAGTAATGG CAGCAACATT GGGGAGTTAA 1200

TCACTAAATG GAACGATATG AATATGATGT GTCTGATGAT ATAAGTGTCA TGTACAGACA 1260

CCTCATATTG GTATTAAAGG AGAAATGAAT ATGAACGAAT CAGGAGATAA CAAACTCAGT 1320

AAATCTTCTT TAATTGGACT AGTTATAGGA TCCATGATTG GTGGCGGTGC GTTCAATATA 1380 

ATGTCTGATA TGGGCGGTAA AGCCGGTGGA TTAGCCATTA TTATTGGTTG GATTATTACA 1440

GCTATAGGAA TGATTTCATT AGCGTTCGTA TTTCAAAATT TAACCAATGA ACGGCCGGAG 1500 

CTAGACGGTG GTATTTATAG TTATGmTCAA GCAGGATTTG GCGATTTTGT AGGATTTATC 1560 

AGTGmTTGGG GATATTGGTT CTCAGCGTTT TTAGGCAATG TTGCCTATGC AACACTATTG 1620

ATGTCAGCAG TAGGTAACTT TTTCCCGATT TTTAAAGGAG GCAACACAT ACCAAGTGTT 1680 

ATTGTCGCCT CGTTACTACT CTGGGGTGTC CATTTCTTGA TTTTAAAAGG CGTTGAAACA 1740

GCAGCATTTA TCAATAGTAT TGTTACTGTT GCAAAGTTAA TACCGATTTT ACTTGTAATC 1800

ATATGCATGA TAATTGCATT CAATTTTGAC ACTTTTAAAA CAGGCTTTTT CAGTATGACG 1860

TCAGAGGGTG TATTGCCATT TAGTTGGGCG AGCACAATGA GCCaaGTtAA AAGTACGrTG 1920

CTAGTGACAG TTTGGGTGTT TATCGGTATC GAAGGTGCAG TAATTTTTTC TAGTAGAGCT 1980

nAAAATGAGA AAGATGTAGG TAGTGCCACG GTTATAGGAC TTATATCAGT TTTAATTATC 2040
TATyTCTTAT TAACTGTATT AGCTCAAGGC GTGATTTTGC AAAATCATAT TTCGCAATTA 2100

GATTCGCCAA GTATGGCACA GGTGCTTGCA ACTATTGTAG GTGGTTGGGG ATCTACACTT 2160 

GTAAATATTG GTTTAATTAT TTCGGTACTA GGTGCATGGT TAGGATGGAC ACTGCTTGCT 2220 

GGTGAATTAC CTTTCATTGT TGCAAAAGAT GGATTATTTC CAAAATGGTT TGCTASGAA 2280

AATAAAAATG GAGCACCTGT AAATGCACTG CTTATTACCA ATATATTAGT ACAATTATTT 2340

TTAATAAGTA TGCTATTTAC ACAGAGTGCG TATCAATTTG CATTTTCACT AGCATCAAGT 2400

GCTATTTTAT ACCCTTACAT GTTCAGTGCA TTTTACCAAG TTAAATACAC TTTAGAGCAT 2460

CGACAGCAAG CAACTACTAA ACAATGGACG ATTGGTATCA TAGCCTCAAT TTATGCTATA 2520

TGGCTTATAT ATGCAGCAGG TATCAATTAC TTATTATTGA CGATGTTACT TTATATTCCA 2580 

GCTCTTCTTG TTTATACaAT CGkTCmAAAG rATwATCAGa CACGTTTGAT TAAATCAGrC 2640

TATATTCtTT TTATGATTAT tATCGTACTT GCAGTTATG GGTTAATTAA GTTATTGATG 2700

GGAACGATAA ATGTTTTTTA AAAGGAGCGA CAAAAATATG AAAGAGAAAA TTGTCATTGC 2760

ATTAGGCGGT AATGCGATAC AGACAACAGA AGCAACAGCT GAAGCACAAC AAACAGCTAT 2820 

TAGATGTGCG ATGCAAAACC TTAAACCTTT ATTTGATTCA CCAGCGCGTA TTGTCATTTC 2880

ACATGGTAAT GGTCCACAAA TTGGAAGTTT ATTAATCCAA CAAGCTAAAT CGAACAGTGA 2940

CACAACGCCG GCAATGCCAT TGGATACTTG TGGTGCAATG TCACAGGGTA TGATAGGCTA 3000 

TTGGTTGGAA ACTGAAATCA ATCGCATTTT AACTGAAATG AATAGTGATA GAACTGTAGG 3060 

CACAATCGTT ACACGTGTGG AAGTAGATAA AGATGATCCA CGATTTGATa ACCCAACTAA 3120 

AcCAaTTGGT CCTTTTTATA CGAAAGAAGA AGTTGAAGAA TTACAAAAAG AACAGCCAGA 3180

CTCAGTCTTT aAAGAAGATG CAGGACGTGG TTATAGAAAA GTAGTTGcGT CACCACTACC 3240

TCaATCTATA CTAGAACACC AGTTAATTCG AACTTTAGCA GACGCEAAAA ATATTGTCAT 3300 

TGCATGCGGT GGTGGCGGTA TTCCAGTTAT AAAAAAAGAA AATACCTATG AAGGTGTTGA 3360 

AGCGGTTATA GATAAAGATT TTGCTAGTGA GAAATTAGCA ACGCTGATTG AAGCAGATAC 3420

CTTAATGATT CTTACGAATG TAGAAAATGT ATTTATTAAC TTTAATGAAC CTAATCAACA 3480

ACAAATCGAT GATATTGATG TAGCAACACT GAAAAAAtAC GCGGCACAAG GTAAGTTTGT 3540

GGAAGGATCG tGTTGCCAAA AATAGAAGCT GCGtACgtTT GTTGAaAGtG GGGaAACCAA 3600 

A 3601 
(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 573 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

CGACACTATT AAATGAATTA GAGCACi&TC TAACAAATCA AATTCATTTT TCAAAAGATG 60 

AACGACTCAC ACATATCGCT TTAAAGTTAT TCGAAACAAC CGATCCTGTT TCAACAAAGC 120 

AACTTGCGCA AGATGTTAAT GTTTCGCGTC GGACAATTGC AGATGATATT AAAATGATTC 180

AAGCCCAACT AGATCAATAC CATTTAAAGT TAAATTATGT CCACAATAAA GSTTCAATA 240

TTATTGGTGA GGAAGATCAT TATCGTAAAG CGTATGCACA CTTTATACAT CAATATATGA 300

AACAAGCTGC ACCTTTTATA GAGGCGGATA TCTTTAATTC AGAATCAATC GCATTGGTTC 360 

GCCGTGCCAT TATTAAGACA TTAAATAGTG AAAATTATCA TTTAGTTCAG TCGGCTATCG 420

ATGGCTTAAT CTATCATATA CTCATTGCCA TTCAGCGTTT AAATGAAAAT TTTTCGTTCG 480

ATATACCTAT CAATGAAATT GATAAATGGC GACATACTAA TCAGTATGCn ATTGCTTCAA 540

AAATGATAGA AAACTTAGAA CGCAGTGTAA TGT 573
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1221 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

TTGATATTTA TAACGTTATA TTTTAATAGT TG&CTGGAT TATTAAATAA ATAGTCCGCC 60 

AAATTTTCTT TTTCTTTATC AATCTGaTkG TAATTAACaC TTTCGaCTTC TGTAGGAATT 120 

CTAATGTCAA CAGAAGCATT GATATAAGCT TGATGTTGCA TGCAATCACA CTCCTAATCC 180

TTCATmTmAA ACGGAGAAGT AAACCCGTCA CTATTCAAAT TCAATCCTTT TGCCCAATS: 240

ACAGGCTTAT TCATGATAGT TTCGATTTCC TTAAGTCCAT TTGAACCTCT AGGTATTTCT 300 

ACAATTACTT CATCATGGAC ATGGCCAACT ATTTTAAAAC CTAATGCTTC AAGCCTTGCT 360 

ATAGAAATCG CAAGTAAATC CCTTGCAGTT GCTTGAACAA TATTCTCGAC TAACTTCCCA 420

CCATACGTTT TTAACTTTGA CCATTTACGG TTAAGATCTA ACCCCATAAA TTCAACAACT 480

TGACTACCCC AACTATTTTC ACCAACTAAA GCTTTTGGAT AAGCTAAAGC TCTTCCACTA 540

GGCAGTTCAA TCATTAGAAA ACCTTTTTTC ATATAAAATC TAAGTCCATG TGTATGATGC 600

GTCTTTCGGG ATTTTACAGT ATTAATTGCA GCCTCTTGGC AAGCCTTCCA AAAATTAACT 660

ATGTTAGGAT TTGCGTTACG CCAACTATCA ACTAAACCTT GTAACTCGTT TTCTTCAATG 720
CCCATTTCCA ATGCACCCAT TGCTTTTAAA GCTCCAGCGC CACCTTGATA GCCTAAAGCT 780

AATTCGGACA CTTTTCCTTT TTGTCTGAGA GGGTCGCCTT TAGTTATGCT TTCTACCGGT 840

ACATTAAACA TTTGAGAAGC CGATGCTTCA TATATCTTTC CGTGTGTGTT GAATACATCT 900

AAACGCCATT GTTCTTTTGC ATACCATGCT ATGACTCTTG CCTCTATTGC AGAAAAATCA 960

CTTACTGCTA GTTCATTACC TTCTTCAGCA GTAAATGTCG TCCTAACTAA TTGACTTAAT 1020 

AAGTCTTGAG GATGAACATT GASTAATAAA TCTAAATCAT CAAAACGTTG TTCTTTAATA 1080 

AGATCTCTTG CTATTTCTAA TTCAGTATCT GAAATATAAT GCTTTGTTAA ATTCTGAAGT 1140

TGTACACCTC TACCTGCCCA TCTTCCAGTA CCGGCACCGT AAAATTGAAA CAGACCTCTT 1200

ACCCGTTCAT CACTGCACAT C 1221 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1090 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TTTTGTTTGG TATGAGGTAG CAATGACGAC GTGTCATTGG TGGAGATTGT AAAAATACAT 60 

AATAAAAAGA AGCGGCAATG TATACCGCTC CTTTTTTATA CTACATACCG ATTTTCAACC 120 

ATCTCTTTCT ACTTAGTAAT AAGACAATAG TATTAACTAT AAATAGAAGA ACGAAGAATG 180 

ATACTATATT TATAATTTCA GTAGGACACA TAAATGTTGA CTCGTTATTC AATATTTTTT 240 

CTACGGCACG ATACATCGTA TTGCTCGCCT CAAATGGAGC AACGATACCA AATATATTTT 300 

TATTAATGGC AACTAAGATG ACTGAACCAA TCCAATATAC AATGCTGATA CCTAAGCTGA 360

TTAAAATGTT AGGTGAAACC ATACTAATQ3 TTCCAACAAC TAAGATATAT TGTAAGATAA 420

CGAGTGAAAA TAAGATTATT AATAGTAAGT AATGTGAGAA ATCCGAATAT ATAATTGAAA 480

TAATAGTGAT ACTTAGAATT ATGAACACTA AACATTCAAA AAATAACACT GCTACCTTTT 540

TATAGAAGAA GGTAAAGATA TTATCGCCAA TCAATTTATA AAACAGGATA TTTTATTCG 600 

AATACTCTTT ATTAATAAAA TATGCAATAA CAAATGAAAA TAGTAAGAAC CCTAATTGCG 660 

TTGCAACAGT ATATGAACTG AAGAAAAACT GGCTATAGCT TAAACTTTTA ACTTTGTCTA 720 

TACCTATTGG TAAAAAATAC CCAAGTAAGA AAAGGAATGT GAATAGCACA ACAAGCGTGT 780

AAATAATTTT ATTGGAAATA CTTTTTTTAA ATTCTAATTT CAAAGTGGAC ACCTCAATTA 840 

TAAATTAATG TAATCATTTA TGACTTCTTC TTTTGATTGG TACTCTTCTA TTTGAAGGTC 900 

TTTAAAAATA AAGTATTTAC CCGGCAAAGC ACTTAAATCG GATAAATTaT GTGTAATATT 960
GATAATAGTT TTAGTTTGAT GGCTTTGAAT AAAAT<ATTT AAAAATTCAT AAATTTCATT 1020



AACTGTTTTC TTGTCTAAAG CGTTTGTAAC TTCATCTAAT ATGATTAAAT CATGATCTTC 1080



CAATAAGAAA 1090



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 904 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

TTAGGACTAT TTTATCATAT TCATTTAAAT TACGGCTAAA AATTTTAAAA ACGGGGATTA 60 

ATATATGGAA TTAAGCTATG AAAGTTAATT GATACTTGCA TTTTACGCTG ATTTATATAA 120

GAATAACTAT TGTATAGTTT TAAAAACGAA CGTACGTTTG CAGGAGGCGA AATCATTGGC 180 

AATGAATAAA CAAAATAATT ATTCAGATGA TTCAATACAG GTTTTAGAGG GGTTAGAAGC 240

AGTTCGTAAA AGACCTGGTA TGTATATTGG ATCAACTGAT AAACGGGGAT TACATCATCT 300 

AGTATATGAA ATTGTCGATA ACTCCGTCGA TGAAGTATTG AATGGTTACG GTAACGAAAT 360

AGATGTAACA ATTAATAAAG ATGGTAGTAT TTCTATAGAA GATAATGGAC GTGGTATGCC 420

AACAGGTATA CATAAATCAG GTAAACCGAC AGTCGAAGTT ffiCTTTACTG TTTTACATGC 480 

AGGAGGTAAA TTTGGACAAG GCGGCTATAA AACTTCAGGT GGTCTTCACG GTGTTGGTGC 540 

TTCAGTTGTA AATGCATTGA GTGAATGGCT TGAAGTTGAA ATCCATCGAG ATGGTAATAT 600 

ATATCATCAA AGTTTTAAAA ACGGTGGTTC GCCATCTTCT GGTTTAGTGA AAAAAGGTAA 660

AACTAAGAAA ACAGGTACCA AAGTAACATT TAAACCTGAT GACACAATTT TTAAAGCATC 720 

TACATCATTT AATTTTGATG TTTTAAGTGA ACGACTACAA GAGTCTGCGT TCTTATTGAA 780

AAATTTAAAA ATAACGCTTA ATGATTTACG CnwGGgTAAA GAGCGTCAAG AGCATTACCA 840

TTATGAAGAA GGGAtCaAAG rGTTgTTAGT atGTCCAaTG ArGGAAAAGA AGTTTTGCCT 900 

GACG 904
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11271 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

GATTTCTAAA TCAAGATCTG TTTTACGATA ACCATTCAAA CCTTGACGTT CATCTTCTTC 60

AGGTTGATTT TGTTGCTGTG TGTCTTTGTT GTCAGAAGTC GCTACEGTTT TTTTATTATC 120 

TGTTTCTTTA GTCATAACAA ACGCCTCCGT TATAAAACGC TATATTTAAT GATATGTGAT 180 

ATTTAAGCCA TTATTTTGAA AAATGGTGAT AAATATATCT TCTTGAGTTA TATCATAATT 240

TTAATAAGAC GATTCAGCAA GTTTTAAAGT ATTATTTGAC TATGTTGGAT TAGGCATCTA 300 

GTCCTATAAT ATCACTGACA TTGTCAAAAT GATGATCTTT TAAGTAACGT GCGATGCCTT 360 

TGTTCATTTT CTTAGTTAAA CCTGGGCCTT CAATAACAAG TGATGAATAA ATTTGAATAA 420

GTGACGCACC GTGACGCATC ATTTTGATTG CATCTTCAGT ACTGAATACG CCGCCTGTAC 480

CTATAATTAA AAATTCACCA TTTGTTTSCT GATAAgCATa CTTAATCAAT TTTAAATTAC 540

GTTCAAATAA TGGACGACCA CTCAAACCGC CTTCTTCGAC TTTATTAGCA GAAGTTAAAC 600 

CATCTCGTTG TCGCGTTGTG TTTGCTAAGA TGATACCGTC AAATGTCTCA GTAATCGCTG 660 

GTAATAGTGC TTTTAAGCCA TCGAAATCCA TATCAGACGT TAGTTTTAAA TAATTGGCA 720 

CTGTTACATC ATGTTGTTTT TTAAATGCTG TTAAAGCTTG GCATAACATT GAAAATTCAT 780

CTTTATCATG GAAGTTTTGA AGATTTTCAG TATTTGGAGA ACTGATGTTG ACTGTGAAAA 840 

ATGAAACGTC GTGTTTAAAC GTATCAATAA CCTTTATATA ATCTTGATAA CGCGCTTCAT 900 

AAGGTGTCAT TTTATTCACA CCAACATTGA TACCAACAGG TACTTGATAA GCATTTTTAC 960

GCAAATGACT TAGTGCTTTG TTCATACCAA TATTATTGAA GCCCATTCGA TTTATCAAGG 1020 

CGTCATCTTC TAATAATCTA AACATGCGTG GTTGAGGGTT ACCCGGTTGA GGTTTAGGTG 1080 

TGATACCACC TAATTCTAAA GCACCGAATC CAAETGTTC CAATGCTTTT GGTACTTCGC 1140

AAGATTTGTC GAAACCAGCT GCTAAgCCAA TTGGATTGTC GTACGTATTA CCTTGTATCG 1200 

TTTGTGATAA CGTTGGATTC TTATAAGTAA ATAGTTTATC GACGACTGGG AATAAAACCG 1260

GaAACTTTTG TaACGTTTTT AATGCATCGA TAGTTAGTCC GTGTGCTTTT TCGGGTTCGA 1320 

TTTTGAATAA GAAAGGTTTA ATTAATTTGT ACATGAGTAT GCTCCTATTT CATTATATTT 1380

GAGGCTTACT ATCCTCAACT TAATATATGT GAAATATATT CTTTTAATAG ACTAGCATTT 1440

CCATACATAA TTTCCTAGTT AAAACTAAAA AGTTTTGAAA ATTGACGCAA gTTTGAATAA 1500 

CGTTTTTAAG ATTAAATCAT CCTAATTAGG CAATATTATA GTATAAAGTA AGTAGATTGG 1560

AAGGTGTTTG TATGAATGAA CAATGGTTAG AGCATTTACC TTTAAAAGAT ATTAAAGAGA 1620

TTTCACCAGT GAGTGGTGGT GATGTAAACG AAGCATATCG AGTCGAAACA GATACGGATA 1680

CATTTTTCTT ACTTGTCCAA CGTGGACGTA AAGAATCATT TATGCTGCA GAAATTGCAG 1740 

GTTTAAATGA ATTTGAACGT GCAGGTATCA CGGCACCTAG AGTAATTGCA AGTGGCGAGG 1800 
TTAACGGTGA TGCGTATTTA GTGATGACGT ATTTAGAAGA AGGGGCTTCA GGGAGTCAAC 1860

GCCAATTAGG GCAACTCGTA GCTCAATTAC ACAGTCAGCA ACAAGAAGAA GGCAAATTTG 1920

GCTTCTCATT ACCTTATGAA GGTGGCGATA TTTCTTTTGA TAATCATTGG CAAGACGATT 1980

GGTGTACCAT TTTTGTCGAC AAACGCTTGG ACCATTTGAA AGATGAGTTG CTAAATAGAG 2040

GGCTATGGGA TGCCAACGAT ATCAAAGTAT ATGACAAAGT GCGACGTCAA ATTGTGGCGG 2100 

AATTAGAAAA GCATCAAAGT AAA^CGTCTT TATTACATGG TGACCTATGG GGTGGTAATT 2160 

ATATGTTCTT ACAAGATGGT CGTCCGGCGT TATTTGATCC AGCGCCATTA TATGGTGACA 2220 

GAGAATTCGA TATCGGTATT ACAACGGTAT TTGGTGGTTT TACGAGCGAA TTTTATGATG 2280

CGTATAATAA ACATTATCCA CTCGCAAAAG GTGCATCCTA TAGACTTGA TTTTATCGTT 2340

TATATTTATT GATGGTCCAT TTATTGAAAT TTGGTGAGAT GTACCGTGAT AGTGTTGCGC 2400

ATTCTATGGA TAAGATTTTA CAAGATACAA CAAGTTAGTT AAGACGTTAG ATTGAGATAA 2460

ATAGATAATA TGCACAGATA TTTTTACAAT GAGAAGCGAT ACAGCTGCCT CAATAAAAAT 2520 

ATTTGTGCGT TTTTATTGTT GGAAAATAAA ATTTTAATCG CTATTGTTAA TTTCTGTAAT 2580 

GTAAAACAAG GTTGAGTTAC AATAAAAGTG ATTTTATAAC TTTTTGTTCA ATAAAATTCT 2640

AGGAATGATA CATATTTATT GATACAATAA TTTTGAATAT AATCATAAAA CAATATTTAA 2700

GTATAATTGA ATGTTTGAAT ATCATATATT GATACAGTTT CTAATAATTT TAAAATAATT 2760 

TAAATGGAGA GAGGTGTAAA TGATGAGTAC AGTTCAAAGT GATATTTTTA AGACCAATAG 2820

TGCATCATCA TCTATTAAAA GCGCTGTTGA AACATGTAAT AATGTGTCGA AACCGGATAA 2880

AGATGAAAGT ACAACAGTAA GTGGAAATAA TAATGCTCAT AGTGTGATAG ATGATTGAT 2940

GAGTAAGAAT CAATCTGTTG CTGAAGCAAT ACGAACTGCG AGCGATAATA TACAAAAAGT 3000

TGGTGAGGCT TTTGACCAAA CTGACGTAAT GATTGGTAAT GAAATTGGTA AAAATTAAAA 3060

CGTGGTGAAA TGATGTCGAA TAAACTGGAT GAAATCAATA AAATAATCAC AGCGAAACAT 3120 

GAGCAAATGG ATGACTTATA TGATGAAAAG CGAGAGGTTA AAGCATTGAT AGATGAAAGT 3180

GATGCGCTTA ATCATTCGAT AGATCAATTA TATCAACATT TAGGTGAGCG TTATTATAGT 3240

AGCAATATGG CTAGTCGTAT GGAACAGTTC CGCGATGAAT TTCATTTTGC GAAACGACGT 3300 

TCAACGGAAG CGTTATACGA GCAGCAACAG CAAATTCMC ATGGCATTCG TAAAGTGGAA 3360 

GAAGAGATGA TTGACTTGGA AATGCGAAGG AATGTTGAAA TTGAGACGGT GACAAAGGAG 3420

GAAAATAAAT GGAAACAATA GGAAGCATTA TTTATTTAAA AGAAGGTTCG CAAAAGTTAA 3480

TGATTATTAA TAGAGGmCCA aTTGTAGAAA TTGAAAATCA AAAGTATATG TTTGACTATT 3540 

CTGCATGTAA ATATCCGATT GGTGTTGTAG AAGATGAAAT TTATTATTTT AACGAGGAAA 3600 
ATATAGATTC AGTTATTTTT AAAGGTTATT CTGATCAAGA TGAGGTTAGA TTTCAAGAGT 3660

TGTTTGAAAA TATGAAACAA AATTTGGATA GTGAAATACA ACGTGGAGAA GTTACACAAC 3720

AATAAAGAAA TACTTTTTCT TTATTGGGGT GGGACGACGA AATAAATTTT GTAAAAATAT 3780 

GATTTCTGTC TCACCGCCAA AATGTTATCT CCaAGTACTG AGGCTGGATG ATACCATGAA 3840

ATGTCATTCA TAATCATTTG AACTAAACGT AGCAGCCTTA AATTTTAAAA AAAGACACAT 3900

ACCAACTTCC GAAATGTAGA TGAATTCTCT ACAATAACGG AAGTTTTCT TTTAATATTG 3960

AAATTTCTCA AGGATAGGTC TATACTTTAT AAATCGTAAT TATTACGATT TATAATCAAA 4020

AACAATAACT TGAAATAGAT CATTGAGGGA GTGTTAATAT GCAACATCAT AAAGTGGCTA 4080

TTATcGGTGC CGGTGCTGCA GGTATAGGTA TGGCCATTAC CTTAAAAGAT TTCGGTATAA 4140

CAGATGTCAT TATTTTAGAA AAAGGAACAG TAGGACATTC ATTTAAACAT TGGCCGAAAT 4200

CGACCCGTAC GATCACGCCA TCATTTACGT CTAATGGATT TGGCATGCCT GATATGAATG 4260

CAATTTCCAT GGATACTTCA CCAGCATTTA CATTTAATGA AGAACATATT TCCGGAGAAA 4320

CATATGCTGA ATATTTACAA GTGGTTCECA ACCATTACGA GCTGAATATC TTTGAAAATA 4380

CAGTTGTCAC AAATATATCT GTAGATGATG CATATTATAC GATTGCAACG ACAACAGAGA 4440

TATATCACGC GGATTATATC TTTGTCGCAA CAGGTGATTA TAATTTCCCT AAAAAgCCAT 4500

TTAAATATGG TATTCATTAT AGTGAAATTG AAGACTTTGA TAACTTTAAT ASGGGCaAT 4560

ATGTGGTTAT CGGAGGTAAT GAAAGTGGCT TTGATGCTGC ATATCAACTT GCAAAAAATG 4620

GCTCTGACAT CGCACTTTAT ACTAGCACAA CCGGTTTAAA TGATCCGGAT GCTGATCCTA 4680

GTGTTAGATT GTCACCTTAT ACACGTCAGC GACTAGGTAA TGTCATTAAG CAAGGTGCTC 4740

GCATCGAAAT GAATGTACAT TATACAGTTA AAGATATTGA TTTTAACAAT GGACAGTATC 4800

ATATCAGTTT TGATAGCGGA CAAAGTGTGC TTACACCTCA TGAACCAATA CTAGCAACTG 4860

GCTTTGATGC AACAAAAAAT CCAATCGTTC AACAATTATT TGTGACAACA AATCAAGATA 4920

TTAAATTAAC AACACATGAT GAATCGACAC GTTAXCGAA TATTTTTATG ATTGGTGCAA 4980

CAGTTGAAAA TGATAATGCC AAATTATGCT ATATCTATAA ATTTAGAGCG CGATTTGCAG 5040

TACTTGCACA TCTTTTAACA CAGCGGGAAG GcTTACCAGC TAAACAAGAT GTCATTGAAA 5100 

ATTATCAAAA AAATCAAATG TATTTAGATG ATTATTCATG TTGTGAAGTG TCATGCACAT 5160

GTTAGAAGTG AAATATGATA TGAGAACTGG GCATTATACG CCCATACCTA ATGAACCTCA 5220

TTATTTGGTT ATTAGTCATG CGGATAAACT TACCGCAACA GAAAAAGCGA AATTAAGATT 5280

ATTAATCATA AAACAGAAAT TAGATATTTC ATTGGCAGAA AGTGTAGTTT CTTcGCCTAT 5340

AGCGAGTGAA CATGTGATAG AACAATTGAC ACTATTTCAA CATGAGCGAC GACATTTAAG 5400

ACCTAAAATA AGTGCGACAT TTTTAGCCTG GTTGTTGATA TTTTTAATGT TTGCATTGCC 5460
AATCGGTATC GCTTATCAAT TTTCAGATTG GTTTCAAAAT CAGTATGTGT CAGCATGGAT 5520

AGAATATTTA ACTCAAACAA CATTGCTCAA TCACGATATA TACAGCATA TATTATTTGG 5580

TGATTATGGT GTGCTATCAC TTGGAACATA TTCGCTCGTA TGGGCATTGC CGGTTGTAAT 5640 

ATTGATTAGT TTATCAACTG CTATAATTGA TCAAACAGGA CTCAAATCAT GGATGATATG 5700

GGCAATTGAA CCGTCAATGT TATGGATAGG ATTACAAGGT AATGATATCG TGCCACTATT 5760

AGAAGGGTTT GGATGTAATG CAGCAGCTAT TTCACAAGCA GCACACCAAT GCCATACCTG 5820 

CACGAAGACA CAGTGTATGA GTTTAATAAG CTTTGGTAGT TCTTGTAGTT ATCAAATAGG 5880

TGCGACATTA TCTATTTTTA GTGTAGCTGG AAAGTCATGG CTATTTATGC CGTACTTAAT 5940

ATTAGTACTT TTAGGTGGCA TCTTACATAA AGGATATGGT TGAAAAAGAA TGATCAACAA 6000

CTTAGCGTTC CGCTACCTTA TGATAGGCAA TTACATATGC CAAATATACG TCAAATGTTG 6060

CTACAAATGT GGCAAAATAT ACAAATGTTT ATCGTTCAAG CGCTACCTAT TTTTATCACA 6120

ATCTGTCTTA TTGTTAGTAT TTTATCACTA ACGCCAATTT TGAATGTTT ATCACAAATA 6180 

TTTACACCTA TATTATCGTT ATTAGGCATC TCGTCAGAAT TGTCACCAGG GATTTTATTT 6240

TCAATGATTC GAAAAGACGG CATGCTCTTG TTTAATTTGC ATCAGGGCGC CTTATTACAA 6300 

GGAATGACAG CAACACAGTT ACTACTACTT GTGTTTTTTA GTTCAACATT TACAGCGTGC 6360

TCGGTCACAA TGACGATGCT TTTGAAACAT TTAGGTGGTC AGTCAGCACT AAAATTAATT 6420

GGAAAGCAAA TGGTGACATC ATTGTCTTTA GTTATTGGTG TAGGCATCAT TGTTAAAATA 6480 

GTAATGCTGA TTATTTAAAA AAAATGAACT ATAACTGAAT ATAGAGTCAT GTCAGTCAAT 6540

AGGAGATCTA TCTTGGAATA TGCTATTCAT ATGAAGTATA AGAGGAGAGT CGCAGATGAA 6600 

AATAGTTATT ATAGGTGGGT TTTTAGGTGG CGGTAAAACG ACTGTCTTAA ATCATTTGCT 6660 

CGCTGAATCA TTAAAGGAAT CGCTGAAACC AGCAGTCATC ATGAATGAAT TTGGGAAAAT 6720 

GAGTGTTGAT GGTGCCTTAG TATCTGAAGA CATACCTTTA AGTGAACTGA CAGAGGGTG 6780

TATCTGTTGT GCAATGAAAG CAGATGTATC AGAACAGTTA CATCAATTAT ATTTAAAAGA 6840

GCAACCAGAC ATTGTATTTA TTGAATGTAG TGGGATTGCA GAACCGGTCT CTGTCTTAGA 6900 

TGCTTGTTTA ACGCCTATTT TAGCTCCGTT TACAACAATT ACACATATGA TTGGTGTAAT 6960

AGACGCAAGC ATGTATAAAC ACATTAAATC ATTCCCTAAA GACATCCAAG GCTTATTTTA 7020

TGAGCAATTA GCATATTGTT CTGTCTTATT TGTTAATAAA ATAGATTCAG CAGATGTTGA 7080

AACAACGAGC AAACTATTGA AAGATTTAGA AGTTATTAAC CCAGAGGCCG ATATACAAGT 7140

CGGTATGCAT GGCAGCGTCA CTTTGCCAAT ATCAGTTSA C AAAT G AC AG CAACTTCTGA 7200 

CAATAAACAT AAGTCTTTAC ATCAAATGAT TAATCATCAA TTTGTGCAAT CACCAGTCAA 7260
ATGTACTAAA GCAGAGTTTA TAAAACGTTT AGCATGCCTT CCGTCTCATA TTTATAGGTT 7320

GAAAGGGTTT ATGACATTTG AAGACACCGC ACATACGTAT CTCATTCAAT TTACACAAGG 7380

ACAATATGAA TTAACACCTG TAGCATTTTC AAAAAAAGTG CCAGAATATT TGGTGCTTAT 7440

CGGAAAGGGT ATTTCAAAAG AAGACTATCA ATGTTTGGAA CAGTAGTGTT TTCAGTGGAA 7500

GAGAATGGTT AACATGCCTT CATGTATAAT AACGAGTTGA TTTGAACGTT TAAGCGTAAA 7560

TAAAAATAAG CTTGGTCAGC CATCAAATAT AATTTGAAAA CTGTCCAAGC TGTTTTATTA 7620

GAGAACAATC AATTAACCCC ACATATTTAA TAATACATCA GCAAAGCCTT CAGGTTTTTG 7680

AATATAACCT AAGTGACCGC CTGGAATATC TACAATAGGT ATGCCAGTTT CTTTATTTAT 7740

ATAAAAGTTA ACATCTTGTG GGAAGGAGCC TCTAGAATCT GTCCBTTTA GTAGGGTGAT 7800

TTTATCGCTG TATTTTGTGA AATCATCCAA AGTAATATCT GAATGCGTAT ATTGTCTAAT 7860

TTCAAATTCT GACCAGAACA TCGTACGTTT GTACTGTTCT ATACGTCCTT CTTCAGTATC 7920

AGCAGGTTGA GACATCATTT TTGCATCAAT TGGTGCGATA TTTAATGTTT CGCCAAATGT 7980

TTTCATGCCT TTTTCTAAGC CTTCTGTTAA AATTTGATGC ACAATGTCAT CATTTTTATC 8040

TTTCCAATAA GTACTGTCTG GTAAAAATGT ATTAATTGGT GGTTCGTGAA ATGCAATCTT 8100 

TTTAACGACT TCAGGGTAAT CTTTTAACAC ATGCATCGCA ACGATTGAAC CTGAACTTGA 8160 

ACCTAATATA TAGACAGGTT CATCACTfAA TGACTTTGCA AGTTCGGCAA TGTCCTGTGC 8220

GTCGCGTTTG ACACGATAAT CACTGTCAGG GTTTGAAGCG GAATCAGGGA GTGGTTCAGT 8280

TAACTCGCTT TCTCCATAAT CACGACGATC AACGGCTACA ACAGTAAAAT GGTCTTTTAA 8340

CTGTTCTGCA AGAGGCAGAA AAATGTCTCC GGTACCGTTT GCACCAGGAA TAAGATGAG 8400

CACGGGTCCT TGTCCGACTT GGTGGTATCG TAATTTAGCG CCTTGTAATT CTAAAGTTTC 8460

CATATTCAAT GACCTCCATT TGTTAATTGT TAGGTGATAA ACCTAATAAT TTAGCACCAT 8520

TTGTATAACT TATTTTCTCT TTTTCTTCAT CTGTTAAACC CAGTTCATCT AAAAATACAC 8580 

CTAATTTTTC AGGCTCAATA TATGGATAAT CAGCAGCATA AAGAATTCTA TCAATACCTA 8640

CTTCTTTCTT GACTAAATCA AACTGTGGCT TCGTTAACAT GCCACTCGGT GTGATATAAA 8700 

AATTATTTTT AAAGTAATAG CTTACAGGGT GGTTCAAATG TTCAGCGAAT AAAGCTTCAT 8760

CCATACGTTC TAAGAAGAAT GGGATAAACT CACCCAATG TCCAATAATC ATATTTAACT 8820 

TTGGATAACG ATCAAAAATA CCAGATAATA CTAGATGTAT TGTATGAATG CCGACATCAA 8880

TGTGCCAACC ATAACCAAAA CAAGCAAATG TTGCCGCAGT TACTTCAGGA TAATTTCCTT 8940

TATAGTATGA TTGATAAATG TCACTGTTAA CTGGCGCGGG ATGTAGATAA ATCGGTACGT 9000

CTAAATTTTC AGCTGTTTTG AAAATAATGT CATATTTGTC TTGATCAAGA AAACCATCTT 9060

GTGCACGTCC CATAATGAGC GCACCTTTGA ATCCTAAATC ATTGATGCAA CGTTCGAATT 9120 
CTCGCGCTGC GGCTTCAGGC TCATTGATAG GTAAAGTTGC AAAGCCTACA AAGCGATTGG 9180

GATATTGTGC AATATAGTTT GCCAATTGAT CATTTGCTTT TTGACATAAT TCAATGGCTT 9240

TCTGACCAAC CAAATTTGAA GGAGAACCAT TTCCATAAGA TAAGACTTGA ATTTGAACGT 9300 

CTTGATTATT CATAAATTGG ATACGTTCAT CATGATGTGA TAATTCGTCG GCATTTGTAA 9360 

AACCTGTCTT TTTTTcAAGG CCTTCTAACA TTACTTTCAT GGTACACCT TTAGGATCTG 9420 

CTGATATCGC ATTCATCGTT TCTTTTTGAA TATCTTCAAT GACATAATGT TCTTCAAACG 9480

TAATACTTTT CATTTACTTC GCCTCCATAT TGTATTGCAT GTTTATTGCA TCTATTGCAG 9540

AAGCATTTTT TATATACCTC TAATTTCAAT GTTTGTAACA TAAAATTGAT CTACCAAGGC 9600

ATCTCTCCAT CGCCATTAAT AAATGTACCT GTTGGGCCAT CTGCACCAAT CGTTGCTAAT 9660 

TGAATGATTG GCTTGATTCC TTCAGAAACG TGTTTGGAAT TATTACTAAA ATCACCAACT 9720

AAATCAGTAT TTGTAGCGCC TGGATCAGCA GCATTGATTT GCATGTTAGG TAATCCTTTA 9780

GCGTATTGTA GCGTTAGCAT TGTTACTGCC GATTTAGACG AACAATAAGC TAATGAATTC 9840

ACTTTAGATT CAGCTGTTTC GGGGTTTGTA ACCATTCCAA ATGAACCTAA ACCACTTGAT 9900

ACGTTGACGA CAACAGGTTG TTCAGATTTT TCTAAGAGAG GGACGAATGT ATTCATCATT 9960

CGTACGATAC CGAATACATT CGTTTGATAT ACTTCTTCAA CGTCACGAG TGTCAATTTG 10020

GAAGGTGCTG AAAATTGACC AGATATACCT GCATTGTTAA TGAGGATATC AAGACGGCCT 10080 

TCTTTTTCAG CAATCATGTT ATAAGCATTT TTGACTGAGT AGTCACTTGT AACATCTAAT 10140

TGTACATAAT GAACACCTAA TTTTTGTGAT GCTTGTTGTC CTCTTACATC ATTCCGAGAA 10200 

CCTATATAAA CTTTGTAACC CAATGCTTTA AGTGCCTCTG CACTTGCATA GCCTAACCCT 10260

TTATTGCCTC CTGTGATTAA CACAATTTTA GTCATTACGT CCCACCTCAT CTAAATAAAT 10320 

GTTTAATAAA TAATTTCTGT ACGCTTCAAT TGAAATATGG CGATGCTCTA TTTGGAAGGC 10380 

AAATACACTA GTTGATAATG ATTGCAACAG CATATCTGTT TTGAAtTCGT GTAAGTGTCG 10440

TCATCGCTTT TAAATAAGTC ATAATAAAAA TCAAATAATT CTTGATAAAA TGCGCTTTGG 10500 

TAAAAACGTA ATTTATTGTT GCCTGCTTCA ATACATTGCA GTAGTGCCTT ATTATCGATT 10560

TTAAATTGTA AAAGATAATC TAACGACACT TGCATAACCT CATAATTAGA AT G AT&T C A 10620

TCTTTAATTT GCTTAAAATG AGTGATAAAA ATATCAAGGT CTCTTTGTAT GACGTAGTAG 10680 

CATAAATCGC TTTTATCTTT GAAATGTCGA TACAATGTCC CCATACCGAT ACCTAGTTCT 10740 

TTAGCAATAC GATTCATACT AATGTTTTCA ACGCCTTCTT CATCAAAAAG TTTGTGCGCT 10800

ATTTCTTCAA TTCGTTGCCT ATTCTCTTTT GCATCTTTTC GCATGATTAC ACCTACTTAA 10860

AATTCTCTAA AATTGACAAA CGGATAACTC TCCGTTTATT ATAAAACGTG TTAAGAAAGT 10920
TAGCAATGAA TTTGCAATAA CTATTAAATA TCATAAAAGA AAAGAGTGTT GATAATGTCT 10980

TTAAATAAAG AGCAAAGACG CATCACAGCT GAAGAGT'EC AAGCACATTT CGAAGCATCT 11040

ACCTTATCGG TTCAAATGAT TGCTGAAAAA CTGAATGTCA CTACAGAAGA TGTGGAAAAA 11100 

GTATTAGCTA TGACAGCGCC ACTAGGCATT TTTAGTCATC AATTACAACG ATTTATTCAT 11160

TTAGTATGGG ATGTCAGAGA TGTAATAAAC GACAATATTA AAGGAAATGG ACAAACACCA 11220

GAACCATATA CGTATTTAAA AGGTGAAAAA GAGGACTATT GGTTTTTAAG A 11271 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CAACCCGTTC AGAACAAAAT AAAAACCGTA CAATTTTATC ATCTTAATGA TTATTGTACG 60 

GAAAAACTTT TTTACATCAT ATCTGCATGT GCATAATCGA TATCGGTAAA TTTATTATAT 120

TGTTTCATAA AATGTAACTT AACTGTGCCT GTTGGACCGT TACGTTGCTT AGCAATGATA 180

ATTTCAATTT CACCGTTTTC ATCATTCGTT TGTGGCTCGA AACCACCATC ATCGTCATCA 240

TCTTCATCGC CGCCACGGTT ATAGTAATCA TCACGGTATA AGAATGCAAC GATATCGGCA 300 

TCTTGCTCAA TCGAACCAGA TTCACGAATA TCACTCATCA TEGACGTTT ATCTTGTCGT 360

TGTTCAACAC CACGAGATAA CTGACTTAAT GCGATAACTG GACATTTTAA TTCACGGGCT 420

AATGCTTTTA ATGTACGAGA GATTTCAGAA ACTTCCTGTT GTCTGTTATC GGACGCACGT 480

GAACCACTAC CTTGAATCAA CTGTAAGTAG TCAATCACAA TCATGTCTAA GCCATGTTCT 540

TGCTTTAATC GACGACATTT AGAACGTAAA TCATTAATTC GAATACCCGG TGTATCATCA 600

ATAAAAATCT TCGTACGTGA TAATTTACCT ACCGCTATAG TAAAACGACT CCAATCTTCC 660 

TCAGTCATAG TACCCGTTCT TAAGCGGTTT GAGTCAACAT TTCCAGAACT ACAAATCATA 720

CGTGTGGCTA ACTGATCAGC ACCCATCTCT AGCGAGAAAA TACCAACTGT ATACATATCT 780

TCATGCGTTG CAACTTTTTG TGCAATATTA AGTGCGAACG CAGTCTTACC TACAGATGGA 840

CGCGCTGCAA GGATAATTAA ATCATTTCGG TTGAACCCTG CTGTCATTTG GTCTAAATCT 900 

CGATATCCTG TAGGTATACC TGGTGTTTGA CCACTATTTT GATCAAGCTC TTCAGCTGTT 960

TCATACACTT GTCCTAAGAC GTCTCGAATG TCTTTAAAGC CATCGCTTTC ACGAGAAGAT 1020

GATAGCTCTA AAATTCGACG TTCTGCATCA CTTAAAATCG CATCTAGTTC AAGTTCATCA 1080 

TTATATCCAT CATTGGCAAT ACTATCTGCA GTTTGAATCA ATCTACGTTT TAATGCATGC 1140
TTAGAAACGA TATCAGTATA ATACTGAACA TTTCGCGTCG TTGGTACATT TGTAGATAAC 1200

TCTGCAAGAT ATTGCGGGCC ACCCGCTTcA TTCAACGTAC CTTCCGTCGA TAATTGATCC 1260

ATCAATGTTA CAACATCAAT TTCTTTATTA TCTTCATTTA AGTGCATCAT TGCACGGAAA 1320

ATATGTTGAT GGGCACCCCT ATAAAACGAC TJAGGAAGCA AAACTTCCTG AGTAGTATTA 1380

ATCAATTCTG GATCTATAAT AATTGAACCT AAGACAGACT GTTCAGCTTC ATTGTTATGC 1440

GGCATTTGAT TTTGCTCATA CATTCTATCC ATGAATGGTT ACACCTCTTA TTTCAATCCA 1500 

ACTTTATTGT TCAACTGTGT GTACGCGAAT TGTACCTTCA ACTTCTTTAT CTAATTffAC 1560 

AGGTACATTC GTATATCCTA GGGAATGAAT TCCATTTGGT AAATCCATTT TACGTTTATC 1620

AATTTTAATA TCATGTTGTG CTTTTAGTGC TTCGGCAATT TGTTTTGTAC TTACTGACCC 1680 

AAACAATTTA CCACCTTCAC CAGTTTTTGC TGaTACTTCA ACTTCAATGT TTGATAACGT 1740

TTCTTTTAAT GCTTTAgCAT CTTCAATTTC TTGTTGGCGT TCTTGTTTTG CACGTTTTTT 1800 

CTGTAACTCT AATTGTTTAA GGTTACCTGG TGTTGCTTCT ACAGCATAAT TCTTTTTCAA 1860

TAAGAAGTTA TTTGCATAAC CTACTGGTAC TTCTTTAACT TCACCTTTTT TACCTTTACC 1920

TTTACCTTTA ACATCTTGTG TAAAAATTAC TTTCATGCff CTTCACTCCT ACTTAATTGT 1980

TCTGTAATTG CTTGTTGTAA TTGTGCTATC GCCTCTTCGA CTGTCACACC TTTAAGTTGT 2040

GTTGCCGCAT TGGTTAAATG TCCACCGCCA CCAAGTGCTT CCATTGTTAA CTGGACATTT 2100 

ACTGAACCGA GTGAACGCGC AGATATACCA ATCAGATTAT CTTCACGTCT CGCAACAACA 2160 

TATGATGCTT CAATACCTTC TAAACTTAAC AGTTCATCTG CTGCTTGTGC AACTGTTACT 2220 

GGATGATAAA TTTTATCGTC TGAACCATGC GcAATGGCTA TGCCATTATC TTCAACTTTT 2280 

ACAGTTCGAA TTAATTCAGA TCGATTAATG TAAGTATCCA CATCATCTTT TAAGAAATGT 2340 

TGCGTTAAAA TCGTATCTGC ACCATGTGCA CGTAAATAAC TCGCTGCATC GAATGTTCTT 2400 

GATCCTGTTC GTAATGTAAA GTTTCTTGTA TCTACAATAA TACCTGCATA CATCACTGTT 2460

GATTCAAGAC GTGTTAAACG TTGTTCTGTT GGTTGATATT CCAGTAACTC TGTTACCAAT 2520 

TCAGCTGTCG AACTTGCGTA TGGTTCCATA TATATCAACA AT G GAT AG A GATGAAGCTT 2580 

TCACCACGTC TATGATGATC GATAACAACT TTACGGTTTG CTTTATTTAA GACATTTTCA 2640

TCTAAAACCA GTTCCGGTTT ATGCGTATCA ACAATCACTA CGGTTGTCTT AGATGTCATC 2700

ATATCCCAAG CATCATCTGA TGTAATAAAT CGCTCTCTTA ACTCTGGCTT TTTATCTATT 2760

TCGTTCATCA CGCGTCGTAA TGTTGGATCA ATGTCAGTCT CATTTAATAC GATGTATGCT 2820 

TCTAAATTAT TCATCATTGC AAATCTAGAC ACACCGATTG CTGCACCAAT TGCATCTAAG 2880

TCAGGACGTT TATGTCCCAT GATAATGACT TTGTCACCCT CTGCAAGGAT ATCTTTTAAC 2940
GCATGTGAGA TCACACGTGC TCTTACAGSA GTACGTTTCT CCATCGGGTC AGTCTTACCG 3000

CCATAGAAAC GCACATTACC ATTAATACTT TTAATTGCAA CTTGGTCGCC ACCGCGTCCT 3060 

AATGCTAAGT CTAGGCCTGA TTGTGATAAT TCACCTAAGT CGATTAAATT TTCAGTACCT 3120 

TCACCAACAC CGATACTTAA TGTTAATTGG GCACGATAAC CAACACTTTT TTSCGTAAT 3180 

TGACTCAAGA TATCAAATTT AGATTCTTCT AAGTCAGCTA ATATTTTTTG ATTTAAATAG 3240

GCTACGAATT GATCGGAACT GTATCTTTTG AAAAATATAT TATACTCAGT TGCCCATCGA 3300 

CTAATGACAC GCGTTACCAT TGAGTTGATT TCCGAACGCT GCGTATCATT CATATTTTGC 3360 

GTAATCTCAT CGTAGTTATC TAAAAATAAT GTCGCAATGA TTGGTTTAGA ATTTTCATAT 3420

AGTTCATTTG TTTGTACTTG TTCAGTTATA TCAAAGAAAT AGAGGCAGTG ATCATTCTCA 3480

GAATAACGTA CTTGGAAATG ATACTGATTA TATTCTATTT cAACGGATTT CACTCTATCT 3540

AATTGCTTTA AAATGTTTGG AAATACTTCA TTTA(AGATT CAGAAATGAC ATTCGCTTCC 3600

ATATGATCTG TCATAAATTG GTTAACCCAT TCGATGTGAT CATTTTCATC TAAAACAATG 3660

ATACCAATTG GTAAATGTTT GATTGCTTTA TTATTTGTTG TTGAAATTTG AGCACTCAAA 3720

CCATCTACAT AACTATCCAT TTTCATTAAA GCTTGTCTGA ATAAAATGAT GCTAACAATA 3780 

ATCATCACGA CAAGAACGAT AGATGCAATT AGTGCTATAA GACTATTAAA GATAAACCAT 3840

ACACCCATTA AAACAATTGC TGTGATGATC ATGATGACAA ATGGTATTAG TAAAGCTTTC 3900 

TTAGTGGACT GCCGATTCAT TATTCCACCT CTATTCACTT TTTAGAATTA TTTTTCATGA 3960 

TTCGCTTCAA ATTCAAACTT AAATCGATAA CACCAAGTAG TCCTACAATA TGTGTCGTAG 4020

GTGTCAGTAT TGTACCGATA ACCAATAGTA AAATCGTTAC TGCATTCGGC AAACCTTTCG 4080

CTTTACCAAA GAAATGAATA ACACTTAAAC CTTGAATATA CATTACTAAT GATAACACAA 4140

GTTGGAAGTT TAAAAGAATG CTCTGGAACA CACTCGGTTG ACTGTAAAT AATAAACATA 4200 

TGATAACAAT AATGTATATC CATAATAAAA TACCGCTCAT TTGCCACGCG AAAAGTGGCT 4260

TAAATACAGG TGTAGCGATT TTAAATTTTC GTAAAATCGG AAATGTAACG ATTAAGTTAA 4320

TTAAGACGAT TAAAAATGTA ATGATAATGA TGAAACCTGG TAATTGAACG GTCGCTTGTC 4380

TAAACCCTTC TTCTAATATT TGGGTCATAT TCGCATCGGC ACCGCTCATC GTAATCGCTT 4440

CATGTAATGT TTGCTTGAAA GGTTTTACTA TGCTCGCTGA TGGTGGAATC CTTCCGAATG 4500

TTTGTAGTAA CATAAAAGCG ATTAATGAAA TTnArCTCAT CGCTACTGTT GTTACGTATA 4560

ATATTCTTTC TTTAGACGTT CTTTCTTTGA GCAATTGACC AATAATTAAA CTTGCAATTA 4620

AGACTAATAT GATGGCACTT AAAACGAAAG TATTACCTAA AACAGTTGTT ATAATTACTG 4680

TAATAAGTGC ACTAATCCCG AAAGATTGTA TTGATTTATT CCATAAAACG ATACCTGGTA 4740

TGGTTGCAAA TAATGCAAAT ATTAGTCCTA AACCAGGCAC TAGATATAAA GCTAAAGCGA 4800
CAAATACCAA CGCAATCGTT GCAATTATTG TTGCTTTAGG TTGTATTTTT GAAAACACAT 4860

AAGCCACTCC CATATTTTTA ACTATAGCTA TTATTTTAAC CTCTTTAATG AAAATTAACA 4920

ATTTATAGAT TGTATGCTTC TATTTCATTT AATTGAATAA TAACTTTCAT GTTTTATAAG 4980

TAATTAACAT ACTCATTTGA ATCGCTTTTG TGTGCTTTCA TTTTCAACAT GATTATTTAA 5040

TCCCACTACA TAGCAATCAA GCTTGATTTA GATTTACAAT ACATTTCCAC TCTCATGTAC 5100 

TCTAGATGTT TTTGAATATG ATAACTGTGA TTTAGTGGCT TCATTCTTTG AAAATATATA 5160 

TTATTACTTA CGCTTAAAAT GCTTTAAATT TLA.G AAAT G A TATAAGTTAG GTGCCCAGGT 5220

ACTAAAGTTT AGTAGGaATC CATCATGCCC AACATTATCA GGCACGAAGA AATGACGATG 5280 

ATATTTAAAA CGTTCACCTA ATGCACGAAC TTGATCATCC GGATATAGCA AATCATCTAT 5340

GAACCCCATC GTTAACACTT TTGTTTCTAA ATTTTTAAAA ACATGCGTTA CGTCTGTCG 5400

ACCTCGGTCA ATGTTGTGAC TATCCAATAC ATCTAGCAGT GTCAGATAAC AATTCAAATC 5460

AAAATGTTCT TTAAATTTAT TACCTTGATG TTGTTGGTAT GCGACTACTT CATCCGGCGT 5520 

AAAACGTTCA TCATAACTTT TTGATGATCG ATATGTCAAA AAACCTAATT GGCGTGCAAT 5580

ACTTAGACCT TCCTTACCAC CAAGATGAAT GGCTTGCCTT GCAATTTCAT TGAAAGCTCT 5640 

ACTATAAGAT GATGTTCGAC TTGTTGCAGC AAGGATAATG GCTTTATCTA CTTCAAACTG 5700 

TTGATTGTAG AGTAGTTCCA TTGCTTGCAT ACCTCCAAGA CTTCCCCCTA TTAAAATATT 5760

AATCTTATCA TAACCAAGGG CTTGTATACC TCGTTCATT; GCTCTGACTA TATCTCTTAA 5820

TGTTAATTTT TTAGGAAAAT GAGGGTCGTT TAAAGGTGAA CTTGAACCGA AAGGACTACC 5880

AATAACATCA AATGTTAAAA ATTGATAATC GTGAATGGGT ATATATCCCC CATCAATAAT 5940

TTCTCGCCAC CAACCCGGAT AATCATCTGT TCCATATGTT AAATGATTGC CAGTTAATGC 6000

ATGACAAACT ACAACTAATG GTTGTCCATG ATAACCGACA TGCTCATATC TCAAACGCAA 6060

GTnATCTATG ACTTCCCCAG ATTCTGTAAT AAATTCCCCT AAATTTAAAG TATCTACTGT 6120 

GTAATTTGTC ATTGTTCTTT CCTCCTTAAA CAAAAAAACT TCTCACCCTA TTGAAAAGTA 6180

AGAAGTCTTT ATACTTATCA TTCGAGTAAC TCGTTGGTTT TAGCACCGTG CTATAAAGTC 6240

GGTTGCTGAA GTATCACAGG G 6261
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1222 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

ATGCGATTAA CTCTGGAAAT ATCTTTTCCA TATTTACGTn TTAAATTATT CAGCAAATTC 60 

ATACGAGaTT CATACTCGTT yAACACTTGT TCGTCGAATT CTGATTAGC CATTTCATCA 120 

TATAACTCAT GTTTTGCATC TTCTAAAATG TAGTAAAATT GATCAATATC TTCTTTTAAT 180 

TTGTCATATT TGTTTGGAAC TATATCGTTT ATTGTTAACA AATGGTTGCT TAGTTCATAT 240 

AAACGATCAG TGATAGCATT TTCATCCGTT AATGTCATAT ATGCGTTATT AAGCGCTAAG 300

CTTAATTTTT CAGAGTTTTG AATGCGTTTA ATATCTATTT CAAGTTGCTC TATTTCGCCT 360

TCTTTTAGAT GTGCTTCAGA CAATTCTTCT AATTGGAATT TCATTAAATC TAAACGCTGT 420

AGCAATGCTT GGTCTGCTGA TTCTAAATCT TCTAACTCTT GCTTTTTGGC TTTATAATTT 480

TGAAAAGTTT GGTGATATTT ATCCmCAAA TCTTGATAAC GTGATTCTGC GTAATTATCC 540

AATAATGTTA AATGGTATTT TTGTTTCAAC AAAGACTGCG TTTCATGTTG GCCATGAATA 600 

TCTAATAATT CTTGCATAAC TTTTCGTAAA TCTTGTAAAG TAACTGTTTG ATTATTAATT 660

TTACAAAGAC TTTTACCAGA GCTGAAAATT TCCCGTTTAA CTAATAAAAA ATCTTCATCT 720

ACATCAATAT CCATATTTTT CAATATATGT ATAGCATCTT TACTCTCGTC AATATCAAAT 780

ATACCTTCGA TGACAGCCTT TTTTTCACCA TGTCTTACAA AATCAGATGA AGCTCTCATT 840

CCAATTAATT GTCCAATTGC ATCTATAATA ATTGACTTAC CTGAACCCGT TTCACCACTT 900

AAAACAGTTA AACCATCAGA AAATTGAATT TCTAATTCTT CAATAATAGC AAATTGCTTG 960

ATTGATAAGG TTTGTAACAT AAACTCATCG CATCCTTATA ACAAATTGAA AATTCTTGAC 1020 

TTGATTTCAT CACTTGCCTC TTTGCTTCGA CAAATAATTA AACAAGTATC ATCACCACAA 1080 

ATTGTGCCTA GTACTTCTTC CCAATTGATT TSSTCTAATA TAGCTCCAAT AGATTGTGCA 1140

TTACCAGGTA TGTTTTTAGA ACAAGTAAAT TATCAGTACC ATCTATATTA ACAAAGGAAT 1200

CCATTAAATA ACGTCCCAAT TT 1222 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TTTGTTATTA TTACnTnAAA TAATTGCATT ACTTTTTACT GATGGTACAA CTTTCATCC 60 

TAATAACATT GCTTCCCATT GTTCATCTGT AGGTGCTACT TCTGTAACAA CTTTCATTTC 120 

TTCTTTTGGC ACGACATAAT TGTCTTTATC TTGAACTAAA TATCCGCCAG ATACTGAAAC 180 
AAACTCTTCT TCGTTACTGT CTATAGTCAT ATCAATTTCT AATAATCTTA CATTCTTCTT 240

TTGTTTTAAA ATATCTAATG CTTCATCTGT AAATTTTGGT GCAATAATGA CTTCCAAAAA 300 

GATACTATGC AATTGCTCTG CTAACTCAGG TGTTACAGCT CGGTTTAATG CAACAATTCC 360 

ACCAAATATT GATTGACTAT CCGCTTCATA CGCATGTTGA AATGCTTGTT CTATCGTGTC 420 

ACCGATACCA ACACCACATG GATTCATGTG TTTAACGCA ACTGTAGCAG GTGTATCAAA 480

CTTTTTAACT AAAGCTAGTG TAGCATCTGC ATCTTTAATA TTGTTATAGC TTAATTGTTT 540

CCCATGTAAT TGTTTAGCGC CTGCAATCGT GTGCTTAGCA TTCGAAGTTC TCACAAAATA 600 

CGCTGATTGT TGTGGATTTT CTCCATATCT TAAAGTTTCT TTATCCCCTT TAAAGAAACG 660 

TACAATCGCT TCATCATATT CTGCAGTATG CTCAAAAACT TTAATCATTA ATGATTGTCT 720 

ATATGACTCA TCTAACGAAT CGTTTCTTAA TCGCGTCAAT ACTTCTTGAT AATCTGCCGG 780

ATGTACAATT GTTGTTACAT GTTTATAGTT TTTAGCTGCA GCACGTAACA TTGTTGGACC 840 

ACCAATATCA ATATTTTCSA TTGCTTCGTC CATCGTCACA TCAGGGTTTG CAACAGTTTG 900 

TTGGAATGGA TATAAATTAA CTACTACCAT ATCAATTAAA TCTATATGTT GTTCTGATAA 960

TTCATTTAAA TGCTGCGGTT TATTTCGATC AGCTAAAATG CCACCATGAA CAGCCGGATG 1020 

T 1021 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3759 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

TCATTCACTC CTAAATTGTT ATTACACTAT TACACaTAGC TAATCATCAA TGTGAAATCA 60

CCTTCAAAGA CACTATCCAA ATCTTCAGAA GTCAAAATAA AGTTTGTACC AGTAGTCAGT 120

TTGAAAATTT CACCATCGAC AATCATTTGC CCTTCGCCTT CCAACACTGT AACTAAACAG 180

AACTCTCTAG GCTTCATATA ATTTAACGTG CCAGAAATTT CCCATTTAAC CAATGTAAAG 240

AAATCATTCG ATACAATGTG TGTACACTTA TGGTTTTCAA TAATTTCGCT TTCAGGCAAA 300 

ATATTAGGTA ATGGTGCATT GTACTGAATA ACGTCTAAAG CTTTTTCAAT ATTTAACGGT 360 

CTATCATTAT ATTGATTATC TTGACGATTG AAATCATAAA GTCTATATGT AATGTCTGAC 420

GATTGCATCG TTTCGTATGC TAATATTCCT GAAGAAATAG TATGTACTGt TCCTGCTGGa 480

ATAAAAtAGa ATTCyCCAGG kTTTACtTTA AtatATCyAA gTAtCGaCtC tATCGTTCCG 540 
TGTTGAACAT GATTCGCAAC TTCTTCTCTA GACTCTGCTA ATGTCCCtm AACTATTTCT 600

GCATCTTCTT CTGCATCTAT AATATACCAA CATTCAGATT TGCCATATTG CCCgTTTTCA 660 

TGCTCATAAG CATAAGAATT ATCAGGGTGC ACATGAATAG AAAGTGATTC TCTTGCATCC 720 

ACTATTTTAG TTAGAAGCGG AAAATCTTTG CTTGGGAAAT CACCAAACAA TTCACGATGT 780 

TCTGACCAAA TACGGTCTAA TGTTTGACCT TGATATGGTC CATTAATAAT CTCGCTCGTA 840

CCATTTGGAT GTGCTGACAC ACACCAACAT TCCCCCAGTT GTATCATTGT CTAATTGATA 900 

TCCAAACTCA CTTAGACGTT GACCGCCCCA TAATTTTGTT TTTAAAATTG GTTGTAAAAA 960 

TAATGGCATT GTTGCACCTC CATTGTGATT AAGTAAGCAA TAGAACTCTG ATGTTGTTGT 1020 

TCCATTATAT TTTGATTTTG TTCTCATTTA CATCGTATTA TTAACTTCCA CATTTCAAAT 1080

TAACTATTAG TGATTGTACC ATATTTACTA ACATTGCAGT ACTGCCAATT AAAAGnGCTT 1140

CACTTAAATT TACAGTACTT TAACATTTTC AAAAATTTAT AGCATAGAGA TTATAOTCT 1200

CTTACATTTG TACATATTTC CCTTTAAATT TACTCGCCCA TTATACCAAT TAATAaACAA 1260

CTTTAATAGT TGTGCCATAC ATTGTTCAAA TTCTTTGTAA AACGCATAGA CAATACGTAC 1320 

TTATTCATAC TTATAATTCA TCATTTTCAA AAAATAACGA GTTACGAAAA AGTAACCCGC 1380

TTCAAATCAT ATTTACTATC CTTATTAATC CGTTTCATTT TCAAATTGAG TTAAAGCATC 1440

TTTAATGTCC TGATCACCAC TAATAATTTG AAACTCTTGG TGATTAAAAT GATTGGATGT 1500

GACAATTTCT TTTAATACTG TCGCAACATC TTCTCTAGGA ATTTCACCTT TACCATCAAA 1560

ATATTGTGCA GCTTCTATCT TTCCAGATCC TGCTGCATT GTAAGTGCCC CTGGATGTAA 1620

AATTGTATAA TTCAAACCTG nAACGTCTTA AATAGTCATC AGCGTAATGT TTAGCTATTG 1680

TATATGGCTT TAAATCACCG CTATCATCAA AAGCCTGACG TCTCGAATCA TATGTTGAAA 1740

CCATGACATA GTGTTTAATA TTGGCCTCTT TACTCGCAAT CATTGATTTA ACAGCACCAT 1800 

CTAAATCGAC AATAATTGTT TTATCTGCAC CCGTGTTCCC TCCAGAACCT ACTGAAAAGA 1860

TAACTTTATC GAATGGTTTA AACGTCTCAG TTAAAGTCTC TATTGAATCA TTTTCAACAT 1920

CAACAAGAAT TGCTTTCATA CCTTGTGATT TTAACGCATT AAGTTGATCT GATTGCCTAA 1980

CACCAGCAGT AAATGGTACA TTTTCTTTTG CTAATTGTTG CACTAGTAAC GAACCTACAC 2040

CGCCATTAGC ACCTATAACC AAAATATTCA TTTACAACAC TCTCCTATkT ATTATTCTCT 2100

ATGCCATACC ACTTTATGAG ATATGTAAAA CTTGTTACAA CTATAAAAAT CAATTGACAT 2160 

ACTACTGGGA ACGTATTAAA TTAATATATG AACAAATATT CATAEAAAG GATTGTCATA 2220 

TAATGTCAGA ACAATATTCa GAAATAAATA CAGATaCATT aGAACGCgTA ACTGaAATTT 2280

tCaAGGCATT AGcGATTACA ATCGAATACG TATCaTGGAA TTGTTATCaG TCAGCGAAgC 2340

AAGTGTTGGT CACATTtCAC ATCAATTGAA TTTATCTCAA TCAAATGTCT CGCACCAATT 2400
AAAATTACTT AAAAGTGTGC ATCTTGTGAA AGCAAAACGA CAAGGCCAAT CAATGATTTA 2460

TTCATTAGAT GACATCCACG TAGCAACTAT GTTAAAGCAA GCCATACATC ACGCGAATCA 2520

TCCTAAAGAA AGTGGGTTAT AATATGTCTC ATTCACATCA TCATCATGAC CATATGCATA 2580

GTCATGTAAC TACAAATAAT AAGAAAGTAT TGTTTATATC GTTTTTAATA ATCGGTCTAT 2640

ATATGTTTAT CGAAATCATC GGCGGTCTCC TTGCTAACAG CTTGGCATTA CTATCTGACG 2700

GTATCCATAT GTTTAGCGAC ACATTCTCAT TAGGTGTTGC ACTTGTCGCA TTTATTTATG 2760

CTGAAAAGAA TGCCACAACT ACAAAAACAT TTGGTTATAA ACGTTTCGAA GSCTCGCAG 2820 

CGTTATTTAA CGGTGTAACG CTTTTTGTAA TAAGTATTTT GATTGTTTTT GAAGCGATTA 2880

AACGTTTCTT TGTTCCTTCT GAAGTTCAAT CAAAAGAAAT GTTAATCATT AGTATTATCG 2940

GTTTAATTGT CAATATCGTT GTTGCATTCT TTATGTTTAA AGGCGGCGAC ACTTCACACA 3000 

ATTTAAATAT GCGTGGTGCT TTTCTACATG TTATCGGAGA CTTATTAGGT TCAGTTGGCG 3060 

CCATTACTGC AGCTAkTTTA ATTTGGGCAT TTGGATGGAC AATCGCCGAT CCTATCGCAA 3120 

GTATTTTAGT TTCCGTTATT ATTTTAAAAA GTGCTTGGGG TATCACAAAA TCTTCAATTA 3180 

ACATTTTAAT GGaAGGCACA CCAAGTGATG TTGAATAGA TGAAGTTATA ACTACTATTA 3240

AAAAGGATTC ACGAATACAA AGTGTGCATG ATTGCCATGT TTGGACAATT TCAAATGATA 3300 

TGAATGCATT GAGTTGTCAT GTTGTTGTAG ACCATACATT GACAATGAAA GAATGTGAAT 3360 

TATTATTAGA AAaCATTGAG CATGATTTAT TACATTTAAA TATTCACCAT ATGACTATTC 3420

AATTAGAAAC GCCTAATCAC AAACATGATG AATCGATTAT ATGTTCAGGA ACACATAGTC 3480

ATTCACATAA CCATCATGCT CATCATCACG CGCATGTACA TTAATAATTT TAACCTACTG 3540

CCATTGCATC GATTAAACTT TTCAATGGCA GTAGGTTTTT TATGTCTTTA TGGCGACTTG 3600

TTTGGTCTTT GATGATGCAA TGTTTATTAA CAAATTTTCA ACTATTATTT CTTACATTAG 3660

TCATATTTTT GACAATTTAC TATTATAATT CTCTAACTTT AGTCACTTTA ATTAATTTTT 3720 

ATTAGATATT AATATGAAAA TAACGTGTTT TTTGTTATT 3759
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13086 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

TAATTATCGC GCATAACAAA ACATTAGCAG GACAATTAA TAGTGAGTTT AAAGAATTTT 60 
TTCCTGAAAA CAGGGTGGAA TACTTTGTAA GTtACTATGA TTATTATCAn CCAGAGGCAT 120

ACGTACCGTC TACTGACACT TTTATTGAAA nAGATGCCTC AATCAnTGAT GAAATTGATC 180

AACTACGACA TTCTGCTACA AGTGCATTAT TTGAACGCGA TGATGTAATT ATTATTGCTA 240

GTGTAAGTTG TATATATGGT TTAGGTAATC CTGAAGAATA TAAAGATTTA GTAGTAAGTG 300

TTCGAGTTGG TATGGAAATG GATAGAAGTG AATTACTTAG AAAACTTGTc AGATGTGCAA 360

TATACACGAA ATGACATCgA TTTcCAACGA GGAACGTTTC GAGTGCGTGG TGATGTAGTG 420 

GAAATATTCC CAGCCTCTAA AGAAGAACTT TGTATAAGGG TTGAGTTTTT CGGCGATGAG 480

ATTGACCGTA TCCGAGAAGT TAACTACCTA ACAGGTGAAG TGTTGAAAGA AAGAGAACAT 540

TTTGCGATAT TCCCAGCTTC TCACTTCGTA ACACGTGAAG AAAAGTTGAA AGTTGCGATT 600

GAACGTATTG AAAAAGAATT GGAAGAACGA TTGAAAGAAT TACGBATGA GAATAAATTA 660 

CTAGAAGCGC AAAGGTTAGA ACAGCGTACC AACTATGATT TAGAAATGAT GCGAGAGATG 720

GGATTCTGTT CAGGAATTGA AAACTATTCC GTACATTTAA CTTTGCGACC ACTGGGTTCG 780

ACACCATATA CTTTATTGGA TTACTTTGGC GATGATTGGT TAGTAATGAT TGATGAATCA 840

CATGTGACAT TACCGCAAGT TCGAGGCATG TATAACGGAG ACAGAGCGCG TAAACAAGTT 900 

TTGGTGGATC ATGGGTTTAG ATTACCGAGT GCATTAGATA ACCGTCCACT TAAATTTGAA 960 

GAATTTGAAG mAAAGACAAA ACAACTTGTG TATGTATCTG CAACGCCTGG ACCATACGAA 1020

ATTGAACATA CGGATAAGAT GGTTGAJCAA ATTATTCGTC CTACTGGTTT ACTGGATCCT 1080

AAGATTGAGG TTAGACCTAC TGAAAATCAA ATTGACGATT TATTAAGTGA AATTCAAACA 1140

AGAGTgAGCG TAATGAACGC GTACTTGTTA CAACGCTCAC TAAAAAGATG AGTGAAGATT 1200 

aACCACATAC ATGAAAGAaG CGGGTATTAA aGTtAATTAT CTGCATTCAG AATCAAGAC 1260

ATTAGAACGA ATTGAAATAA TTAGAGACTT ACGAATGGGT ACATATGATG TTATCGTAGG 1320 

TATTAATTTA TTAAGAGAGG GTATTGATAT ACCAGAAGTT TCTCTAGTTG TCATATTAGA 1380 

TGCAGATAAA GAAGGGTTTT TACGTTCTAA CCGCTCATTA ATTCAAaCAA TAGGTAGAgC 1440 

TGCGCGTAAC GATAAaGGTG AAGTCATTAT GTATGCCGAT AAAATGACTG ATTCGATGAA 1500

GTATGCAATT GATGAGACAC AACGTCGTCG AGAAATACAG ATGAAACATA ATGAAAAACA 1560

TGGTATTACA CCTAAAACAA TTAATAAAAA AATACATGAT TTAATTAGTG CTACTGTTGA 1620 

AAATGACGAA AATAATGACA AAGCACAAAC TGTCATACCT AAGAAGATGA CGAAAAAAGA 1680

ACGTCAAAAG ACAATCGACA ATATAGAAAA AGAAATGAAA CAAGCAGCGA AAGATTTAGA 1740

TTTCGAGAAA GCTACAGAAT TAAGAGATAT GTTATTTGAA TTAAAAGCAG AAGGGTGACA 1800

AGTAAATGAA AGAACCATCC ATAGTAGTAA AAGGTGCTCG TGCGCATAAC TTGAAAGATA 1860

TTGATATCGA ACTACCTAAA AaTAAATTAA TTGTTATGAC AGGTTTATCT GGGTCAGGTA 1920
AATCGTCATT AGCATTCGAT ACTATATATG CTGAAGGACA ACGACGTTAT GTTGAATCAT 1980

TAAGTGCCTA TGCGCGTCAA TTTTTAGGCC AAATGGACAA ACCAGATGTT GATACAATTG 2040 

AAGGATTATC GCCAGGAATT TCAATAGATC AAAAAACAAC AAGTAAAAAT CCAAGATCAA 2100

CTGTAGCAAC AGTAACAGAA ATATATGATT ATATACGTTT GTTATATGCA CGTGTTGGTA 2160 

AACCTTACTG TCCAAATCAC AATATAGAAA TTGAATCGCA AACAGTACAA CAAATGGTTG 2220 

ACCGCATTAT GGAATTAGAG GCACGTACAA AGATTCAATT ffiTAGCACCT GTCATCGCTC 2280

ATCGTAAAGG TAGTCATGAA AAGCTAATCG AAGATATTGG TAAAAAAGGT TATGTACGTT 2340

TAAGAATCGA TGGCGAAATT GTTGATGTAA ATGATGTACC TACTTTAGAT AAGAACAAGA 2400

ATCATACAAT AGAAGTTGTT GTAGACCGAT TAGTTGTTAA AGATGGAATT GAAACACGAC 2460

TAGCTGACTC TATAGAAACT GCCTTAGAGC TTTCAGAAGG ACAATTAACA GTCGATGTCA 2520

TTGACGGGGA AGACCTTAAG TTTTCAGAAA GCCATGCTTG TCCTATATGT GGATTTTCAA 2580

TCGGAGAGTT AGAACCAAGA ATGTTTAGCT TTAACAGTCC TTTTGGTGCT TGTCCGACAT 2640

GTGATGGCTT AGGCCAAAAG TTMCAGTCG ATGTAGACTT GGTTGTTCCC GACAAAGATA 2700

AGACGCTAAA CGAAGGTGCA ATAGAACCTT GGATACCGAC GAGTTCTGAT TTTTATCCAA 2760

CATTGTTAAA ACGTGTTTGT GAAGTTTATA AAATCAATAT GGATAAACCT TTTAAAAAGT 2820

TAACAGAACG TCAACGTGAT ATTTTATTGT ATGGTTCTGG TGACAAAGA ATTGAATTTA 2880

CATTTACACA ACGTCAAGGT GGTACTAGAA AACGAACAAT GGTTTTCGAG GGTGTAGTTC 2940

CTAATATAAG TAGACGATTC CATGAATCTC CTTCAGAATA TACACGTGAA ATGATGAGTA 3000

AATATATGAC TGAACTACCT TGCGAAACTT GTCATGGAAA GCGATTGAGT CGTGAAGCkT 3060

TATCTGTTTA TGTAGGTGGT TTAAATATTG GTGAAGTAGT CGAATATTCA ATCAGTCAAG 3120

CGCTGAACTA TTATAAAAAC ATTGATTTGT CAGAACAAGA TCAAGCGATT GCAAATCAAA 3180

TATTGAAAGA AATTATTTCC CGACTCACTT TTTTAAATAA TGTGGGACTT GAATATTTAA 3240

CGTTAAACAG AGCTTCAGGT ACACTTTCAG GTGGTGAAGC ACAACGTATT CGATTAGCAA 3300 

CGCAAATTGG GTCGCGTTTG ACTGGTGTCT TATATGTATT AGATGAGCCA TCAATTGGAC 3360 

TGCATCAAAG AGATAATGAT CGATTAATTA ATACACTTAA AGAAATGAGA GATTTAGGAA 3420

ATACTTTAAT TGTAGTTGAA CACGATGATG ATACAATGCG TGCGGCTGAT TACTTfiTGG 3480

ATATAGGTCC TGGTGCTGGT GAACATGGAG GGCAGATTGT GTCTAGTGGT ACTCCTCAAA 3540

AGGTAATGAA AGATAAAAAA TCATTAACAG GACAATACTT GAGTGGTAAG AAACGTATTG 3600

AAGTACCTGA ATATCGCAGA CCGGCTTCAG ATCGTAAAAT TTCTATACGT GGAGCTAGAA 3660

GCAACAATCT TAAAGGGGTT GATGTGGACA TACCACTATC AATCATGACG GTTGTTACAG 3720 
GTGTATCAGG TTCTGGTAAA AGCTCATTAG TAAATGAAGT ATTATACAAA TCATTAGCTC 3780

AAAAAATTAA TAAATCTAAA GTAAAGCCAG GATTGTACGA TAAGATTGAA GGTATTGATC 3840

AACTTGATAA AATTATTGAT ATTGATCAAT CACCAATSG TAGAACGCCA CGCTCTAATC 3900 

CAGCAACATA TACTGGTGTG TTTGATGATA TACGTGATGT GTTTGCGCAA ACAAATGAAG 3960

CTAAAATTCG AGGATATCAA AAAGGGCGTT TTAGTTTTAA TGTAAAAGGT GGACGCTGTG 4020

AAgcTTGTAA AGGTGACGGT ATTATTAAAA TTGAAATGCA TTTTTTACCT GATGTTTATG 4080 

TTCCTTGTGA AGTGTGTGAT GGTAAACGAT ATAATCGTGA GACACTAGAG GTTACTTACA 4140

AAGGTAAAAA TATTGCTGAC ATTTTAGAAA TGACTGTTGA AGAAGCAACA CAATTTTTTG 4200

AAAATATTCC TAAGATTAAG CGCAAGTTAC AAACACTAGT TGATGTTGGT CTTGGATACG 4 2 60 

TCACATTAGG TCAACAAGCT ACAACGTTAT CAGGTGGTGA GGCTCAACGT GTGAaACTTG 4320

CATCTGAACT TCATAAACGT TCAACTGGTA AATCTATTTA TATCCTAGAT GAACCGACAA 4 380 

CAGGGTTACA TGTTGACGAT ATTAGTAGAT TATTAAAAGT ATTAAACCGA TTAGTTGAAA 4440 

ATGGTGATAC TGTTGTAATT ATTGAACATA ACCTAGATGT TATCAAACA GCAGACTATA 4 500 

TTATAGACTT AGGTCCTGAA GGTGGTAGTG GCGGTGGTAC TATTGTTGCG ACTGGCACAC 4 560 

CCGAAGATAT TGCTCAGACA AAGTCATCAT ATACAGGAAA GTATTTAAAA GAAGTACTTG 4620

AACGAGATAA ACAAAATACT GAAGATAAAT AAGATTAAAA GAAGTGAAGG ATGTTATAAA 4 68 0 

TTTATCCTTC GCTTCTTTTT ATTAATTTAG TAATGAATAG TAGAAAGAAA AGATGCGTAA 4740

AAAGAATTAT GTTAAGATAG GGTCAATCTA GAGTAGTTAA AC AT AAAT CG AACTGGGAGT 4 800 

GGGACAGAAA T G AT AAAGAA TCACTAATGA TTTATTATGT AGTGGTTCTT TGTCATTAGC 4 8 60 

CACAGCTATT GTGTACTTAA AAATAGGaat GCaTgAGTGC AACTCATGCA TAAGaAATAC 4 920 

TAATTTCTAA AGAAAAAG T A TTTCTTTATG TTGGGGCCCC GCCAACTTGC ATTGTTTGTA 4 980 

GAATTTCTTT TCGAAATTCT TTATGTTGGG GCCCCGCCAA CTTGCATTGT TTGTAGAATT 504 0 

TCTTTTCGAA ATTCTTTATG TTGGGGCCCC GCCAACTAAT TCCAATATAT C&TGTAGAG 5100 

CTTAGGTCAT TGATTTTTGG CTCGGACTTT TATGGCGATA TGAACCATGT AAAT T AAGCA 5160 

AGCAATAAAT TAATGATTGA TATTGACTTG TAAAATAATA ACAATAATGA ACAATTAATA 5220 

TTTATTTTAG CTTTTCAATG TAGATTGGTG TTATATTTTT GAT AT GAT AA GAAG AG AT G T 5280 

AAGAGTAGGG ATAAATACAA TTGAGGTGAA CCCATGTTAA C G AC AG AAAA AC TAGTTGAA 534 0 

ACATTAAAGT TAGATTTAAT CGCTGGTGAA GAAGGACTAT CGAAGCCAAT TAAAAATGCT 54 00 

GATATATCAA GACCGGGCTT AGAGATGGCA GGTTATTTTT CACATTATGC GTCAGATAGA 54 60 

ATACAACTAT TAGGAACAAC GGAACTATCG TTT'ACAATT TATTACCAGA TAAGGATCGC 5520 

GCAGGTCGTA TGCGTAAACT ATGCAGACCA GAAACGCCTG CAATTATTGT GACACGTGGA 55 8 0 
TTGCAGCCAC CAGAAGAATT AGTTGAAGCT GCAAAAGAAT TAAATACCCC ACTTATAGTT 5640

GCTAAAGATG CGACTACAAG TTTAATGAGT CGCTTAACAA CGTTTTTAGA GCATGCACTT 5700 

GCAAAGACGA CATCTTTACA TGGTGTTTTA GTAGATGTTT ACGGTGTTGG TGTACTAATT 57 60 

ACCGGTGATT CAGGAATAGG TAAAAGTGAG ACTGCGTTGG AATTAGTTAA ACGTGGGCAT 5 820 

AGATTAGTAG C AG AT G AT AA TGTAGAAATA CGTCAAATTA ATAAAGATGA ACTAATAGGG 58 8 0 

AAACCACCAA AGTTAATAGA ACATCTATTA GAAATACGTG GACTAGGTAT TATCAATGTT 594 0 

ATGACTTTAT TTGGCGCGGG TTCAATATTA ACT G AAAAAC GAAT TAG AT T AAATATTAAT 6000 

TTGGAAAACT GGAACAAGCA AAAGTTATAT GACCGCGTAG GTCTTAATGA AGAGACGCTA 60 60 

AGTATTTTAG ATACTGAAAT CACTAAAAAA ACAATACCTG 21AGACCTGG TAGAAATGTT 612 0 

GCGGTAATTA TTGAGGTCGC TGCAATGAAC TATCGATTAA ATATCATGGG CAT T AAC AC G 618 0 

GCCGAAGAAT T TAG T GAAAG ATTAAATGAA GAAAT TAT C A AGAAC AG T C A TAAGAGTGAG 62 4 0 

GAGTAGGTTG AATGGGTATT GTATTTAACT ATATAGATCC TGTGGCATTT AACTTAGGAC 6300

CACTGAGTGT ACGATGGTAT GGAATTATCA TTGCTGTCGG AATATTACTT GGTTACTTTG 63 60 

TTgCACAACG TGCACTAGTT AAAGCAGGAT TACATAAAGA TACTTTAGTA GATATTATTT 6420 

TTTATAGTGC ACTATTTGGA TTTATCGCGG CACGAATCTA TTTTGTGATT TTCCAATGGC 6480 

CATATTACGC GGAAAATCCA AGTGAAATTA TTAAAATATG GCATGGTGGA ATAGCAATAC 654 0 

ATGGTGGTTT AATAGGTGGC TTTATTGCTG GTGTTATTGT ATGTAAAGTG AAAAATTTAA 6600 

ACCCATTTCA AATTGGTGAT ATCGTTGCGC CAAGTATAAT TTTAGCGCAA GGAATTGGAC 6 6 60 

GCTGGGGTAA CTTTATGAAT CACGAGGCAC ATGGTGGATC GGTGTCACG GCTTTTTTAG 6720 

AACAATTACA TTTGCCTAAT TTTATAATAG AAAATATGTA TATTAACGGC CAATATTATC 67 80 

ATCCAACATT CTTATATGAA TCCATTTGGG ATGTCGCTGG ATT T ATT AT C TTAGTTAATA 68 4 0 

TTCGTAAACA TTTAAAATTA GGAGAAACAT TCTTTTTATA TTTAACTTGG TATTCAATTG 6900 

GTCGATTCTT TATAGAAGGA TTACGTACAG ATAGCTTAAT GCTCACAAGT AAT AT TAG AG 6 960 

TTGCACAATT AGTATCAATT CTTTTAATTT TAATAAGTAT AAGTTTAATT GTATATAGAA 7 02 0 

GGATTAAGTA TAATCCACCG TTGTATAGCA AAGTTGGGGC GCTTCCATGG CCAACAAAAA 7 080 

AAGTGAAGTA GTGATAGTTT GAGGAAATTT TTATCAAAAA CACATCATCA TACAAACCCT 714 0 

TTATGGCGTG TATACCGTCT TGTTAAATTT TCGAAAGTTT TTAAGAATGT AATTATCATT 7200 

GAATTTTCGA AATTTATTCC AAGTATGGTA CTGAAAAGAC ATATATATAA ACAACTTTTA 7 2 60 

AAT ATT AAT A TCGGTAATCA ATCGTCGATA GCTTATAAAG TAATGTTAGA TATTTTTAC 7 320 

CCAGAACTGA TTACGATTGG TAGTAACAGT GTTATTGGTT ACAATGTAAC AATTTTGACG 7 38 0 
CATGAAGCAT TAGTTGATGA ATTTCGTTAT GGACCAGTGA CGATAGGATC TAACACTTTG 7440

ATTGGTGCAA ATGCTACCAT TTTACCCGGT ATAACGATTG GTGACAATGT AAAAGTTGCA 7 500 

GCTGGTACGG TTGTTTCAAA AGATATACCG GATAATGGAT TTGCATATGG CAACCCTATG 7 5 60 

TATATAAAAA TGATTAGGAG GTGACAATTT TATGGCGCAA AAGAATAATA ATGTAATTCC 7 620 

AATGACTTTT GATGATGCAT TTTATCGTAA AATGGCTAAA CAGAAGTTTA AACAAAGAGA 7 680 

ATATAAACGA GCTGCTGAAT ACTTTGAAAA AGTGTTA&A TTGTCACCTG ATGATCTGGA 77 4 0 

AATTCAAATT GATTATGCAC AATGTCTAGT GCAACTTGGT ATTGCTAAAA AAGCAGAACA 7 800 

TTTATTTTAT GACAAT AT T A TTTATAATAG GCATCTAGAA GATAGCTTTT ATGAATTGAG 7 8 60 

TCAGCTCAAC ATTGAAGTTA ACGAACCAAA CAAGGCATTC TTGTTTGGTA TTAATTATGT 7 92 0 

TATTGTTAGC GACGACCAAG ATTATAGAGA TGAATTAGAT CAAATGTTTG ATGTGAAATA 7 98 0 

TCAAAGTGAA GAACAAATTG AACTTGAAGC TCAATTGTTT GTAGTTCAAA TACTATTCCA 8 04 0 

ATATCTTTTT TCTCAAGGTC GATTAAAAGA TGCAAAGAAT TATGTCTTAC ATCAACCACA 8100 

AGAAGTTCAA GATCATCGTG TAGTACGTAA TTTATTGGCA ATGTGTTATT TATATCTCGG 8160 

TGAATATGAT ACgGCTAAAG CATTGTACGA aGCACtATTA CAAGAGGATA GTACaGATAT 8220 

ATATGCATTA TGCCATTATA CTTTGCTACT TTATAACACT AAGGAAAATG AACAATATCA 8280 

AAAATATTTA AAAATATTAA ACAAAGTTGT ACCTATGAAT GACGffiGAAA GTTTTAAATT 8340 

AGGTATTGTA TTAAGTTATT TAAAGCAGTA TCGTGCATCA CAACAATTGT TGTACCCTTT 8 4 00 

ATATAAAAAA GGGAAATTTT TATCAATTCA AATGTACAAT GCTTTAGCAT ATAATTATTA 84 60 

TTATTTAGGT GAAGAAGACG AAAGTCATTA CTACTGGGAT AAATTGAAGC AAATTTCTAA 8 520 

AGTGGAAATT GGACATGCGC CTTGGGTAAT TGAAAATAGC AAAGAAGTTT TTGACCAACA 8 58 0 

TATTTTGCCA TTACTTCAAA GTGATGACAG TCATTATCGT TTATATGGTA TTTTTTTATT 8 64 0 

GGATCAATTA AATGGTAAAG AAATTGTGAT GACGGAAAGT AT T T GGCAGG TTTTGGAAAA 8700 

TCTAAATAAT TATGAGAAAT TGTATTTAAC GTATTTAGTT CAAGGTTTAA CGCTCAATAA 8 7 60 

ATTAGACTTC ATTCATCGCG GCTTATTAAC GCTTTACCAT AATGAATTAT TTGTAAGTGA 8820 

AAATGATGTA ATGGTTGCAT GGATTAATCA AGGTGAACTC ATAATTGCTG AAAAAG T AG A 8880 

TTTAACTGAT GTTGAGCCAT ATATCGGTGC GTTTATTTAT TTGTATTTTA AAATCAACC 8 94 0 

TCGAAACGTT ACAAAGAAGC AAATTACAAC ATGGTTAGGC ATAACACAAT ATAAACTGAA 9000 

CAAAATGATT GAATTTCTCT TGAGCATATA GATTTATGAA AAGTTAGATT TATTATATAA 9060 

TGCGCATAAT GATTAATAAT GAGGAGGCGT TAATAAAATG ACTGAAATAG ATTTTGATAT 9120 

AGCAATTATC GGTGCAGGTC CAGCTGGTAT GACTGCTGCA GTATACGCAT CACGTGCTAA 918 0 

TTTAAAAACA GTTATGATTG AAAGAGGTAT TCCAGGCGGT CAAATGGCTA ATACAGAAGA 92 4 0 
AGTAGAGAAC TTCCCTGGTT TCGAAATGAT TACAGGTCCA GATTTATCTA CAAAAATGTT 9300

TGAACACGCT AAAAAGTTTG GTGCAGTTTA TCAAATGGA GATATTAAAT CTGTAGAAGA 93 60 

TAAAGGCGAA TATAAAGTGA TTAACTTTGG TAATAAAGAA TTAACAGCGA AAGCGGTTAT 9420 

TATTGCTACA GGTGCAGAAT ACAAGAAAAT TGGTGTTCCG GGTGAACAAG AACTTGGTGG 94 8 0 

ACGCGGTGTA AGTTATTGTG CAGTATGTGA TGGTGCATTC TTTAAAAATA AACGCCTATT 954 0 

CGTTATCGGT GGTGGTGATT CAGCAGTAGA AGAGGGAACA TTCTTAACTA AATTTGCTGA 9 600 

CAAAGTAACA ATCGTTCACC GTCGTGATGA GTTACGTGCA CAGCGTATTT TACAAGATAG 9 6 60 

AGCATTCAAA AATGATAAAA TCGACTTTAT TTGGAGTCAT ACTTTGAAAT CAATTAATGA 9720 

AAAAGAC GGC AAAGTG3GTT CTGTGACATT AACGTCTACA AAAGATGGTT CAGAAGAAAC 97 8 0 

ACACGAGGCT GATGGTGTAT TCATCTATAT TGGTATGAAA CCATTAACAG CGCCATTTAA 98 4 0 

AGACTTAGGT ATTACAAATG ATGTTGGTTA TATTGTAACA AAAG AT GAT A TGACAACATC 990 0 

AGTACCAGGT ATTTTTGCAG CAGGAGATGT T C GC G AC AAA GTTTACGCC AAATTGTCAC 9960 

TGCTACTGGC GATGGTAGTA TTGCAGCGCA AAGTGCAGCG GAATATATTG AACATTTAAA 10020 

CGATCAAGCT TAATTCGAAG TCGAATTAAG ATGTTGAGCT GTAAATTATT TGGATATTTA 10080 

TTTTAATAGT GTCATCACAG CGTTAAAATA ATGTCTTACT TTTAAATTAA AGCAAATTAT 10140

ATAGAAAACT AGAACTTAGT ACGTATCATT TGTGCGTTTC AAT GAGTTCT AGTTTTTTTA 10200 

TATGTTATAT TAAACTTATA ACTTTATGGG AGTGGGACAG AAATGATAAA GAGCCACTAA 10260

TGATTTATTA TGTAGTGGTT CTTAAACATT AGCCACAGCT AATGTGTACT TAAAAATAGG 10320

AATACATGAG TAAAACTCAT GCATAAGAAA TACTAATTTC TATAGAAAAA GTATTACTTT 10380 

ATCGTTGTCC CACCCCAACT TGCACATTAT TGTAAGCTGA CTTTCCGCCA GCTTCTGTGT 104 4 0 

TGGGGCCCCG CCAACTTGCA CATTATTGTA AGCTGACTTT TCGTCAgCTT CTGTGTTGGG 10500 

GCCCCGCCAA CTTGCACATT ATTGTAAGCT GACTTTTCGT CAGCTTCTtB GTTGGGGCCC 10560 

CGCCAACTTG CATTGTCTGT AGAAATTGGG AATCCAATTT CTCTATGTTG GGGCCCACAC 10 620 

CCCAACTCGC ATTGCCTGTA GAATTTCTTT TCGAAATTCT CTGTGTTGGG GCCCACACCC 10680 

CAACTCGCAT TGCCTGTAGA ATTTCTTTTC GAAATTCTCT GTGTTGGGGC CCACACCCCA 10740 

ACTCGCATTG CCTGTAGAAT TTCTTTTCGA AATTCTCTGT GTTGGGGCCC CTGACTAGAG 10800

TTGAAAAAAG CTTGTTGCAA GCGCATTTTC ATTCAGTCAA CTACTAGCAA TATAATATTA 108 60 

TAGACCCTAG GACATTGATT TATGTCCCAA GCTCCTTTTA AATGATGTAT ATTTTTAGAA 10920 

ATTTAATCTA GACATAGTTG GAAATAAATA TAAAACATCG TTGCTTAATT TTGTCATAGA 10980 

ACATTTAAAT TAACATCATG AAATTCGTTT TGGCGGTGAA AAAATAATGG AT AAT AAT GA 11040 
AAAAGAAAAA AGTAAAAGTG AACTATTAGT TGTAACAGGT TTATCTGGCG CAGGTAAATC 11100

TTTGGTTATT CAATGTTTAG AAGACATGGG ATATTTTTGT GTAGATAATC TACCACAGT 11160 

GTTATTGCCT AAATTTGTAG AGTTGATGGA ACAAGGAAAT CCATCCTTAA GAAAAGTGGC 112 2 0 

AATTGCAATT GATTTAAGAG GTAAGGAACT ATTTAATTCA TTAGTTGCAG TAGTGGATAA 112 8 0 

AGTCAAAAGT GAAAGTGACG TCATCATTGA TGTTATGTTT TTAGAAGCAA GTACTGAAAA 11340 

ATTAATTTCA AGATATAAGG AAACGCGTCG TGCACATCCT TTGATGGAAC AAGGTAAAAG 114 00 

ATCGTTAATC AATGCAATTA ATGATGAGCG AGAGCATTTG TCTCAAATTA GAAGTATAGC 114 60 

TAATTTTGTT ATAGATACTA CAAAGTTATC AC C T AAAGAA TTAAAAGAAC GCATTCGTCG 11520 

AT AC TAT G AA GATGAAGAGT TTGAAACTTT TACAATTJ&.T GTCACAAGTT TCGGTTTTAA 11580 

ACATGGGATT CAGATGGATG CAGATTTAGT ATTTGATGTA CGATTTTTAC CAAATCCATA 11640 

TTATGTAGTA GATTTAAGAC CTTTAACAGG AT T AGAT AAA GACGTTTATA ATTATGTTAT 117 00 

GAAAT GG AAA GAGACGGAGA TTTTCTTTGA AAAATTAACT GATTTGTTAG ATTTTATGAT 11760 

ACCCGGGTAT AAAAAAGAAG GGAAATCTCA ATTAGTAATT GCCATCGGTT GTACGGGTGG 118 20 

AC AAC AT C G A TCTGTAGCAT TAGCAGAACG ACTAGGTAAT TATCTAAATG AAGTATTTGA 118 80 

ATATAATGTT TATGTGCATC ATAGGGACGC ACATATTGAA AGTGGCGAGA AAAAATGAGA 11940 

CAAATAAAAG TTGTACTTAT CGGTGGTGGC ACTGGCTTAT CAGTTATGGC TAGGGGATTA 12000 

AGAGAATTCC CAATTGATAT TACGGCGATT GTAACAGTTG CTGATAATGG TGGGAGTACA 12060 

GGGAAAATCa GAG AT GAAAT GGATATACCA GCACCAGGAG AC AT C AGAAA TGTGATTGCA 12120 

GCTTTAAGTG ATTCTGAGTC AGTTTTAAGC CAACTTTTTC AGTAEGCTT TGAAGAAAAT 12180 

CAAATTAGCG GTCACTCATT AGGTAATTTA TTAATCGCAG GTATGACTAA TATTACGAAT 12240 

GATTTCGGAC ATGCCATTAA AGCATTAAGT AAAATTTTAA ATATTAAAGG TAGAGTCATT 12300 

CCATCTACAA ATACAAGTGT GCAATTAAAT GCTGTTATGG AAGATGGAGA AATTGTTTTT 123 60 

GGAGAAACAA ATATTCCTAA AAAACATAAA AAAATTGATC GTGTGTTTTT AGAACCTAAC 124 20 

GATGTGCAAC CAATGGAAGA AGCAATCGAT GCTTTAAGGG AAGCAGATTT AATCGTTCTT 124 80 

GGACCAGGGT CAT TAT AT AC GAGCGTTATT TCTAACTTAT GTGTGAATGG TATTTCAGAT 125 4 0 

GCGTTAATTC ATTCTGATGC GCCTAACETA TATGTTTCTA ATGTGATGAC GCAACCTGGG 12 600 

GAAACAGATG GTTATAGCGT G AAAG AT y AT ATCGATGCGA TTCATAGACA AGCTGGACAA 12 660 

CCGTTTATTG AT TAT G T CAT TTGTAGTACA CAAACTTTCA ATGCTCAAGT TTTGAAAAAA 12720 

TAT G AAG AAA AAC AT T C T AA ACCAGTTGAA GTTAATAAGG CTGAACTTGA AAAGAAAGC 127 8 0 

ATAAATGTAA AAACATCTTC AAATTTAGTT GAAATTTCTG AAAAT CAT T T AGTAAGACAT 12840 

AAT AC T AAAG TGTTATCGAC AATGATTTAT GACATAGCTT TAGAATTAAT TAGTACTATT 12 900 






CCTTTCGTAC CAAGTGATAA ACGTnAATAA T AT AG AAC G T AATCATATTA TGATATGATA 12 960 

ATAGAGCTGT GAAAAAAATG AAnATAGACA GTGGTTCTAA GGTGAATCAT GTTTTAAATA 13020 

AGAAAGGAAT GACTGTACGA TGAGCTTTGC ATCAGAAATG AAAAATGAAT TAACTAGAAT 13080 

AGACGT 13086
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CATTAGTCAT GAAAATAGCC GACAACTTCA TJTGTGAAAT CACCGGCCTT TTATTTTAGC 60 

TAACTTTATT TCTGATTTTA CGATTTTAAT TGATCATACA GAGAAAGTGA TCTTTTTACA 120 

ATTTCTAAAA ACTCATGATC TATATTGGAC ATTTGATGAA AATAAGACAA AATGTTTTCT 180 

GTTAGCTTCT CTTGTTTTGG GAATGAATCA TCTTCTTTAA TCCAAATCGC TAATTQG3CT 24 0 

AATGGTGTTT TATCATCTTT AAATGTTTGT ATATATTCGT AAAAGC T CAT AGTATTCCTT 300 

CTCTCAATTT ACTTATATAA ATCCTACCAC GAAAGCTTTC AAGAAAACAC AATTAAATGT 360

CTATTTAGTG AACTTTTTAA GGTTGTGCAC TCTTTTAATG TCTGCCAATT AGGTCAATTA 4 20 

ATCATCACAA TGTACAATTA ACTCTATTTT CAGTTCATAT ACTCACACAC CGTTTTTGAA 480

CAACACATTA ACTTCTCATT TAGATAAAAC GCAAAAAAGC CTGGCACCAA TACAATAGAT 54 0 

GCCAGACTAA GAGTCTACTA TATAAATTTA TTTAGCGTAT GGTTTTACTT CGATTGCACC 600 

TTCATTTTCA T C AT G AAC AC CATGCTTATA ATAATCAAA TATTGTGGCT CTAAAGGCTT 660 

TCTGCCACGT ATAATGTCTG CTGCTTTTTC AGCTAACATT AAAACAGGTG CGTGTATATT 720 

GCCATTTGTC GTACGTGGCA TAGCTGATGC AT C AAC T AC A CGTAAATTTT CCATACCGTG 7 80 

GACTTTCATT GTTAACGGGT CAACTACTGC CATTGGATCT GAAGCAGGAC CCATTTTAGC 840

ACTACAAGAT GGGTGTAATG CTGTTTCACC ATCTCTACGA ACCCAATCAA GAATTTCTTC 900 

GTCTGTTTGC ACTTCTGGTC CTGGTGAAAT TTCTCCACCA TTGAATGGAT CCATTGCTTT 960 

TTGAGATAAG ATATTTCTTG CTACACGAAT TGCTTCTACC CATTCTTTTT TATCTTCTTC 1020 

TGTTGATAAA TAATTAAAGC GGATACTTGG TTTTTCGAAT GGATCTTTAG ATTTGATTTT 1080 

CAAGCTACCA CGAGAGTTTG AATACATTGG TCCTACGTGA ACTTGATAAC CATGTGCGAC 114 0 

CGCTGCCTTT TGACCATCAT ATCTTACAGC TATTGGTAAG AAATGGAACA TTAAGTTAGG 1200 
ATAAtCAACT TCGTTATTTG AACGTACAAA TCCGCCACCT TCAAASGGT TAGATGCTGC 1260

TGCACCTGTA CGTGTGAAAA TCCATTGTAA ACCAATAAAT GGcATGCGCT TGAtATCTAA 1320

GCTTGGCtGt AATGATACAG GTTCCTTACA 1350



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1376 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

TAATGCTATT GGCAACACCA TATATGAAAn CTCCAAACGA TCCTAAACCG ACTATAGATT 60 

CACCAAATTT nACAATCCAT GAATAAAGTA GTGGCCATAA GAATAACAAT ATGACAACTA 120

AAAATGTACA GTAAAATGCA GTCATAATTG GAACTAGACG TTTACCACTA AAAAAT GAT A 18 0 

ATGCTAATGG TAATTCTGTT TCACTAAACT TATTGTATGC ATAAGCTGCT ATTAAACCTA 24 0 

TTACAATACC AACAAAGACA TTGCGVTTAT TCATCTTTTC AAAAGCTGAA TTTATTTCCG 30 0 

ArGCTTTCAT TCCTAATAAA GGCGCTAATT TCATTGGTGA TAATACAACT GTAACTAAAA 360 

AATATCCTAA CGTrGCTGCA rGCGsGACTG CACCATCATT TTTCTTTGCC ATTCCTATAG 4 20 

CTACACCAAT TGCAAATAAA ATACCTAATT GCTCTAAAAT CGTAGTACCT ACCGTAGTAA 480

AGAACATTGC GATTTTCGGC GTCGCATGAA GTGCATTTAA CGTATTACCA ATTCCGGCAA 54 0 

TAATTGCTGC AGCCGGTAAA ATGGCAACTG GTAACATTAA CGAACGCCCT AAATTTTGGA 60 0 

AAAATTTATA CATTGAATGT CATCCTTCTT AAAATAATGT AGAAATATAA AGATTACTAA 660 

TGTAACTAGA ATAACTACTT CGATACTCCG TTATAGTCAC CTAGGCTTAC TAACCAGCTA 72 0 

TATTTCTACC TCAAGTTATT TTATAAACTT TTTACAATTT CATGCAATTC TTGTTGTAAC 780 

TTTGCTGTTC GTGTTTCAAT CTCTTTTGTA ATATAATCGA TACGCTCGTT TCGTTTTAAA 84 0 

TCTTTAGGTA AATCGTTAAT ATCGATTGGk TipCAATATT TATGTATGCT TGTCCTGTTA 900 

AAAGACCGTG AATCTTAGTA GGACCAACAT AAGCAACAGG TAATATTGGT GACTTACTTA 960 

ACATTGCAAT TGTTGAAGCA CCaCGTTTCA AAGGTGCACC TTCTTGCGAT GTGCGAGAAC 1020 

CTGTTGGGAA GATACCAACT GTCTTATTAT CTTTCAACAA ATTGATTGGG CGTTTTAfflG 1080 

TACTAGGTCC TGGATTTTCA CGATCTACAG GAAATGCATT TAAAGACGTT AAAAAT T T AC 114 0 

CAATCCATTT ATTTTTGAAT AATTCTTTTT T AGC CAT AT A ATGAATTTGA TTAGGATATA 1200 

ATGCCATACC TAGCATAATG ACTTCGTTAT AACTTTCATG CGTACAAGTT ACGACATATT 12 60 

TACTATCCTT AGGAATATTA TCTTTACCGA TTACGTATAA TGATTTTGAC ATTTTAACTA 132 0 
AAATGAAATT CAAAATCTTA CTAATCACTG AATACATTGT GCCACCTACT TAACTT 1376



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7363 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

TTGTCATACC AATATTTTGT AAAAT AT GGA ACACAAGTAA AGTGACGAAA CCAACGATAA 60 

AGATTTTGTT AAATTGATCT TCAATTTTCG CAGCTAiTCT TATTAGATGG AAGATTAAAA 120 

ATAAAAATAT TAAGATCAAT AT GAC AG AAC CGATAAAGCC AAGTTCCTCT CCAATCACTG 18 0 

AAAAGATAAA GTCAGTATGA TTTTCAGGTA TATAAACTTC ACCGTGATTG TATCCTTTAC 24 0 

CTAGTAACTG TCCAGAACCG ATAGCTTTAA GTGATTCAGT TAAATGaTAG CCATCACCAC 300 

TACTATATGT ATAGGGGTCA AGCCATGAAT TGATTCGTCC CAT T T GAT AC AGTTGGaCAC 3 60 

CTAATAAATT TTCAATTAAT GCGGGTGCAT ATAGaATACC TAAAATGACT GTCATTGCAC 4 20 

CAACaATACC TGTAATAAAG ATAGGTGCTA AGATACGCCA TGTTATACCA CTTACTAACA 4 80 

TCACACCTGC AATAATAGCA GCTAATACTA ATGTAGTTCC TAGGTCATTT TGCAGTAATA 54 0 

TTAAAATACT TGGTACTAAC GAGACACCAA TAATTTTGAA AAATAATAAC AAATCACTTT 600

GGAATGATTT ATTGAATGTG AATTGATTAT GTCTAGAAAC GACACGCGCT AATGCTAAAA 660

TTAAAATAAT TTTCATGAAT TCAGATGGCT GAATACTGAT AGGGCAAAC GTGTACCAAC 720 

TTTTGGCACC ATT GAT AAT A GGTGTAATAG GTGACTCAGG AATAACGAGC AAGCCTATTA 78 0 

ATAATAGACA GAT T AAG AAA TACAATAAAT ATGTATAATG TTTAATCTTT TTAGGTGAAA 840 

T AAAC AT GAT GATACCTGCA AAAAT T GCAC CTAAAATGTA ATAAAAAATT TGTCTGATAC 900 

CGAAATTAGC ACTGTATTGA CCACCGCCCA TTGCCGAGTT AATAAGCAGA AC AC T G AAAA 960 

TTGCTAAAAC AGCTATAGTG GCTACTAATA CCCAGTCTAC TTTGCGAAnC aATGCTTATC 102 0 

CGGCTGTTGA CGAGATGAAT AATTCATTGC AAACTCCTTT TATACTCACT AATGTTTATA 1080 

TCAATTTTAC ATGACTTTTT AAAAATfAGC TAG AAT AT C A CAGTGATATC AGCTATAGAT 114 0 

TTCAATTTGA ATTAGGAATA AAATAGAAGG GAATATTGTT CTGATTATAA AT G AAT C AAC 12 00 

ATAGATACAG ACACATAAGT CCTCGTTTTT AAAATGCAAA ATAGCATTAA AATGTGATAC 12 60 

TATTAAGATT CAAAGAT GCG AATAAATCAA T T AAC AAT AG GACyAAATCA BATTAATTT 132 0 

ATATTAAGGT AGCAAACCCT GATATATCAT T G G AG G AAAA CGAAATGACA AAAGAAAATA 138 0 
TTTGTATCGT TTTTGGAGGG AAAAGTGCAG AACACGAAGT ATCGATTCTG ACAGCACAAA 1440

AT G TAT T AAA TGCAATAGAT AAAGACAAAT ATCATGTTGA TATCATTTAT ATTACCAATG 1500 

ATGGTGATTG GAGAAAGCAA AATAATATTA CAGCTGAAAT TAAATCTACT GATGAGCTTC 15 60 

ATTTAGAAAA TGGAGAGGCG CTTGAGATTT CACAGCTATT GAAAGAAAGT AGTTCAGGAC 162 0 

AACCATACGA TGCAGTATTC CCATTATTAC ATGGTCCTAA TGGTGAAGAT GGCACGATTC 1680 

AAGGGCTTTT TGAAGTTTTG GATGTACCAT ATGTAGGAAA TGGTGTATTG TCAGCTGCAA 17 4 0 

GTTCTATGGA CAAACTTGTA ATGAAACAAT TATTTGAACA TCGAGGGTTA CCACAGTTAC 1800 

CTTATATTAG TTTCTTACGT TCTGAATATG AAAAAT AT GA ACATAACATT TTAAAATTAG 18 60 

TAAATGATAA ATTAAATTAC CCAGTCTTTG TTAAACCTGC TAACTTAGGG TCAAGTGTiA 192 0 

GTATCAGTAA ATGTAATAAT GAAGCGGAAC TTAAAGAAGG TATTAAAGAA GCATTCCAAT 198 0 

TTGACCGTAA GCTTGTTATA GAACAAGGCG TTAACGCACG TGAAATTGAA GTAGCAGTTT 2040 

TAGGAAATGA CTATCCTGAA GCGACATGGC CAGGTGAAGT CGTAAAAGAT GTCGCGTTTT 2100 

ACGATTACAA ATCAAAATAT AAAGATGGTA AGGTTCAATT ACAAATTCCA GCTGACTTAG 2160 

ACGAAGATGT TCAATTAACG CTTAGAAATA TGGCATTAGA GGCATTCAAA GCGACAGATT 2220 

GTTCTGGTTT AGTCCGTGCT GATTTCTTTG TAACAGAAGA CAACCAAATA TATATTAATG 2280 

AAACAAATGC AATGCCTGGA TTTACGGCTT TCAGTATGTA TCCAAAGTTA TGGGAAAATA 2340

TGGGCTTATC TTATCCAGAA TTGATTACAA AACTTATCGA GCTTGCTAAA GAACGTCACC 2 4 00 

AGGATAAACA G AAAAAT AAA TACAAAATTG ACTAACTGAG GTTGTTATTA TGATTAATGT 24 60 

TACATTAAAG CAAATTCAAT CATGGATTCC TTGTGAAATT GAAGATCAAT TTTTAAATCA 2520

AGAGATAAAT GGAGTCACAA TTGATTCACG AGCAATTTCT AAAAAT ATGT TATTTATACC 2580 

ATTTAAAGGT GAAAATGTTG ACGGTCATCG CTTTGTCTCT AAAGCAT T AC AAGATGGTGC 2 64 0 

TGGGGCTGCT TTTTATCAAA GAGGGACACC TATAGATGAA AATGTAAGCG GGCCTATTAT 2 7 00 

ATGGGTTGAA GACACATTAA CGSCATTACA ACAATTGGCA CAAGCTTACT TGAGACATGT 27 60 

AAACCCTAAA GTAATTGCCG TCACAGGGTC TAATGGTAAA ACAACGACTA AAG AT AT GAT 2 820 

TGAAAGTGTA TTGCATACCG AATTTAAAGT TAAGAAAACG CAAGGTAATT ACAATAATGA 2880 

AATTGGTTTA CCTTTAACTA TTTTGGAATT AGATAATGAT ACTGAAAffiT CAATATTGGA 2 94 0 

GATGGGGATG TCAGGTTTCC AT GAAAT T G A ATTTCTGTCA AACCTCGCTC AACCAGATAT 3000 

TGCAGTTATA ACTAATATTG GTGAGTCACA TATGCAAGAT TTAGGTTCGC GCGAGGGGAT 30 60 

TGCTAAAGCT AAATCTGAAA TTACAATAGG TCTAAAAGAT AATGGTACGT TTATATATGA 3120 

TGGCGATGAA CCATTATTGA AACCACATGT TAAAGAAGTT GAAAATGCAA AATGTATTAG 3180 

TATTGGTGTT GCTACTGATA ATGCATTAGT TTGTTCTGTT GATGATAGAG ATACTACAGG 32 4 0 
TATTTCATTT ACGATTAATA ATAAAGAACA TTACGATCTG CCAATATTAG GAAAGCATAA 3300

TATGAAAAAT GCGACGATTG CCATTGCGGT TGGTCATGAA TTAGGTTTGA CATATAACAC 3360

AATCTATCAA AATTTAAAAA ATGTCAGCTT AACTGGTATG CGTATGGAAC AACATACATT 3420

AGAAAATGAT ATTACTGTGA TAAATGATGC CTATAATGCA AGTCCTACAA GTATGAGAGC 34 8 0 

AGCTATTGAT ACACTGAGTA CTTTGACAGG GCGTCGCATT CTAATTTTAG GAGACTTTTT 354 0 

AGAATTAGGT GAAAATAGCA AAGAAATGCA TATCGGTGTA GGTAATTATT TAGAAGAAAA 3600 

GCATATAGAT GTGTTGTATA CGTTTGGTAA TGAAGCGAAG TATATTTATG ATTCGGGCCA 3 6 60 

GCAACATGTC GAAAAAGCAC AAC AC T T C AA TTCTAAAGAC GATATGATAG AAGTTTTAAT 3720 

AAACGATTTA AAAGCGCATG ACCGTGTATT AGTTAAAGGA TCACGTGGTA TGAAATTAGA 37 8 0 

AGAAGTGGTA AATGCTTTAA TTTCATAGAG ATTAGTCGAG GGACCTTTTA CTTATAAAAA 38 4 0 

TGATTTGAAT TAATACTAAA AG AT T AC AAA GAAGAGGTGG TTTTGTGTGT AAATACAAAA 3900 

TTGCCTTTTT CTTTTTATGT TAAATCTATA AATTTG&AC TAAATCAAGG TTAATTCTAT 3960 

GTACACACTT TATATAGGAA GTAGTTTGAA TGTTTATATA ATGTTTTACA AAAAG AT G T A 4 020 

GTATTATAAT GTCTAATTTC ACATGTGTTT CAGTAAAATT TGTTGTGGAA TGTTAACGAT 4080 

ATACGTATTT TATAAAAaAT TTTTTATAAT GATTATTCGA ATGATGCGTA ACGCTTACAT 414 0 

CTTATCTAAT GCTAGCTTTT T G AC AAAAAT ATGACAATCA ATTAATGTGA TTCTAATAAA 4 2 00 

TATTCGCAAA TTGCTTTATT GCGATTAAAT TTTTTTGGTG G TACT AT AT A GAAGTTGATG 4 2 60 

AAATATTAAT GAACTTATAT GCAAAAGTAT AT T GAG AAA T AAACAGGTAA AAAGGAGAAT 4 320 

TATTTTGCAA AATTTTAAAG AACTAGGGAT TTCGGATAAT ACGGTTCAGT CACTTGAATC 4 380 

AATGGGATTT AAAGAGCCGA CACCTATCCA AAAAGACAGT ATCCCTTATG CGTTACAAGG 4440 

AATTGATATC CTTGGGCAAG CTCAAACCGG TACAGGTAAA ACAGGAGCAT TCGGTATTCC 4 500 

TTTAATTGAG AAAGTAGTAG GGAAACAAGG GGTTCAATCG TTGffiTTTAG CACCTACAAG 4 5 60 

AGAATTGGCA ATGCAGGTAG CTGAACAATT AAGAGAATTT AGCCGTGGAC AAGGTGTCCA 4 620 

AGTTGTTACT GTATTCGGTG GTATGCCTAT CGAACGCCAA ATTAAAGCCT TGAAAAAAGG 4 68 0 

CCCACAAATC GTAGTCGGAA CACCTGGGCG TGTTATCGAC CATTTAAATC GTCGCACATT 474 0 

AAAAAC G G AC GGAATTCATA CTTTGATTTT AGATGAAGCT GATGAAATGA TGAATATGGG 4 8 00 

ATTCATCGAT GATATGAGAT TTATTATGGA TAAAATTCCA GCAGTACAAC GTCAAACAAT 4 8 60 

GTTGTTCTCA GCTACAATGC CTAAAGCAAT CCAAGCTTTA GTACAACAAT TTATGAAATC 4 92 0 

AC C AAAAAT C ATTAAGACAA TGAATiATGA AATGTCTGAT CCACAAATCG AAGAATTCTA 4 98 0 

TACAATTGTT AAAG AAT TAG AGAAATTTGA TACATTTACA AATTTCCTAG ATGTTCATCA 504 0 
ACCTGAATTA GCAATCGTAT TCGGACGTAC AAAACGTCGT GTTGATGAAT TAACAAGTGC 5100

TTTGATTTCT AAAGGATATA AAGCTGAAGG TTTACATGGT GAT AT T AC AC AGCGAAACg 5160 

TTtAGAAGTA TTanAGAAAT TTAAAAATGA CCAAATTAAT ATTTTAGTCG CTACTGATGT 5220 

AGCAGCaAGA GGACTAGATA TTTCTGGTGT GAGTCATGTT TATAACTTTG ATATACCTCA 5280 

AG AT AC T G AA AGCTATACAC ACCGTATTGG TCGTACGGGT CGTGCTGGTA AAGAAGGTAT 5 34 0 

CGCTGTAACG TTTGTTAATC CAATCGAAAT GGATTATATC AG AC AAAT T G AAGATGCAAA 54 00 

CGGTAGAAAA ATGAGTGCAy TcGTCCACCA CATCGTAAAG AAGTACTTCA AGCACGTGAA 54 60 

GAT G AC AT C A AAGAAAAAGT TGAAAACTGG ATGTCTAAAG AGTCAGAATC ACGCTTGAAA 5520 

CGCATTTCTA CAGAGTTGTT AAAT GAAT AT AAGATGTTG ATTTAGTTGC TGCACTTTTA 5580 

CAAGAGTTAG TAGAAGCAAA CGATGAAGTT GAAGTTCAAT TAACTTTTGA AAAACCATTA 5640 

TCTCGCAAAG GCCGTAACGG TAAACCAAGT GGTTCTCGTA ACAGAAATAG TAAGCGTGGT 57 00 

AATCCTAAAT TTGACAGTAA GAGTAAACGT TCAAAAGGAT ACTCAAGTAA GAAGAAAAE 57 60 

ACAAAAAAAT TCGACCGTAA AGAGAAGAGC AGCGGTGGAA GCAGACCTAT GAAAGGTCGC 58 2 0 

ACATTTGCTG AC CAT C AAAA ATAATTTATA GATTAAGAGC TTAAAGATGT AATGTCTTGA 5880 

GCTCTTTTTT GTTTTCAATA ATTGATTCTC TGTAGATATC aAAGTaCTAA CGTTTTAAAG 5940 

GTT AAAT ATT TAATTGGATT GAG AT C T G T A TGCGGTTATA TCaTTCTGTG TAAATATGGT 600 0 

TCTCCACCAA ATGTGGTGAG TATATAATTT AAAGAACTAT TTTTAAATTA AGAATAATCG 6060 

AACATAAATA AACTTTATGA AATTTCAGTA TCATGTTCTT ATAAAAAACA ATAGGGCTTT 6120 

TTGctGACGC TAGTGCGCGA TAAATAATAA GTT G AAT AT A AAAAAG AT C A CTGCCAATCA 6180 

TTCGTTTAAT GGCAGCGATC TTTTTTATTT AATTATTTCT CTTTCCACTG CAACATTTGA 6240 

TAACCAATGC GTGGATGTGT TTTAATAATA TCTTTTGCGT CCTCATGACA TTGTGAAAGT 6300 

TTTTGTCTTA ATGATGTCAT ATGCACTCTT AAACTAGGCA TTTCAGTCTT ATTAACATAG 6360

CCATATATTC GTTTTAATAT CATCTCATAA GTGAGTACTT TTCCTTTATG ATTTGACAAT 6420

AGTTCTAACA AGCTAAATTC ATTTGGCGTC AAATGTACCT CCTGATTATT AATAACAACA 64 80 

GATTTGGAGC CAAAGTCGAT GCTTAGCAAA CCGTTAGTAA ATACAATGTT AGTTTCTTGA 654 0 

TGTGACTTAG CGATTCTCTC GATGACTCGT ATTCGTGCCC GAAGCTCATC AAC AT T AAAA 6 600 

GGTTTAGTCA TATAGTCATT CGCACCGTTA TCTAAAGCTT GAATAATTGT TTGTTCTTCT 6660 

TGTCTTGCAC TTATTACAAT GATAGGAATG TCAGTATGTT GCCTGATTTC T G AAAT C AAA 6720 

CATAATCCAT CTTTATCTGG TAAACCTAAA TCTAATAAAA TGACATCHG TTTATCAATT 67 8 0 

TGAATTTTAA AGTGTGCTTG TGTGGCATTG TCGGCTGTAG TTACATTGTA AT AAT CT AAA 68 4 0 

GTTAATGCAA CAT CAAG T AA ATGTGTGATT GCGTGATCAT CTTCAATTAT CAATATTTTA 6900 
GATTGCATTA TACGTCTCCT TCGTTAAAGT CTGTATATAT ATTGAAATAG AATATACTGC 6960

CGTGTGGTTG GTTCGGTTTA TATTGTAAGT TTGATTGATG TTTGTGTAGG ATAGTCTGTA 7 02 0 

CTAAATATAA GCCTAGTCCC ATGCTTTCTT TTTGGTTATC TTTAAAATAT TTATTTGATC 708 0 

CTGTGTAAAA AGGCTCGAAT ATCTTTTGTt GTTCTTCTAA ACTAATTCCA GGTCCTTCGT 714 0 

CTATAACGGC AAATTCGATT TGTTCATAGC TAGCATAACG AATAGATAAA TTGATTTTGG 7200 

TGTCAGTAGA AGTGTGTTTA ACTGCATTTT CAATCAAATT GAAtAAAgCT TGTAAAATCA 7 2 60 

ACTTACTGTC AATGTGTATA AACtGTAAAT TTACTGAGGA TGATACAGTT ATACGCTTTT 7 32 0 

TTAAATGGCG ACGTTCTAAA ATACATATCG ATTTCTTATA CTA 7 3 63 
(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10470 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

TTAACAATCG ATAACCACAA TACTTCTATT GTAATTGTTT AACGATTTCn CGATTAAAAT 60 

CATCTAAATC GTCTGGTACT CGACTTGTTA CAATATTGTT GTCTACAcTa CTGACTCATC 12 0 

AACTACATGT GCGCCTGCAT TTGATAAATC TTTGCGTACA TTTAATACTG CTGTTAACGT 18 0 

ACGACCTTTT AAATCGTCTG TATCTATTAG TATTTGTGGC C C AT G AC AAA TGGCAAATGT 24 0 

TGGTACATCA TTTTTAGTAA AGTATTTAGC AAATGTGCCA TATCGACCTT CTGTATCTCC 300 

ACGTAAATGA TCTGGTGAAA ATCCTCCAGG AATTAATAAT GCATCATAAT CTTCTGGTTT 3 60 

AGCTTCTGCA ATGCCTACAT C G AC AG T AAC TTTTTCACCG TGTTTACCAA CAACTTCACT 4 20 

ATTTGCAGTA TCTCCAATCA CTACAGTATT AAAGCCTGCA TTCTCTAATG CCTCTTTAGG 4 80 

GCTTGAATAT TCTATATCTT CAAATTCGTT TGCTAGAATA ATTGCTACTT TTTTAGTCAT 54 0 

T GAAAAT C AC CTTTCTATAT ATCATTGATA TAATTACTAT AGACAAGTAA ATCAGTGATT 600 

AAACATACAA GATATAAAAA ATATTAAGCG ACTGTCGCGA TATCTAACCC T AAC AC AT C T 660 

TATGTGGCAT TTACTTAGAT ACTAATTTAA CCTTTTCTTC AAGCTGATCT AACAATCCAA 720 

TCCATTCATC TATATCTTCA ACACGTACTT CATCAGGATT TACATGATCG ATATCCTCAA 780 

TAAACTTATT TAAACGCGCT TTTATCTGTT CGATTGTTTG CTGTTCATTC AT AAAAAG T T 840 

AACTCCTTTT ATTTTGTTTT CTTTTTCATT ATTATCCTAA CAGAAATTGC GTTAAAGCGA 900 

TATAATCTTA GCTATATTTA T G AC AT T C AA ATTATTTTGA CTTTTAAAAA TCCCCTTTTC 9 60 
AATTAACTAA AATTAAGAGA TAATTTGTTA CGAGTGATAA TACGAaGkGG TaTCATACCG 1020

ATATGAACCA AATAGAAAGA AGGAAGTTTA AGACGATGAA TAGCGTCAAA TTGAAGCAAC 108 0 

CTGTTAGCAT TTACAATGAT CCATGGGAAG TGAAATTTAT AT AC AT T T AA ATTTCATGAG 114 0 

ACAATAAACG TTGATTTAAT GCGTTTTTTT GCCTTTTTTA TTTTCCTTAT TTTTTCTGTT 1200

T T AC AAC AAA ATGGTATCAA AAATGGTATC ATTTGTAGTT ATTTTAGCTT C AC AT AT T AA 12 60 

AACAACCACA CTCCTAAATT AATAGGTGGT GTGGTTTTGT TGGTTGTGTG GGGATAAAAA 1320 

TAACCGCATC AGTTAAGATG CGGTTATCTA GCAAGGGCCA CGTATTTATA AATACGTTTA 1380 

GAATCTCTTC GGCAACTTTG C TAT AG AC AG TCTATGCTGT TACTAAATTA TACCACCACA 14 4 0 

CAAACCTACT CCCATTCAGG AACACAGAGC TTTGTCGCTC GTCAGCAACG TCATATGAAT 1500 

TCTCAGTTCA TGTTGTGGTG AC AC T T T AAA CGGTCTGTGC CAGTAGCGAC CGAGTCATTT 1560 

CAAGAATGAC CATTTCACAT TTATATTATA ACACTTGTCG TGCGTAACCG TATAGTTTTT 162 0 

CAGTTGTATT TAAAGTTAAG TTATCTACTT CGCGCTTTCC TTGCCTTAAT TGTGAAATTA 168 0 

CATATTGCGC TACGCCAGTT TGTTTGTGAA TTTGGTAACC TGTTATATCA CTTTTGATCA 174 0 

ATTCAATTAT TTTTAATTTA TAATCACTCA TAT TAT CT AC GTCCATTCTT TTTATCTAAA 1800 

CAATAAAAAT GTGTCTTTCT CCC GATAAAT AATAACAATG GTAGGCTTAA TAAAAACAAT 18 60 

ATTAAATACA TTTGTTCTGT CATAATTGAA AACCTCCAAA TAATATTATA TTATATAAGT 192 0 

GTAAGGAGGA GCCATCAGGC TCCAAGCATA ATGTTAATCT TTGTTGTTTG GCTTTCGGTC 1980 

TAGGTAGCCG AGATGCCaTT CTCTAAGTTG TTTTAACACT TCTGGAATTA TCAGTACTGC 20 4 0 

CAATACTTGA TGTTCTAGAA GTGTTTTTAT TATGTCTAGC ATGAGGCTTT TCACCTCCTT 2100 

ACACATAATT TGTAAGTCAT C AAC T AAC C T ACAAATATAA TTATACTAAA CAAATGTTTA 2160 

TTAGTAAAGT GTTTTTTAAA ATTTGCATAA AAAAATAGGC AAGTACCGTA GTACEGCCT 2220 

GTTATCTACA TTTAAATCTT GAGAGAAATG TTAAAAAGTT CTAGTAAAAT AATAGCACAT 22 8 0 

TTTATCTTTA AATGTAAATA GAAAGCAGGT ATGTAACGCA CCTGCTTAAA TAGaCATGAC 2340 

TATGTCATTC TAACTGATTT CTCCCCATAA GTCACCTAAT ATCTGATTAG GTGGGGCAGA 2400

ACCATTCCAT GTTCTAATAG GCAAGTAATA ACGTTGCCCC TCCCATGTAT ATCCTACCCA 24 60 

AACATGACCA TCTTGTAACA TCACTTCTGT ATAATCACAA TACCCACCAG GTTGGAACTG 2520 

ATAACCCACT GGACAAGATA AGAATGGCCC CACTTTTCTT ACTGTGATTG GTTGATTGCC 2 58 0 

GTTTGTGAAT CTAGCACTTT CTTCCATGTA GTAAGTAXA TATTTATTAC GTTTCCATGC 2 64 0 

ACTTGCAACT GGTTTAACTG TATTACTTGA AGCGCTTGAC TCATTAGAGA CAGTGGCAAC 2700 

CGGTATTTTA CCATCCATGT ACGCCCTAAT CTGCTTGATA AAGTAGTCTT TAAGTTGCAA 2 7 60 

CCGCTTGTCT TCTGGCAATA GACCGCGAGT TACTGGGTCA AAACCAGTGT GTAAAACCGA 2820 
ACTTCTATGA GGGCATGATG TTGAAGTAAA TTCATTGTGC AATCTGATTG TATTTCTGTT 2880

TGCTGGTAAT CCCCATTTTT TCAACAATCT AGCGCATTCT TGGAAAGTTG CCTGTTCATT 2 94 0 

TTTTAAGAAT GTCGCGTTAT CTGCGCCCAT TGATTGACAT ACTTCAATAC CGTAATAATA 30 00 

TTTATTACCT ATTTGATTAG CGGTATGCCA ACCTACTTGT GATTCATCTA AGGCTTGCCA 3060

AACTGTGTTG CCTGATACGT AACTATGCGC AATGCCCGCT TCTAATCTTG ATAAAGGTGC 3120

ATTTACTAAT CCGTTACGAT ATGCTTCAGC AGTCGCCCCT TTGCTCCCTG CGTCGTTGTG 3180

TATAACTATA CCTTTAGGGT TACTACCACG CTTAGGTAGG TCASACCTT TAACCACATC 3240

TTTGATGATT TTAAGTTCTA CTGCTTTAGG TTGTGGCTTA GCTGTTTCTT TTTTAGGTGC 3300

TTGTGTAGGA GATTGAACTG ATCGTGGCGC TGTCTCACTT TTAAAATTCG GACGGATAAA 3360

CCACATAGGG AAATCATAAG CATGTTGTCG TCTTGTAACT TTTTCCCAAC CCCAGCCGGG 34 2 0 

TTGTTCGATT CCGTCAGTCC AGCCACCGCC TAGCCAATTC TGCTCATATA CAATGATGTA 34 8 0 

ATCTAAAGTT GCTTCAATTA CCCATGCAAC GTGACCATAT CCAGCACCGT AGTTGCTACC 35 4 0 

GAATACCACC ATGTCGCCAG GTTGTGCTAA GAAGTCCGGT GTATTTTGGT ATACAGTAGC 3600 

TAATCCGTCG AAGTTGTTAG CGAACGSAAT ATCTTTTGCA CCTAAACCTT TTAGAAGTAA 3660

TCCAAACAAA ACTTTCCAAC CAGCATTGGC ATAATCAAAG CATTGAAATC CATACCATAA 3720

GTCCACATTG AATTGTTTTC CCTCAGAAGT TTTCAACCAC TCTATAAACT CATTTTTAGT 3780

TAATTTTGCT TGCATTGTCG CCACCTCCAT GATGATACTC ATT C AC AT C A AGCCAACAT 38 4 0 

CGTTAGAGGC GTCTGTGAAA GGTTGTGATG TATCATATTC TTTTGGTGcT TTCGCGCTTA 3900 

ATTCCGGCGT TAAACTACTG TCTTGTGATG ATTTCCACGT AACTTGTTGT TCTTCTTTTT 3960 

TGCTATCTCT AGGCGCTTGA TATGTCTGTG CTATAGATGA ATCTGAGACG CCTTTTGACG 4020 

TTGGGTCAGT AATAACGCCA ATACCTGTAA GT AACGT GAG GATAGCGCCT ATAATTGCGC 4080 

TAGCTTGATT TAATTGAGTA GATAAATCTA ATCCGAATAA ATCCGTGACT TGCTTGATAA 414 0 

ATAGCAACAA TGCTCCAACT AAAC C AG T T A GTACTGCTTT GTTTTTGAAT CTCAATTTCC 4200 

AGTTAATATC CATTTGTTTG CTCCTTTTAT CCfflAATAAA AAAAC G AC T A AAAATTAGTC 4 260 

GTTTAAAATT ATTCAATGGT CAATGTCGGA GATCCTGAAT AAAC AT C AC T TATAGTGACG 4 320 

TACAACATCC CTGAAGGATT ACTAAAGTTG ATATTTTTAC TTGCAACTCC GCTATTGACT 4 380 

CCTGATATTC CTAAATCACT TGACCCTAAA TTAGTTTGCG AAATCCTCAT TATACCGCT 44 40 

CGTACATTTT CTATTGTCAC CTGATAACTT TTATTGGGTT CAACTCCATT TATTGTCCAT 4 500 

TTTGCTGTTG ATTCTTCTAT GCTATCCGGA TATTTATTTT TAGGTAAGGG TTTTATTACA 4 5 60 

AAAGATGAAG GCTTTTTCCA TACTTGGATA TTTCCAGCAT ATACTTTTGT ATATTCTTCA 4 620 
CCTTCGTAAA TAAACTTCTT TACATTTTTA AAATTACCTT CCATAAAAAT CACCCTTTAA 4680

TTAAATATAA CGTATTCGGG TCTTTTTGAT ATATATAGTT ATATTCATTT TCTGTTCCTG 4740 

TCCAAATTTT AACCGTCGGT TGAGATGCGC TTTTTAGTTG ATATAAATTA TCCGCTTGTT 4 800 

GTTTAGTAAA AGCTTGAGAT GACAAAACAT ACCGCTCGTC ATGATTATGA TTTTTTGGAG 4 8 60 

CATATAAATC ATTTAGTGTT TGTTTGAATT CCTCAAAATC TTCTGTATTA ACTTTTGAGC 4 920 

CAATCTGTTG CAATACACTT TCTGAAATAG AGTTGTTTTG TATTGCTTCT GCTAATTCTC 4 980 

TTAATGTGTT CATAGATTCA GGCGCGCTAT CAACTAGTTC AGCAATTTTT GTATCCGTAT 5040

ACGTTTTAGA GTCGTTGAGA GTTGTATCTT TGATTTTTTC AACTTCTTGC AATTTATTTT 5100 

CTAACCCTTC AACATTTGCG ATATTGATTT TGTCCAATAA CTCAGGTTCT GCTTTGATAT 5160 

CTGTATCTTT ACCATCAATT TGCCACATTT TAGTGTCAGG ATTGATTGAT AC T AC AG T AC 522 0 

CGTTTTTACC GGGTGCGCCT TGTTCTCCTT TTTTACCTGC TTCACCTTTT GCTCCAGGTT 5280 

GTCCCGGTTC ACCTTTATCA CCTTTCGCAC CTTTAAATCT ACTTTCATTC TTTTCGATGT 5340

AAGAAATGAC ATCTTTATCT ATTTTCTCTT TAAAGTCTTT GCTCAATAAA TCTGTCGCGT 5400

TATCTTTTAA AATTCTCGTA ATAGCATCAT CTACCAATTT AACATCG2RT TCTTTTGCTA 5460

CAGCAGATTC AATACCACTA TCAACGATAT TGAAAGAAAA GTTTGCGACA TGTATTTTTT 5520

CTTCTTCTTT CTCTAAAAAC AGCTTACAGC GAACATAACC AGCGTGTTTG ATAACCTTTT 5580 

TAGGTATCTT GTAGGTAAGG AAACCTTTTA CAACATCGTC GATAATAAGG GGCTCATTTT 5640

TGAATATAGA GCCATCTTCC ATAAACAAAT GTAATCTAGG TGTTAAGCCA TGTGCTTTTA 5700

GATCGATACG ACCTTGTTTG TCATTGATAC CTATTCTTAT AGATGCTGTA TTTTCATCTT 5760

CAGTGTAAAA TCGACAGCCA ATGTCACCTA AGTCAACACC ATCATTTTTT ATTCTCGTTT 5820 

CAACATCTTT TATTTTGTAC ATTTACACAC CTCTTTATTT ATATTTATCC CTTGTGAAGT 5880 

AGATACCTTT TAAGCCGATT TGTTTATATA ACTTAGCGAT TGTACTTGCT TGATGTTGGC 5940
ACCACTCTAT AGCAGTAGCG TATTGGTGGG TAGCTGGATT CTTAGGATTC CATCTAATTC 6000

GGTACAATGT GTTTTGACCT TTATTGATGT AATCCTTTCT TACGAAGCTA GCACGCCCA 6060

TGATTGCTTT TGCTGGAGAT GTCCAACCTT TATTCCTTGC AAACGTCATT GCGTAGTTAG 6120 

GATTGTTGTC GTAAGCGCCA ATGCCGAAGT AGTTGTATAC TCCATCTTTT CCGTTAGCGA 6180 

AGTTACTTGT TCCATATCCA CTTTCTAAGA AAGCATGCGC GATTAAATAA ATTTCATTAA 6240

TGTTGTGCTT TTTACAAGCT TCTGCGAACG CTTTACCTTG ATTATTCAAT GTTCCCTTAC 6300 

CTTTAAGTAT CTTATTAAGT GCGCTAACTG AAACACCTTG ATACTTGCCT AAATTAAGCA 6360 

TTTGGTAGCA TTGTGTGTTA CTTTCCCATA TACGCTTTAC ATTCATTGCT GAACTCGTTT 6420

GTGCTCGTGT AGCGTTAscC AACCCCAAGC ATTAGATTTT TTCGGGTTAC CTCTTGCCAT 6480

TTGTTTATCC AGTGCTTGTT TGAATGTATA AGGACTCGTT TCTGTTATGA TCTGCGGTTG 6540

TTTAGATGCC GAACCATTGT TGGCTGTTGG TGACGAGTCT CTTACATTAG CTATATCAGC 6600

GTTTTTATTA TCTACCATAA CTTTTATTCT AGATTTTGTT ACTGTTGGCT TAGTTATAGA 6660 

ATTTAATAAT TTTTCTCTGT TTTTAAATAT ATTAAGTAAT GCCTTTTCTA ATGCTTCGTA 6720 

TTTATCTTTA GGAGGAACAC CGTTGTCAAT CATATTCCAA TTAACATGTT CCAACATTGA 6780

ACGCCAAATG CTGTCGTCTA CTTTTAAATT TTCAATACTT AGAGGTATCT CATATTTGGC 6840

CATCATATCT ACAGCTACm CCATTGCGTG AATCTCATTA AAAATAAATT CATTTTTACT 6900 

CGCACTATAA TCTTCACATA CGTCTATAAC TATATAATCA GGTTCATTAG GAACTTCAAA 6960

TACAGCTCTT CTAGGTGCCC AAATATTATG TCTATCAACA TAAAAGTGGG GATATTCTAC 7020

ATCCTGTTTG TATTTCTTCC TACTGTTATA TAAACTTTCT ACCSGCTCA TCGTTTGTGC 7080

GTTTCTAATC ATTATTCCTT TAGGTTTTTC GAGTCGTCGA TTACCTTCTA CTATAAAGTG 7140

ATAAATATAT TCTGGATAAT TAACCTCTTG GCTAGAAATA GTGTACTTTA TAGTTGTTAC 7200 

ATCTTTCCAA ATTGGAACTT TTTTATTATT TTTTTCGTTA TCATCACTAT CATCTTCTGG 7260

TTTAGGTGCC GGTGTAGTTT TGTCTGGATG ATATGGTGGT CTAACAAAAT ATTTAACCCC 7320

TCCACCTGGT CCATCATGAT AAGAGTGTTT AATTTTATAA GGTGGACTTC CTGTTGCGTT 7380

ATTTGTATAC CAGTTTTGAT CTACGCCATA CCAATAGTCT TTTGTGCATG GTCCCACTAC 7440

AATGTTTACA TGTCCTGCCC AACCACCAGT CCAAACACCC CAGTCGCCTG GTTGTGGTAC 7500

AAAATCTTTT GTATTTCTAA TTATCTTGAA ATCTCTACCT CTATAATTGG ATTTTTGAGC 7560

CATAGCATCA GCATTTCCCC ATGTTCTAAA CCCCCAATAT TTATCGAGTA AATAATTAGG 7620

TAAATCCCAG CATTGTGCTC CCATTCCAGA ACCAGGTACA TCAATAGCTA TTTGTTTTT 7680

AGCGATATAT AACGCCCATT CAACCACTTC ACTAGCTGTG GGCTTTCTAT TTTTCGGATT 7740

AGGTAATCCC ATGTATGCAC CTCATTTCAA TCAAAATAAA AAGCCAGTGC CGAAGCACTG 7800

ACTCTTAACT GTTATTTACA TTTACCAAAC CAGAAGCACG CCCAGAAGCT ATATCCTAAA 7860

ATCCCTTTAA GCATGGTAAT CACCTCCTTT AAATACCAAA AACAGTTCTT AGTAAAGCTA 7920

TGACAATCGT ACTGAAGATA GTCCCTATCA AACCTAGAAT CCACATTTTT ATGTCTCTAA 7980

TATTCTTGGC ATTCTTTTCT TTATTCTTTT CATCTTCTAC CTTGTCGCGC TTTAATTCTT 8040

CAAAATTTCT ATCTAATTTG TCATAAATCT TTTJTTGCGC TCTAAGACTA TCTTCTATTC 8100 

TGTCGAATTT TTCAAACATA GTCTTATCAT TTTCTTCTAA TCGCGTTAAA CGCCAATCTT 8160 

GTTCATGTCG TTTGGTAAAT CCAAACATTA TGCCACCCAC TTTATTCAAA TTAAAAAGCC 8220

ACAAGCATTA CACCTGTGAC TTTTCATCTT TTGTTTCTGG ATATTTTTCT CCAGTGATff 8280 



AAGCGTATTC TTCTTTATCG ATTAAACCCT TGTCTACGTA CCACTTAATT TGCTCGTTTT 8340

TATAGTAACC CCAAACATAA AAAGTTTTAA TGTCTTTAAA AGTTGGATAA ATCATCTTCA 8400

TTATTTAAAC GTCCCCCTCA GTACTTGTTT TGTTAGTTTT CAGTTCAGTC AACTGTTGTG 8460

TTAACATAGC GTTTTGTTGA GCTAATTCCA TTGTTAATAC GTTTACTTGT GCCACCTGCA 8520 
TTTGCATACT CGCAACCATT CCGCGAAGTT CCTCATCACT TAAATCTGAC GCACTTTGTT 8580

GGTTTGATGC ATTCGGTACG TCTTCTTTTT CGAAATTGCT ATTGTATTTA ATTTCGCCGT 8640

TAGTGAAAAC AAACTTTCTA GGTTCGAACT CTTCTTTAAA TTTAATAGGC ACATTGTTAT 8700
CATCTACATC TAAACTATTG CGTAAACCGC CAGTATTAAC GAATCCGATA ACTTCGTTTT 8760

TATCGTTTAC TGTGATTTTC ATTATTTCCA CCCCATAATT TTAGTTATAG TAACTTTGTT 8820

GGCATTGGCT CCAGAACCTG ATGTTTTACC TAAATCAAAG TACACATCGT TATCTATTCT 8880
TAAAGTAGTG CTACTTGTTT TGGATAGTAA GCACTCATAA ATACCGCCAC CGTTGCCGTC 8940

TGAGTCAACT ACATTCGCTT TACTCAATTG AATCGCGTTA GGTAATGCGG TTAGTCCGAA 9000 
TCCCTCAATA ACGCCACCTG GATAAGTTCC ACTTACCAAC AAAATAGAAT AGTTTGTGTA 9060

CGGTTCAGTT AGATTGATTG TTGTACCTAC ACCATTTGCG CCACCGTCGA ACAATACCGT 9120 
TGATTTATGT TCATTAGGAA CTGTCCACTG TTGCTCAAGT CTGCCGTTTG TGATTGATCG 9180

TGTGTAAATC TTTTTAGAGT TATAAGGTGT GAAGTTAAAT AGCTTGTTTG TATCATCTTT 9240

AACGAATACC GATAAATAAC CCTCATAACT TTCAACGCTA CCTGGTAAT CCGGCACTCT 9300 
TGTTGCATAG TAATTACCAG CAGTTAAATA TCCCAAATCG CCTTGCGCAT TATTTAAGTT 9360 
AACTTGAATT GATTGACCAT TCGCCTCTGT CATCTTATGT TGTTGCCAGC TCGTTGTTCC 9420 
GAATTTATCA TCTACATACT GCTTAGCTTG ATTTAAAGCG TTGTTAGACG TTTCTTCAAC 9480 
AAATTGCTTA GTTAAGTTTC CATCATTCTT TTTATAAAAC GGGTACCATG TGCCGTAGAT 9540

TTTGTATTTT GTGTACTCAT CGTTTGAATC GTCTGGGTAC CATGTTGCAC GAGCAGTATT 9600 
ATTATCAACA ACATAAACAA CTAACACACC AGATTTGCTT GATGTATAAG TTGATTCATC 9660

GAACGAAGAA CCGTCATCAA CACCATCTTG TCCAGGCTTC TCTAACGTGC CTATATCCGT 9720 
CTTTTCTGGC GCATCTGTTG CATTAGTAAT ATGAATAATC CTAGATGTGT TAACTGCGCT 9780

TAAAACGCTA TCTATGGACT GCTCATACGA TTCAATTGCT TTACCGTAAT CATCTGTAAG 9840

TTTAGACTTT TGCCAATTCG TTGTTGAATT ACCTTTAACA AGGTCAGCGC CATTSTTTG 9900
TTGTTCAACT TCGTTAACAC GTTCAAAAAT CGCTTGCTCT TTTTCAACTA TTTTATCGAA 9960 

TTCAGCTGTA ACAGCTTGTG TTGCACTAGT TTGCGTCGCA GTAATAGCTT GTATAGCTTC 10020

GTTTTGCTTG ATTTCGATTT GTTGAATGCC TTTTGTCGCA CTATCATTCA CTTTTGCTAT 10080

TAACGTTTGT GTATCAGCCA TATTTTGCTT TAATTGGTTA AAATCTTTAC CGACAGCTTC 10140 
GATAGTATCT TGAATAGATT TGATATAAAC AAGCTTTGTT ATACCATCAA ACCCACTAAC 10200
TAAATCATTT TCAATATTGA AGCTAAATTG ACGTTCAACA ACAACATTAT TACTCCCGTT 10260

TTGTGTAAAG AATGCCTGAG CATGCACCTT GCCTGAffGT TTTAAAAATT CATTCGGTAT 10320
CACATACTGC AAACGCCCAT TAATTGCGTC TACTATCGTT AATTCGTCTG AAATATAAGC 10380

GCCTCTATCT ACGTTATAAT CATCGGTTTT TAAnACGATA GATGTTTTAA CATGTTCAGA 10440

ACTTATAGAT AAGGGTCTGT TATnCTTAGT 10470



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3647 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

ATCAGATCTT GAGAATCGAG TTATTAAGTC TATCGAAGAC TTAACTAAAA TCCAACCATT 60 

CATGCCTACA CAAGATTTTG ATTTTAAAAC TAAAGAAATT CAATCAAACA TTTCTGAAGA 120

AAGATTTATC GAAATGATTC AGTATTTCAA AGAGAAAATA ACAGAAGGGG ATATGTTCCA 180 

AGTTGTGCCA TCAAGAATTT ACAAATATGC GcATCATGCT AGTCAGCATT TAAATCAACT 240

TTCGTTTCAA CTGTATCAAA ATTTAAAACG ACAAAACCCA AGTCCATATA TGTATTATCT 300 

TAATATCGAT CAACCATATA TTGTCGGTAG TTCTCCCGAA AGTTTCGTAA GTGTCAAAGA 360 

TCAAATTGTA ACAACTAATC CTATTGCAGG TACGATTCAA GTGGTGAGA CGACACAAAT 420

AGATAATGAG AATATGAAAC AACTACTTAA TGATCCAAAA GAATGCAGCG AACATCGTAT 480

GCTAGTTGAT TTAGGACGTA ATGATATTCA TAGAGTAAGT AAAATCGGTA CCTCAAAAAT 540

TACTAAATTA ATGGTTATTG AAAAATATGA ACATGTTATG CATATCGTAA GTGAAGTCAC 600

AGGTAAAATA AATCAAAATT TATCGCCAAT GACAGTTATT GCGAATTTAT TACCAACAGG 660

TACCGTTTCA GGTGCACCAA AATTACGTGC AATTGAAAGA ATATATGAAC AATATCCACA 720

TAAACGGGGC GTTTATAGTG GTGGTGTTGG ATACATAAAT TGTAATCATA ACTTAGATTT 780 

TGCATTAGCA ATTCGAACGA TGATGATAGA TGAGCAGTAT ATCAACGTAG AAGCTGGTTG 840

TGGCGTTGTA TATGATTCTA TTCCTGAAAA AGAACTGAAT GAAACGAAAT TGAAAGCTAA 900

AAGCTTATTG GAGGTGAGCC CATGATCTTA GTTGTAGATA ATTATGATTC CTTTACATAT 960 

AACCTAGTGG ATATTGTTGC TCAACATACT GACGTCATTG TTCAATACC TGATGATGAT 1020

AATGTGCTGA ATCAATCGGT GGACGCTGTT ATTATATCTC CTGGTCCAGG GCATCCATTA 1080

GACGATCAAC AGTTAATGAA AATCATATCA ACCTATCAAC ACAAACCCAT TTTAGGTATT 1140

TGTTTAGGGG CTCAGGCACT GACTTGTTAC TACGGTGGAG AAGTCATTAA AGGCGACAAG 1200 

GTTATGCACG GCAAAGTTGA TACACTAAAG GTTATATCGC ATCATCAACA TCTGTTATAT 1260

CAAGATATAC CAGAACAGTT TTCAATTATG AGATATCATT CATTAATAAG TAACCCTGAC 1320

AATTTTCCAG AAGAATTGAA AATTACTGGA CGTACCAAAG ATTGTATACA GTCATTCGAG 1380

CATAAAGAAA GACCGCATTA TGGTATTCAG TACCATCCTG AATCATTTGC TACAGACTAT 1440

GGTGTCAAAA TAATTACAAA TTTCATTAAT CTAGTGAAGG AAGGATGAAA ACCATGACAT 1500

TACTAACAAG AATAAAAACT GAAACTATAT TACTTGAAAG CGACATTAAA GAGCTAATCG 1560

ATATACTTAT TTCTCCTAGT ATTGGAACTG ATATTAAATA TGAATTACTT AGTTCCATT 1620

CGGAGCGAGA AATCCAACAA CAAGAATTAA CATATATTGT ACGTAGCTTA ATTAATACAA 1680

TGTATCCACA TCAACCATGT TATGAAGGGG CTATGTGTGT GTGCGGCACA GGTGGTGACA 1740

AGTCAAATAG TTTCAACATT TCAACGACTG TTGCTTTTGT TGTAGCAAGT GCTGGcGTAA 1800 

AAGTTATAAA ACATGGtAAT AAAAGTATTA CCTCaAATTC aGGTAGTACG GATTTGtTAA 1860

ATCAAATGAA CATACAAaCA ACAACTGTTG ATGATACACC TAACCAATTA AATGAnAAAG 1920

ACCTTGTATT CATTGGTGCA aCTGAATCAT ATCCAATCAT GAAGTATATG CAACCAGTTA 1980

GAAAAATGAT TGGAAAGCCT ACAATATTAA ACCTTGTGG TCCATTAATT AATCCATATC 2040

ACTTAACGTA TCAAATGGTA GGCGTCTTTG ATCCTACAAA GTTAAAGTTA GTTGCTAAAA 2100 

CGATTAAAGA TTTAGGTAGA AAACGTGCAA TCGTTTTACA TGGTGCAAAT GGTATGGATG 2160 

AAGCAACACT ATCTGGTGAT AATTTGATAT ATGAATTGAC TGAAGATGGA GAAATCAAAA 2220

ATTACACATT AAATGCGACT GATTATGGTT TGAAACATGC GCCGAATAGT GATTTTAAAG 2280

GCGGTTCACC TGAAGAAAAT TTAGCAATCT CCCTTAATAT CTTGAATGGT AAAGATCAGT 2340

CAAGTCGACG TGATGTTGTC TTACTAAATG CGGGTTTAAG CCTTTATGTT GCAGAGAAAr 2400

TGGATACCAT CGCAGAAGGC ATAGAACTTG CAACTACATT GATTGATAAT GGTGAAGCAT 2460

TGGAAAAATA CCATCAAATG AGAGGTGAAT AATATGACGA TTTTATCAGA AATTGTTAAA 2520

TATAAACAGT CACTTTTACA AAATGGCTAT TATCAAGACA AACTTAATAC CTTGAAAAGT 2580 

GTGAAGATTC AGAATAAAAA ATCTTTTATA AACGCAATTG AGAAEAACC AAAGCTAGCA 2640

ATTATTGCAG AAATTAAATC GAAGAGTCCT ACAGTTAATG ACTTACCTGA ACGAGATTTA 2700

TCGCAACAAA TCTCAGATTA TGACCAATAT GGTGCAAATG CCGTGTCCAT TTTAACTGAT 2760

GAAAAGTACT TTGGTGGTAG TTTTGAAAGA TTACAAGCAT TGACGACAAA AACAACATTA 2820

CCCGTATTAT GCAAAGACTT TATTATAGAC CCGCTTCAAA TTGATGTTGC TAAACAAGCT 2880

GGTGCATCTA TGATTTTATT GATCGTTAAC ATCTTATCTG ATAAACAATT GAAAGATTTA 2940

TATAACTACG CTATATCGCA AAATCTAGAA GTGTTAGTTG AAGTACATGA TCGCCATGAA 3000

TTAGAACGTG CCTATAAGGT TAATGCTAAA TTGATTGGTG TAAATAACAG GGACTTAAAA 3060 

CGATTTGTTA CAAATGTGGA ACATACAAAT ACTATTTTAG AAAATAAAAA AACAAATCAT 3120

TATTATATTT CTGAAAGTGG TATTCACGAT GCATCTGATG TAAGAAAAAT CTTGCATAGT 3180

GGTATCGATG GCTTACTAAT AGGTGAGGCG CTTATGCGTT GTGACAATCT ATTGAATTT 3240

TTACCACAAC TGAAAATGCA AAAGGTGAAG TCATGATGAA ATTGAAATTT TGTGGCTTTA 3300 

CATCAATAAA GGATGTTACA GCGGCCAGTC AATTACCTAT TGATGCGATA GGTTTCATCC 3360 

ATTATGAAAA AAGTAAAAGG CATCAAACAA TTACCCAAAT AAAAAAGTTA GCGTCTGCTG 3420

TTCCAAATCA TATCGATAAA GTATGTGTCA TGGTAAATCC TGATTTAACA ACAATTGAAC 3480

ACGTATTAAG CAATACGTCA ATTAACACAA TACAGTTACA CgGCACAGAA TCTATTGATT 3540

TTATACAGGA AATTAAAAAG AAATATTCAA GCATTAAAAT CACTAAAGCT TTAGCTGCaG 3600

ATGgAAAACm TwATCCCAAA caTtAAtnAA tnTTigGGGG TCCGTGG 3647
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5966 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

CcAcCTTGAC CACCTTTACG TGGAATCTTT TCmCCTkGAG CAACaTCGaT AATaTATATT 60 

GAAAgTCAAC AAGTTCTGGA CTAAATGTTG CTGCTAAGTT ATCGCCACCA GATTCTATGA 120 

AAATTAGTTC TATATCGTCA TGACGTTCTA ATAATTCGTC TATTGCTGCA AAGTTCAAG 180

ATGCATCTTC ACGAATCGCA GTATGAGGAC ATCCACCAGT TTCAACACCA ATGATACGAC 240

TTTCAGGTAG AACTCCTGAA TTTACTAATA TCTTTTCGTC TTCTTTTGTA TATATATCAT 300 

TTGTAATAAC GCCGATACTC ATTTCTTTTG AAAGACGTTT TACAACTTTT TCAATTAATT 360

GTGTTTTACC TGCACCTACA GGACCACCAA TACCAATTTT AATCGGATTT GCCACAATTA 420

TAACCTCCTA TGATATGAAA tTCTAACATT GaCGTTCTCA TGCGCCATTT GATTTAGTTC 480

TAAACCAGGC GCTGTCATGC CAAAATCTGC TTCTTTTAAT TCGAAAATCT GCTTTCTTGT 540

TCCTTCTATA TAAGGAATCA TGTGAGTAAC TATCTTTT& CCAGCAGTTT GTCCAAGTGG 600 

AATAGCACGA ACAGCATTTT GAGTTAAACT TGAAACATTT TGATATAAAT AGTAATCAAT 660

AATCGTTTCA ATATCTACAC CTAAATGATG GCCTAGCATA GTAAAACAAA TAGCTGGATT 720

TnACTTTGCT TTCTTATCTT GCATTTGTTG ATGATACCAA GCAATCCATG GGCTATtATA 780

AAGTTCTAAA GCCAATTTAA CCATGCGAGT CCCCATTTGT kTTGCACCAA CACGTGTTTC 840

TTTAGGTAAG TTTTGrACAr ACATCAGTTT ATCTATGTGT AATACTTTTT GTGTATCATC 900 

ATTTTCCAAT GCATCATAAA CTAaACGCAT GGCTAAACCA TCAGAATAGG TAAGTTGCTC 960 

TTGTAAAAAC ATTTTTAACC AAGCAATAAA AGTATGATCG TCATGAATTA TATTTCGTTG 1020 

AATATATGTT TCAAGACCAA ATGAATGACT GAAAGCACCT GTTGGAAACT GTGAATCACA 1080

GAACTGAAAT AATCTTAAGT GTGTATGATC AATCATGAGA ATGCCCTATA TGTCTGAAAG 1140

CCTTATTAAC TTTACGGTCT TCTCGAACAT ATGGGATGCC TAAACTTTT AATAAATCTT 1200

CAACTAAATA ATCATATTGT ACTAGCATTT CAGTCTCTGT AAATTGTGCT GGCAAATGAC 1260

GATTTCCTAA TTGATGGGCT ATATCTCCCA TTTCTTGCAA TGTTCTTGGT TGAATCACTA 1320

AAAGATCTTC TGAATTAACA TCCACAATAA TCATATTATG GTCATCTGCG TATAAAATAT 1380

CTCCATATTG TAAGTCAATA GGTTGTTTTA AACGAATGCC TATTTCAGTG CCATGGTCTG 1440

TAACGACTCT TTGAATACGT TTAACAAGAT CTGAATTTTC AAGGTATACT TTTTCGACGT 1500 

GCTTTTGTTT TTCTGAATTT GACAAATTGG CAATATTGCC TTGGATTTCT TCAACAATCA 1560

TTCTATGTTC CTCCTAGAAT AAGAAGTffC TTTGAGTTAA TGGTAACTCA GTTGCTGCAT 1620

TACTTGTAAT TTTTTCTCCA TCTACATATA CTTCATATGT TTGTGGATCA ACGTCTAATT 1680

TAGGTGTTGC ACTGTTATTT TTCATATCTG CTTTAGATAA TTGTCTAATA TTTTTAACTG 1740

GACGCACCAT GCGTTTTAAA TTTAATGCAC GATTGATACC ATTTTCATAA GCSTTTTAG 1800

ACACGAATGT CATTGACGTA CTTGTAAGGT TTCCGCCGTA TTGACCATAC ATTTTACGGT 1860

ACTTCATCGG TTCAGATGTA GGTATAGAAC CATTTGCATC GCCATTTACG GCAGAGTTAA 1920

TTAATCCGCC CTTTACAACT AATTCAGGTT TAACCCCAAA GAAAATTGGG TCCCATAAGA 1980

CAATGTCAGC TAGTTTGCCC GGCTCGATAG ATCCTACATA TTCAGAAATA CCATGTGTAA 2040

TTGCTGGGTT AATTGTATAT TTAGCGATAT AACGTTTGAT GCGATTATTA TCATTATGTT 2100 

CAAAATCACC ATCTAAAGGA CCACGTTGTT CTTTCATGCG ATGTGCTACT TGCCATGTTC 2160

GTGTAATTAC TTCACCTACA CGGCCCATTG CTTGEAATC GGAACTAATC ATACTGAATA 2220

CACCCATATC TTGCAGAACA TCTTCTGCTG CAATCGTTTC TTTACGAATA CGTGAATCTG 2280

CGAATGCGAT ATCTTCAGGA ATAGCCGCAT TTAAATGGTG AGTAATCATT ACCATATCTA 2340

AATGTTCATC TACAGTATTA TGTGTATAAG GCAAAGTTGG ATTTGTAGAT GAAGGTAAAA 2400

TATTTGAAAA TGCAGCGGAT TTAATTAAAT CAGGCGCATG ACCGCCACCA GCACCTTCAG 2460

TATGGTACAT ATGAAGTACA CGGTCTTTAA CAGCAGCCAT TGTGTCTTCC ATAAATCCTG 2520

CTTCATTTAA AGTATCTGCA TGTAATGCAA TTTGAACATC AAATTCATCA GCAACATCTA 2580

ATGCATGACT CAAAGCAGAT GGTGTTGCAC CCCAGTCTTC ATGTACTTTT AATCCAATTG 2640

CTCCGGCATT GATTTGTTCA ATGAGTGCAG TTGGATTTGT TGCTTGTCCT TTACCTGTAA 2700 

AACCGACATT AATCGGTAAA CcTTCGGCAG CTTCTAACAT TCTATGAATA TGCCATGGAC 2760

CTGGAGTTAC AGTTGTTGCT TTAGAACCTT CTGAAGCACC ACEACCACCA CCAATATGAG 2820 

TCGTAATACC ACTTTCTAAT GCGACCTCTG CTTGTTCAGG ATTAATAAAA TGAACATGAG 2880 

TATCAATACC ACCAGCAGTG ACGATTTTAC CTTCAGCGGC AATGATATCT GTTGTTGAAC 2940

CTATAATAAT GTCGACATTA TCCATTATAT CTGGGTTGCC GGCATTACCT ATGGCGAAAA 3000 

TATAACCATT TTTAATGCCT ATATCAGCTT TAACCACTTT ATCGTAATCG ATAATAACGG 3060 

CATTAGAAAT GACAAGGTCT GCAACGTTCA CGTCATCACG TGTTACACGA GGATTTTGCG 3120

CCATACCGTC TCTAATAGAT TTACCACCAC CAAAAGTAGC TTCTTCACCA TAAACCGCAT 3180

AGTCTTTTTC TATTTGAGCA AATAGATTCG TATCACCTAA ACGAATGGAA TCTCCAACAG 3240

TTGGACCGTA TAAGCTCGTA TATTGATTTT GCGTCATTTT AAAGCTCATG ATCTTTTTCC 3300 

TCCTTTTTTA TTCACGTTTT CAGCACCGTT ATCTCCGAAT ACACCTGCAT ATTCATCATT 3360

TTCATCAGTT GGGCGATAGA CACGTGACTC ATCGATAGGA CCATTGACCA TACCACGAAA 3420

ACCAAAAATT TTACGTTTGC CAGCATATTC AACTAATTGA ACTTCTTTTT TATCCCCAGG 3480

TTCAAATCGA ACAGCTGCTC CAGCTGGAAT ATCTAAATGT TTTCCATATG CCATTTCACG 3540

TTCGAAATCT AATGCTGCAT TTGCTTCATA AAAATGAAAA TGTGAGCCCA CTTGAATTGG 3600 

TCGATCTCCT GTATTTTCAA CTTCGATAAC TGTTTCAGGA TGATGGTTAT TAATTTCAAC 3660

CTCTGTACTT TTTGTAATAA TTTCTCCTGG TATCATTTGA CTGCCTCCTT TAAACAATAG 3720 

GGTGATGTAC TGTGATTAAC TTAGTACCAT CGGGGAACGT AGCCTCGATT TCGATATCTG 3780

TAATCATGTG TTCGACACCA TCCATGACAT (TTCTTTGTT TAGAATTTGT CTACCATAAC 3840

TCATTAACTC TGCAACGGTC TTACCATCGC GTGCACCTTC TAATAATTCA TCGCTGATTA 3900 

AAGCTAATGC CTCAGGATGA TTTAGTTTCA AACCACGTGC TTTACGACGA CGTGCAACTT 3960

CCGCCGCCAC TACAATCATT AATTTGTCTT GCTCTCGTTG TGTAAAATGC AAATTAfflAC 4020

CCCCAATTTC ATATTAGATA CaATTTACAA AATTTATATT AATCCTAATT GTTGTGATAA 4080

ACAAGTAATA TACAAAGTTC AATGTGTAAT TAGAAAATTA TATTTTTAGC ATATCCGATA 4140

TTGAAGCAAA CAATCTAATC GAAAACAAAT AGTGGAATAT ATTTATGTAA AAACCAAAAT 4200

AGTTTTTAAT ATAACTTTTC ATAGAATAGT AGTATATTAA TGAGTAATGA TTCAAAGGAA 4260

AGGTGAAAGA TTTGAAGATA ATAGATGTGC TTTTGAAAAA TATATCTCAG GTTGTGTTAA 4320

TTAGTAATAA ATGGACAGGA TTATTTATCT TAATAGGATT ATTTGTAGCC GATTGGACAA 4380
TTGGATTAGC GGCTATTGTA GGTAGCATCA TCGCCTATAI TTTTGCGCGT TTTATAAATT 4440

ATAGTGAGGC AGAGATTAAT GATGGGTTAG CTGGATTTAA TCCAGTGCTA ACTGCCATTG 4500

CGTTAACAAT CTTTTTAGAT AAGTCAGGAT TAGATATTGT TATAACAATG ATAGCAACTT 4560

TATTAACGTT ACCAGTTGCT GCTGCAGTGA GAGAAGTTTT AAGACCATAT AAAGTTCCGA 4620

TGCTGACGAT GCCTTTTGTC ATTGTGACTT GGTTTACAAT TTTACTTTCA GGACAGGTTA 4680

AATTTGTAGA TACATCGTTA AAGTTAATGC CTCAAAACAT TGAAACGGTT AATTTTAGCA 4740

ACAATGATAG AATaCATTTC ATTCAGTCAT TATTTGAAGG ATTCAGTCAA GTATTTATCG 4800

AAGCGAGTGT AATTGGTGGC GTATGTATTT TAATCGGCAT ATTGATAGCA TCAAGAAAAG 4860

CAACACTCTT AGCTGTTATA GCTAGTTTGT TAAGCTTTAT CATTGTAGCT CTATTAGGTG 4920

GTAATTATGA TGATATTAAT CAGGGATTAT TCGGTTATAA CTTTGTATTA ATGGCAATCG 4980

CACTAGGATA TACATTTAAA ACAGCGATTA ACCCTTATAT TTCGATTTT TTAGGTGTGT 5040

TATTAACAGT AGTGGTGCAA CTAGGTACAA CAACATTGCT TGAACCGTTT GGCTTACCTG 5100 

CATTAACATT GCCATTTATT ATCGTGACAT GGATTTTATT ATTTGCTGGT ATTAAACATG 5160

ACAAAGTAGA TGCTTGATAG TTAAATCAAA CCTAATATTG TTTGAATATC ACCTTAAACT 5220

ATACAGCGAA TTGTATAGTT TAAGGTGTAT TTTTATGGAT AAAATTAAGT GCATACTTAA 5280

TCGAATTATC AGTTTTCTTT TAATTGTTCA CAGCACGTTC CTTTTTATTT TTAAAGTCGA 5340

GTGTTAAACT AGGAATAAAT AATTTATATT GTGTGTTGTG TGGGGTGACT AATATGAATG 5400

ATATGGATAA TTCCTTTTTA ATAACAACEG AAATTCAAAG AAAATGGATT GAAAAATTCA 5460

AAGTAATTAG AGATACATTT AAGGCTAAAG CTGAATATAA TGATCAACAT AGCCAATTTC 5520

CATATAAAAA TATTGAATGG TTAATTAAAG AAGGTTATGG AAAATTAACG TTACCAAAAG 5580

CATATGGTGG TGAAGGTGCG ACCATAGAAG ACATGGTTAT TTTGCAATCA TTTTAGGCG 5640

AACTTGATGG TGCCACAGCA TTATCTATTG GTTGGCATGT GAGTGTCGTA GGACAAATTT 5700

ATGAACAGAA ATTATGGTCT CAAGATATGT TGGAGCAATT TGCTGTTGAA ATTAATAATG 5760

GTGCATTAGT TAATAGAGCA GTTAGTGAAG CTGAAATGGG TAGTCCAACA AGAGGGGGAA 5820

GACCAAGTAC ACATGCTGTT AAAGCTGATG ATGGGTATAT TTTAAATGGT GTGAAGACAT 5880

ATACATCAAT GAGTAAAGCA CTAACACATA TTATTGTTGC TGCTTATATA GAAGAATTAG 5940

AAAGTGTTGG TTTTTTCTTA GTAGAC 5966 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17310 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

CTGTGTCATC GCGAAATAGT TAGGGTCATT (ATTAATCCT TTTGAACGTA TTTCATCAAA 60 

ATATAACAAT TTCATTAGTA AAGGGGACTT GTTCAAACCA GCTATAATAC AAAATAGACC 120

TATAGTCACA CTGCTTATAA TATAAGAGGT AACGATCACT TTTTTGCTAT TACCTAACTT 180

AAAGATGATC ATCCCTAAAT AGAAATAAAT GACTACAAAT GCATATTTAA CTGTAGRGC 240

AAGAACTTCC TTAACCGTAA TAAATATCAA ATCATCAAAA AATaGCaAAC AArGCGTAAT 300

AATCATACGA TATGTATACA AAATAATGAm AAACTGTmAA AAATGATTTG CCTTTAATAA 360

ATGGTTAGCG AAAAACAGTA AATAAACTAA TATTAGTAAT GTGATAAAGT CAGCTATAGA 420

AACATTCACA CCGGCAATAA CCGAAGATTG CTGAATAAAA ACCGCTAAAC CGATAAGTAA 480

CAATGTTAGT AATTTACTAT TGTGTTGATT TTCCATTATA AACGTCTTCC ACTTCTTTAA 540

TCATTTTCTC CTCAGTAAAA CATTCTAAAT AACGTTTTCT AGATTGATTA CTCATTTTGA 600 

TGTAATCACT GTCTATTAAA TATTTTTCCA GGACTTTAG AATAGTTTCG GGTTGGTTGT 660 

TCATCATACA TATACCATTA TCAGCTACTA ATTCTGAAAT ACCGCCAACA TGACTGGCTA 720 

TAACAGGCAA ACCTGTAGCC ATAGCTTCTA TAATGCTAAT TGGCAAACCT TCATGCTTAC 780

TTATTAAAAT AAACGTATCG TATTGTGATA ATAAATGACT CGCATTAATG ACATTGCCCA 840 

AAAATGTGAC ATCATTTTCT AACCCAGCTT GTACAACTTG TTGCTGACAA TCATTTAATG 900

TAGGTCCATC GCCTATAAAT GTAAAATGCG CATGATTACT GTTATGTAAT TTCAATATCT 960

CTATTGCCGC GATTAGATTT TGTGGCAATT TTGGATAAGC AAATCTTGCA ATCATAACAA 1020 

ATTGATGCTT TGTCGGGGCA TTAATCTGTA AATCTTGTTT ATTAGGCAAC ATTCCAACTA 1080 

CTTCGCCAAT ATTGTTATGT GATTGGCTTT TTAGCGTTTG CTTAACAGCG GGAACATCTG 1140

CAATACCATT ATGTATTGTG GTTAATTTCA ATCGATTAAA TCGATATTTT AACGCTAACT 1200 

GTTTATCGAA ATCTGAAACA CAAATAATGC TATCTGTAAT AAGTGEATT AATTTTTCGA 1260

TAACTAAATA TAGAAATTTT TTAGCTGGTT TAACACCCTC TGTAAAAGCC CATCCATGTG 1320

CAGTAAAAAC TATACGTGTG TCTTTCGATT TCGAAATGAa CTtCGCAATT CGTCcGACCG 1380

TtCCAGCTTT GGAAGAATGT AAATGGATAA CATCAGGTTT AATTTTCGAG AATAACTGTG 1440

CTAACACTTT GACAGCTAAA ATATCTTGTT TAAAGTCAAT TGGACCTACT AAATGTTCGA 1500

TAATAATTAC ATTAACTCTT GCATCTAGTT GTTCAATCAT TGGTCCATGA TTGCCTACAA 1560 

TGACATAAAC ATCATTGTGT ACGCAAAAAT GGTTGGCGAG TTGAATGAGA TGTGTTTGTG 1620

CACCACCATT GTCTGCTTTA GTAATACmT ATATAATTTT CAACTGTTAC AAACCCCTTT 1680 
AATGCTATAC TTTCAATTTC TTAACATGGC TATCTCATCA GATGAATAGT ATTTATAGCC 1740

ATGCAAATCA ATGATGGCAC ATATTTCTTA ATGCCATTTG ATACTGTCTC AAGGGATTCC 1800

TCGTTATACT GTAACAATTG GTCACAATCT TTAAAATATA ACTTTTATTT GASTTATTA 1860

AGTAAATTAA GACTACCTTG AGCCTTCCCC TGTAATAACA ACCATCAATG TTCTAATTGA 1920

TATATATAGT TCCATCATTA AACTACCTTT ATGTATATAT TTCATGTCAT ATTTCAGTTT 1980

TTGTTGCGGT GTTAAGTCAT ATCCACCTTG AATTTGCGCA AGTCCTGTTA ACCCTGGTGT 2040

AACAAGACAT CTTTGCTCGA AACCTATCAC TTCTGAACTA AATAATTCTA CAAATTCCGG 2100 

ACGTTCCGGG CGTGGTCCAA TAAAACTCAT TTCCCCTTTA ACAACATTAA TTAGTTGTGG 2160 

TAATTCATCA ATGCGTGTTT TACGAATAAA CTTCCCGACA TTTGTTATAC GATCATCATC 2220 

TTTATCAGCC CATTGCGCAC CGTTTTTCTC TGCGTTTTG CACATCGAAC GTAATTTGTA 2280 

TATTTTAATT AATTTACCCA TCTTCCCAAC TCTAACCTGA CTATAAATAG GGTTTCCTGG 2340

CGAATCTATG ACGATAGCAA TGGCGAATAT AACCATAATC GGTAAAGTTA AAAATAATAA 2400

AACAATGCTT AAAATTAAGT CAATCGCACG TTTAATTGGG TAATAGCTTT TTCTCACTTC 2460

TTCTAGTTTG TCTAATTTTC TTTGATAGGC ATAACCCTTA TTATTATGGA CAGCTTCAAT 2520 

GTCATTTTCA CGACATTTCC ACTTCTTTAG CTGTAACTTC TTCAATATTA TCTACCTCAA 2580 

AATTAAAGTA ATCCTTTAAA CCTGTTTCTA CTGTATATTT AGGAACAAAT CCTAATGCCT 2640

TTAAGTTAGA AATATCTGCA TAAGAATGCT TAATATCTCC TTTTCGTGCT TCTTTAAATT 2700

CATGCTCGAC TGATTTTCCA TATAATTCAC CAATAATACG ATAAACCTCT AATAAATTAG 2760

TAAAAGTGCC TGTACCAATG TTATAACCGT GTCCAATTGC ATCTTTGTGT TCCATAATTA 2820 

AGCGTACAGA TTGAACAACA TCATATACAT ATACAAAATC TCEAGTTTGC AGTCCGTCAC 2880

CAAAAAATGT AAATGGCTTG TTATGCTCAA ATGAATCGAA CATCTTTGAA ATCACACCTG 2940

AATATTGTGA CTTAGGATCC TGTCTTGGCC CAAATACATT AAAAAATTTA ACAACCGCTG 3000

TTGGTATGTT ATATAACGAA CAATAATTTA ATGTCGTCCG TTCGCCGTAA TATTTATCTA 3060

TTGCATATGG TGATAATGGT AAGATTAATG ATTGATCACT TTTAGGCAAA TCAGGAAGAT 3120

CACCATAAAC AGCTGCTGAC GAAGCAAAGA TAAAACGTTT TATATGATTA TTATATTTTT 3180 

TAATGATTTC TAACAATCTT AATGTTGCTA CGACGTTTAT TTCTTGAGAT AAGATAGGTT 3240

TCTCAACCGA CTCAGCAACA CTAPCTAATG CTGCTAAATG AATAACATAA TCAAATTGAT 3300 

ATGTCTTCAT GATTTGTTCA ACTGCATCAT ATTCACGAAT ATCTAATTCA AACACATGAT 3360

CGTCAGCCAA ACTTTTAATA TTTTCTCGTT TACCTGTTCT ATAGTTATCT AGAACATAAA 3420 

CATCATAATC TTGTTGTAAA TCATCTACTA AATGCGACCC AATAAAACCA GCCCCACCAG 3480

TTATCAAAAC TCTTTCCAAA TCTTCCACCT CATTTATACA TTAAAAATAT ATCATAAAAA 3540

CATAAAGTAT TGTAAGCTTT TTATCGATAT TTTTTATTTA TAAAAATAAA ATGAGATAAC 3600

TTTGTGAATT TTTATTGAGA TAAATTAGAT AGTGGTGTTT TTGTGATGTT TTATAATATC 3660 

TTGGGTGTGT TAATACTAAT AATGCTTTCA ACTGATGCAT TAGACTGTGA CATCATAACT 3720

CACTTAAGAA CTTCGCTTAT TAATTTTCTA CCAATACACT CCCTTCTAAG TGCACTAAAA 3780

AATCCTTACT GCTAAGTGAT TAAACTTAAC AATAAGGATT TATTTATCAT TAGTGGATGA 3840

TTATTAACGG AATCTCATAC CACCATCTAC ATAATTGTT TGTCCAGTAA TGTAATCAGA 3900 

GTCTTTACCA GCTAAGAAGC TCACTACATT TGAAACATCT TCTGGTTGAG AAACTCTGCC 3960 

CAAAGCAATC TGACTTGTAA ATTGTTCCCA ACCCCATGCT TCAGGTTTAC CTGCTTCTTC 4020

GGCTGTTGCC ACTGCGATAC TTTCCATCAT TGGTGTTTGA ACGATACCAG GTGCGAAGC 4080

ATTCACAGTA ATACCTTCAG ACGCTAAATC TTGTGCGGCT ACTTGTGTTA AACCTCGCAC 4140

TGCGAATTTT GTACTGCAAT ATAAAGACAA GCCTGGGTTA CCCTCAACGC CTGCTTGAGA 4200 

TGTTGCATTG ATAATTTTAC CGCCATGATT GAATTTTTTA AATTGTTCAT GTGCGGCTTG 4260

AATACCCCAT AGCACACCTG CAACGTTCAC GCCATATACT GTTTTAAACT GTTCTTCAGT 4320

AATTGTATCG ATTGGTGTTG TTGGTCCAAG GCCGGCATTG TTAACCATGA CATGGAAATC 4380

GCCAAATTGC GCGGCAGTTT GTCTTAcTGC GTTAAATACA TCATCACGGT TTGATACATC 4440

TGCTTTGATA GCAATAGCTT TTGTACCATC ACTTGATAA 1 TTAAGTGCAG CTGCTTTTGC 4500

CCCTTCTTCA TTGAAATCAA CAACTGCTAC TTTGAAACCA TCTTCCACTA AACGTTCTGC 4560

AATTTTAAAA CCAATCCCTT GTGcTCCGCC AGTTACTAAT GCTACTTTGT TGTTTGTCAT 4620

AAAGATCACT CCTCAAATTT CTTTCCTTTA ATTACATTTT ACTCCTCTTC ATTTGAATAG 4680

TACAACAAAG GTAGCTCCAT TTAACAAAAT ATTCAGATAT TTAAGGTATA GTTAAACGCA 4740

CTACCATTAG TGATTGGCAA TGCGTTTAAA TGTCGTTTTA AAAGTTCTTA TGTTGAATAT 4800

TATTTTTTTA AGTCTCTCGA TTAGTTTGTC ATCAATCTTT TTTCGAGACA TGGTCTTTTG 4860

ATTCAATAGG CGGTTCCGTG TTATCACTGA CAACTTTAGT TGTAGCTTCA TCTTTATGTA 4920

TTTCTTCGTT AAATCCTTCA AGGTTTTTAG TCGTGGGATT TTTAACCTCA GGATGTTCCA 4980

TCATGTCTTG ACTATCAAGT TCCTTTTTAC ACGTGTCTTT ATGTGATGCT TGATTTGCGT 5040

TCCCTTTACT TTTTTGAATA GTGGTAGTAT CTGCTGCAGC TACTAfflTTT TTTCTACCTA 5100 

AAATAGATAT GGCTGAAACA AACCAGAGTA TTGCAGATAC AAAGTTGCAT AATACTAAAG 5160

CGATAATAGC CAATACAATT AATATGACAC CTTTTGAAAT CCTTTCTTTA AATAAGTCAG 5220 

ATGCCAATAC GATGACAGGT ACGATTGAAA GTATAATTAC AAATATAGAA ATTATTGCCG 5280

ATATAACTAT TGTTACTATT AAATAATCAG CTCTGCTACC TGATAATAAA TAGAAAAGGC 5340

CGAAAATTAG TCCATAGCAA ATTACAAACC CACATAAAGT TATAGCCATG AGTACTATAT 5400

AAGCTATTTG AAAATATAAA CCTATCTTTA TGAATGATTT TTCTACATTT TTTTCCATGT 5460

CTATTCCCCA TTTATTTAAA ATTTATACTT TACCTTAAAT ATTCTCTTTA TTCTTTAGTG 5520

ATTTTATCTT TAGATTCAAA TTGATTCTCT GTACTTTCAA TATCAACTTT TTCATTTTCG 5580

TCTGTCGATT CATCTTTTGA GTATTTATTC CAAATCAGCA AAATACCACC AATCAGCCAT 5640

AAAATTGACG AAAGGAAATT ATATAAACAC AGTGCAATAA TAGCATAAAC AASAAAAGT 5700 

GCACCTCCGA TTACAGAGTA ACTTTCCATA TAAATCGCAG TAAAGATGGT TGGTAAAACA 5760

GTGAAAAGAG CCAATATTAA TCCTAATAAA AAAATTGTTT CGTAATCAGA TCCTCCAGCA 5820

ATATTAATAG ATATCATCCT AACAAAAACG ACACTAAAAT ATATTTGAGC TACGATGCCT 5880

ATCCAAATTG CTATTTTTCC TATAATTGAG CTCATACTCA TTCCCCATTT ATTTAAAATT 5940

TATACTTTAC CTTAATATAC CTTATTTTAT TTAATTTTTA TATGCAAAAT ACAAAAATGG 6000

AGAACTTCAA TATTTATAAA ATATCAAAAG TTCTCCACAC TATATTGTTT TATTATATTT 6060

TCGCTATCAA TACGCTAAAT CATCATATTT CCCT(AACAT CACAGTAAAA CTATTGCTCC 6120

ACCAAGTATA CTTTTTCAGT ATAGCATCTC ATATTGGCAA ACTTATCTGC TAAAAACTGC 6180

TTCCAATTGC GCAGTTGTTC AACATCATCA TCTTGTTTAA GTAATGCCAG TGGTACTTGA 6240

AGATTAAGAC ATCGTCCTGA AATATTAAAG CGTGTCACAC CTGCTGGCAC AGTTTCCCCT 6300

TTATGAACAA CCGCTTCAAT TTCCTTATAA CTCAATGGCT GATACTTCAT GAGTACATCT 6360

TGTTGAGAAA GACAAGGATA TGTACCTTGT GCAATTCTCT CTACAGAACA ACAACCACTA 6420

TAACTTGCGA CAACCTTTTC CCATACTTGA AAATGTGCTT CGCCTAAATC TTTTGTATAC 6480

AAATATTGTT CTGTATCACC ATGACACATT GTAATAAATG GCGCTTCTTG TCTTGTCTCA 6540

GTAGTCCATG GCAAGCGATG TTCTTGTTGT AACGTTTCCC ACCACACACC AAATGGAACT 6600 

TTATGTTGCC ATGTACTAAT TGAATATTGT GTTTCATGGA TTTCTTGCAC TGGAACTTTC 6660

TTACATCCTA ACGCTTTCAA ACTTGTATAC CGATGCACAC CfflCTATAAC CATATATCTA 6720

CCATGTTGCA TCGCTGTCAC TAAAATAGGA TGACGTATAA AATCATCTGC TTCAATACTA 6780

CTTTTCGTTT TTTCCAATCT TAAAGGTTCG AATGTTTCGT GAAGATCAAT CTTATCTACT 6840

GGTACCAATT TTAAATGTTC ATGAATATGA TTCAATAGTT ATTCATCCTC CTTTGTTTGT 6900 

GTTAAATAAA TAAATTCAGG ATGTGGATGG CTTAAGAAAT CGTGATGTGA AATAGACCAT 6960

CCGTATGCAC CTGCATATTT GAAAACAATA ACGTCGCCTG TACTGATTGC GTCTATCTGT 7020 

ACTTCTCTAG CAAAGACATC TTTCGGTGTA CATAATTGAC CGACTAACGT TGTGTCCTGT 7080

CTCGAAATTG AAACTTTTTC AAATGAATAT GGATTGTCCT TATAGCGATA AATGTCAAAA 7140

GGATGGTTAT GTTGCCAAGA TACCGGCAGT CTAAATTGTT GCGTACCTCC TCTTAATATG 7200 
GCATACCAAG CACCATGTAC TTTCTTAATG TCTAGCACTT CTGTCACATA GTAACCAATA 7260

TGTGCCACAA TAAAGCGCCC ACATTCAAAG TTCAATGTCA CATCTTCCAT TTCTTGCTCA 7320
ACGATAAGTG TTTTAAAACG TTCTACAAAA TTATCCCATT CAAATTGGTT AGTTAAATCT 7380 
GCATAGTTAA CGCCTATGCC ACCACCAAGA TTGATATGTT TGAGTGGAAA TCGATGTTTT 7440 
TCAGACCATG CCTTTGCTTT TTTAAAATAA AGTTTCACTA CATCGACATG TAAATTCGAG 7500 
TCTAAATTGT TAGAAATAGA ATGAAAATGA AATCCATCTA GATGAATCTT TGGCATTGCG 7560
AGCGCAgcTT cAATGACATC ATCAACTTCG TCTTCAGAAA TACCAAATTG TGTTGGGCGT 7620

CCTGCCATAT GCAACGTTGC ATTGGGAAAT GGTCCTGCTA AATTAACACG CAATAAAATG 7680

TGTTGTGTCT TATCTTCATC TTCTAAGATG (EATTTAGCC GTTGTAATTC ATGCATACTT 7740
TCAACATGAA TACGCTGAAC ACCTTCACTT ACTGCATATC TTAGTTCCTC GTCTGTCTTA 7800

CCAGGGCCAC CAAAAATAAT ATGATTTGCT GGTTTAAAAG CAAGACCTTT TGCTATTTCA 7860

CCTTGAGATG CAACTTCGAA TCCTTCAACA TACTGACTAA TTGTATCTAG GATTTTOGT 7920
TCACTATTTG CTTTCATTGC ATAGTACATT TGACAATTAC TCGGCAATGA CGATGTAATA 7980

TGTTGCAAAT GATGTTCCAG TCCGACTAAA TCATAGATAT AATGACAAAC TGGATGAGAT 8040

TGTGCTTTTA ATTGTTCAAT AACAGGTTGA ACTATACGCA TTAGCCTTCA TCCCCTTTCT 8100 

GTTTAGACGT CGCTAGAGAT GCACTTAAAT GGCGATATAT TTTTCCGCGA TCATCACCTA 8160 
AAATAAATGT TTGTACACCT TGTGCCTGCC ATTTTGCAAT ATCTTCATCT TCACGTGGTA 8220 
ATGCACAAAA ATGTTTACCA TGTGCATTCA CAACTTCAAA AATATGTTGA ACATGTGATG 8280 

TTACTTGATC ATCACGCGTT TGCCATGGTA TGCCAAGT& CTGCGATAAA TCTGCGGCAC 8340 
CTTCGACTAT CATGTCTAAA CCTTCGACTT GTGCTATATC GTCAATGGCC ATAACCCCTT 8400 
CAACATCTTC TATCATGGCA ATCACCATAA TATGCTCATT AGCCATCTCC ATTGCATCAA 8460

GTAATGGTGT ACGTCCAAAT CTTGCCATGC GACCACCATT CAAACTTCTT AATCCTTGCG 8520
GGTAATAACG ACTTAATTTC ACAATATGCT CAACTGTCTC ACGATCTTTA ACGTGTGGCA 8580

CAATAATACC TCTCGCACCC ATATCCAACA CTTTAATGAT ATCTCTATCT ATCACTGCAG 8640

TGACACGTAC AATTGGTATA ATATGCGCTG CTTCAGCTGC ACGAATTAAA TGCGCTAGTG 8700 

TCTCATCATT AATCGCCACG TGTTCTGTAT CAATCACAAC AAAGTCATAC CCGCTTGCTG 8760
CGATAACCTC GATCATCAAT GGGTCCGGTA TAGAATTAAA AATGCCATAA ACTGAATCAC 8820

CATTGTTTAA TCTATGTTTC AGAGATAGTT GTTGCATCAT TGATACCTCC TACACCTAAT 8880

GGATTTGTAA CATGATGAAT TCTTAACTCG GAGTCACTTA ATAATGACG TGTCGTTAAC 8940

TTTTCAACTT GAATCGTAGG TTCAAACAAA TCGAAATGTT GATAGTTATT CAACTCTGGA 9000

AATGCTTCTT GATACGCCTC GATGATGCCT TTAACCCATT GCCATTGCAG CTCCTCATCG 9060

ATACCATATT GCTTTTCAAT AAATAAGATG ATTTCGGCGA TATTAATAAA GAAAAATGCA 9120 

TCATGTAAAA AGTCGCGTAC TAAACGTTCG TCATCTGTTT CAATAAATGA ATTACTATTC 9180

ACTTTTTTAT GTGCTTCTGG CATTGGCTTT AATGTCAGGT GTGAAGCAGC TTCACTTAAA 9240

TGctCACGCT TAAAACGAAC ACCATCATGG AAATCTTTTA AGGCAATACG TGTAGGCCAA 9300

CCATTTTCAT GAATGAGCAT CATATTTT3T GCATGCGATT CAAAGGCAAT ACCGTGATAA 9360

TAAAGCATAT GAATCATTGG ACGAATCGCT ACAGCTAAAA ATTGCTTTGT CCAAGCTTCA 9420

GAACCATATT GTTTAATCCA ATTTTCAATG AATGGTACAC CATCCTTATC ACTTGCATAA 9480

AGTGCATTAA ATGGTATCGC ATCCTCTTCA TCGATTAACA TATGATATAT ATTTCACGC 9540

CATATAACAC CTAACGCACC ATAAACTTGA GTTTGTTTAT AAGGCGAAAG TTGTGTATTT 9600 

AAATAAGACT GTCCTAAGAC TTCCCCTAGA AAAACTGTCT TTAATTCATC TTTTAAATAC 9660

ATATCTTGTT GCTGTATCTG CTTTAACCAA TCCGTAATTT GCGCTGCATT TTCAATTGTA 9720

TGAGGCGCCA ACACTCGTTT CGTTGAAGTG TTCGTTATAC TTATTGGTAC CTTTAAATAA 9780

TATTTTGTCG TGTCTATTGG CGACATCGTA CGAATCGATT GTTGAGGGTG ATATAGCTCA 9840

TCACTTTCCC CTAACCATAG TACTGTGCCA TTAAGCCTTT CTTCAGCCAA ATCAACTTGG 9900 

ATGACATGTT CAAACTGCCA TGGGTGTACA GGTATJATCT CAACATCATT TACATGTTTG 9960

CCAGATGCTT CAATTTGCTG TACAAAATGT TCATAAGTCT TATCGCCAAC TTGTTGACGT 10020 

AACATTTCGT TAACTACAAC ATTTCTTGAT ACCGTCGTTT CTACTTTATC TTTGTCGATA 10080 

GCTAACCACT GCAGTTTAAC GTTTGGTACA AAATCAGGAC CAAATTTCAA ATTATCACTC 10140

AACGTAAATC CTAAACGTGA TTTGTAACTT GGATGATACT GATGCCCTTC CATCGCATAA 10200 

AATTCATAGT CGTTAAATGT CTCAGGTGTT GCTGGTGGGT TTGATTCTCG ATACTGCATA 10260

CTTTGCGTAT CTTTTAATTC TGTCTGTAAT AACTCGACAA TAAATTGTTC TAGCTTTTCA 10320 

TCATTTTTAG GAAATGTAAA TACAACCTCT CTCAATAATT GTGTATAGTC TGTTGTTGTA 10380 

TCTGCCTCAT CTCCTACGAC ACGCTCAATT GGTGATGTGA TACGTATACG ATCAAAGCTA 10440 

TGTGTCTTTT CAGCAGTAAA ACGATACTCT GAATCATGTC CTTCTATTGT AAAATGACCG 10500 

ACACCGTCTT GATATGACGC TTTATACACA ACAATATTCT GffiAAATAAG TGATGATACC 10560

AGTTGGTGCA TCACTCTAGT CTTTACACGA TTAAGAATTG TTTGATTCAC AATACGATAC 10620

CTCCTTGTTA TGACAAATTG GATTTGGTAT ATGTGTATAA ATAGGGTTTG CACCACAATC 10680

ATTCAATTTA CTCATCAAAT TCGCTTTAGC CGcAATGGTC GGCGTTTGAT ATAAATCTTC 10740 

TACACAGTCA ACAAATACTG CGTTATTCGC GTATTCTTTT TTCCAAGTCA TAAGACGATG 10800 

CGCTACAAGT TGCCATAACA CAACTTCATT TCTAGTCGCT TTACCAATAG TTGATACTAA 10860
ATGTCCTAAG TGATTTACTA CAACGTAATA TTTAAGACGA TGCCATGCTT CATCATGTGC 10920

ATATACAACA GGGCTTGATG CTGCCACAAC ATTTGGCACA AGCTGTTTTT CAGTAGCAAT 10980

CGTTCTAGAT AGACAAATGC CTTCAAGATC TCTGACAAAG CATACGTCGG GTATGCCATC 11040 

TTTTAATTCA ATTAATGTAT TTTGTACATG TGCTTCTAGA CTAATGCCTG TGTTACTAAA 11100 

CAGCTTTAAT ATCGGCAATA ATGTACGATT CAAATAACAT TCAAGCCATG CTTCTGGTGC 11160

TAAACCACTT TGCTCAATCA CTTGTGATAA CTTAGACATC GGTGAATCAG GCATCGTTTC 11220

AAATAATGAC GCCAATACAT GAATATCTTT ATCAGCATGG TAATTCGGTA TCCCTTCACG 11280 

AACAATCATG GCACTATTTG TTAATAAATC CATTTCAGGT TCAACTGTTT GCCCTAATGG 11340

ATTCGGTAAC AATGCACGAT ATCCTTCTTC AAACATCAAT TTAAAATGGG GTGTTTCAAC 11400 

CTCATCTTTG ACTGATGCGA TAACTTGCGC GGCATCAATT GTCCGTTCAA TCTGTTCAAG 11460

GTCATTCGTA CGTATAAAAT TAGTGATTTT AACGTGTATC GGTAATTTTA AATAAATGTT 11520 

TAATGCTTTT GAAAATACAG TTCTAACCGA KACGTCGGG TACACTGAAT CACCGGAAAC 11580 

GCCAAGGTCT TTTATTAAAC CTTGTTCACT ATATTGCATA TACTGTGGAT GCTGTCGCAA 11640 

CACATTGATT TGATAAGGAT GTGTTGGTAA TAAAATAAAA TCTTTGGGTA TCTCTGATAT 11700

ATCTATGTCT GCTAATTGAT ACAACACTTT CTCAACCTGA TCTTCTTTAC CTTCTACTA 11760

GCGCGTGAGC AGAACATCTT GATGCACAGC TAAATAATGC AATTGGAATG ATGTATGACA 11820

TTCGGGTGCA TATTTCTCTA AATCTGCTTC TGAAAACCCA CTTGCACTCT TAGGAGTCGG 11880 

ATGAAATGGA TGACCTAAGT ATAAAGATTG TTCTGAAACG ATATAACGAT CCTCTACGTA 11940

GTCTATTGTG TTACTTTGCA AATAACGTGC CGTGCGATGA ATGCTATTAT CGATGTCAGA 12000 

CATAATTTGC GCCATATGTT GTTGCACTGC CGTTTGATTA TCTGCACTTT GAGCCATATG 12060

TTGCAAAATA CGCGCAATTG CTTCTTTATA AGTTGTTATT TTTTTACTTT TTCCATCGAT 12120 

AAGCCATACC TCTGGATGAT ACATATGATG CCCCATCGS GACCAATAGC GAAATTCACC 12180

CGTTAAAGTT TCGAGCTCTG ATAATTGTAT AGACCATTGA TGATTTTGAG GTGGTACTTG 12240

ATATAAATTT TCTTCTCTAA AATATTCATT TAAAATGCGT TCGATAGCCG CATACGCTGC 12300 

ATGTTGTATT AATTCTTTAT TTTGCACTTT TTTGTTTCAA CTCCCATAAT TTCATTAATG 12360

TGTGATCGTT GATTTGATTA GTGATGGTTG AACAAATTAA AAATAAACTA CTTACTGCAA 12420 

ATACTACGCC CATAACGATA AACGTAGTAG CTGGTGTAGT ATAACTTGTA ATGGCAGCGC 12480

cACTaAGACT GCCAATAATT TGACCAACAA CTAACATACT GTTCGTCGTT CCAACAAATG 12540

TGCCTTTAAG TTGTTGATGA CACGCATTCA CGACAACAAA CATGACACTT TGAATCAATG 12600

CACTATATGT TAATCCTTGA AGTATTCTTG CAGCCATTAA AAACTCTATA TTCGTCGCTA 12660
AACCTTGCAG TATCGCACTA CAACCACATG CAATCGTGGC AAATATATAT ACTGATTTAA 12720

CATATGATTT ATCATTAAAG CGTCCCCATA AAGGCGCGCT TAATAtCGAA GCCGTCCAAA 12780

ATGCGGACTG TAAAAATCCA ATCACACTAC GGTCATCTAT CGCTGTATGA TTCACTGATG 12840

AAGCAAGTGG TGATAATGCA GTTAGCATGC CATACATAGC AAAGTTTGCT AAAACGCCAA 12900

CGATAATAAA TCGACATGTT TGTTGTGTGC ATAATAGACA TTGAAATGAA CGGCGAATAC 12960

CTTTATTAAT ATTTGGTGTT TGTGATTTTG GCATATGTGT CGTTTCAATC AATTTTAATG 13020 

CACCGAAAAT ACAGACAATA AAAGTAATAA CGGCAATACT CATCAGTAAC GCACTAAAAC 13080 

CTAATATCGA AGCTGTAACA CCGCCAATTA ATGGCCCCAC AAGAGACCCT GCGCTGACTG 13140

AACTTTGCAG TCTTCCTAAT ACCTTTCG\C GATCTTCAGC TGGCGCCTCT GCACTCGCAA 13200 

ACGCACTTGA TGCATCAACA ACACCACCAA ATAGTCCCTG CAATAACCTC ACAAGTACAA 13260

ACTGTAATGG TGTCGTACAC AATGCCATTA AAAATAAGCA TACCGCCAAA CCAAGTAACG 13320 

CTCTTAACAC CATCCATTTT CGGCTGATCT TATCACCTAG CTTCCCCCAT ATGGCGAAG 13380 

ctATCATCGT CGTTACAGCT GGAGCAGCAA TCGCTATACC ACTCCACAAC TGTATTTCTA 13440 

CGACTGATAG ATTTTGTAGT GATGCCATAT AAATTGGCAA TAATGGCACA AGTACTGTCA 13500 

GTCCAGCAAT CGCTATAAAC TGACTGAGCC ATAAAATGCG AAAGTTACTG CGCCATATAG 13560

ACTGATTAAT CATATGTCAC CATTGGATTT GGTACGGTAG TTAAACCTGA AGGCATACTA 13620 

CCTCCACCAC TATCACGTTG ATATAGCAAT GGTAATAAAA TTTGTTTGAA TGGCCACGTC 13680 

TGTTTATCAA ATAAAATGTG TCTGACAGCT AGCTGATCAG TTGTAACCCA GGAAATAGTT 13740

GCCACTTCAT TTTTTAAAAT TTGTTTTAAC AACGA:ATAA GTTCATGCTC ACTTACACCA 13800 

AATAAATCTT GAATTGCATC AATAATGGCA TATAGATTTA CCGATACAGC TAATGTTTGA 13860 
AAATAAGCAA AGAATGTTTC CAAATCCTCA TTAATTAGCG TATTAGGTGT ATCTTCTCTG 13920

ACGACATACT TCGGCAATGA AAGCTGATGT GCTGTTAGCC ATGGTTTATA AATTCTGACA 13980 

GTATCATGAT CACGTAACAC GCATTTTTGT ACACGTCCAT CTTCAAATGA CAACAATATA 14040

TTTTGACCAT GCAACTCTGG TAATGCGCCG TATTGCATAA ATGATAGTGT TACCTTTAAA 14100

AAGACTTGCG CGATATCTTC AAATAACGTC ATGACATCAT TTTTAGAAAT ATTATCTTTT 14160 

CCACAAATCA TTTGATATAA AGTGCGATCA TTTGCCGCGA GTGCTGCCAT TGACACTAGC 14220

TGTTGCGTAT CATTTTTGGC TAGCACTTCG GGATACTTTC TTAGCTGAAC AGTTAGATGA 14280

CCTAATTGAT CTTTGAAAAT ATCATTATCT TGACCCATAT ATGACCACCA AGCTGTTTCA 14340

TCACAAACCA TGACATACTT AGCTAGTGCT TCATCTTTTT CB.TAAGCTG ACGTAATAAT 14400

TGTTCTGCTT GTTCTCCGTT TTTCATGTAA CGCGTAGGCG TTAGCCTTAA TGCGCCTAAT 14460

GACTGCATTG CAAATGGTAC TTTGACATGG TTATACGGTG CGCCAATATC AATTAATGAA 14520
CGCATACTTG AAGACGACAG ATAATCTCCA AATTTTAACG GTAATAGTAC AACCAACTTT 14580

TCACTAATCT CTTTCGCAAA GACGTTCGGC AGAATATGCT GATATTGCCA AGGATGTACC 14640

GGAAATAGTA CATAGTCATC TATTGATAAC CCTTGATCAT TTAACATGTC TGTCGCTTGT 14700

TCTTTTATAG GTACTGTCAA ATTTTCTAAT TCATCGATAT TTGCAGTATC GCCATGAATC 14760

ATATGTGTCT TTTTAACTGC TGCSkCCATT AAAGGAAATG ATTGATTTAA TTCAGCTTGA 14820 

TACACTTGAT AATCCGCTTC TCTTAATCCT CTTTTTTCTT TAGCTAATGG ATGAAATGGA 14880

CGATCTTTTA AACTTGCAAA CTGCTCTGAC ATCACAAAAG GATGTGACGC TAAATCTAAT 14940

TCTGATAATT GTTTAGCAAG CTGTGTGGCA GCAGTAGTCA GTCCTTCTTC AACGCGAGCC 15000

ACTTCCCATT CATGACTTAG ATCACAATTC ATATTAGCAA TTGTTTGCCA AAATTCAGCT 15060

GCCGTTAAAG GTTGCTTAGA CACCCTTCCC TCTATCGTAA TTGGTTGTGA ACTTTCGTAA 15120 

CGAAACATAT TTAAAGCACT AAAATAAACA GGTATCTTTA TTTGTTGTGT TTCACGTTCG 15180 

TATATCAAAA GCGTTTGTCC GTTTTCTTTA GTAATCTCAC TATTCGATAC AATTCCGGCT 15240 

ATATCTTCAA ATAATAATGC ATCAACTAAA TCTCTTAATA TTATCGCTTG TGCTGTATTG 15300 

ACTGCTGTAT GATTCTGCAA TGTTCAGACA CCTCGCATTC TTAATATAGG TTCAATGTTG 15360 

TCCCAATATT TTGTTGTTGT GCCTGTTGAT ^ATAAAATA AGCACTTGAA ATATCTTCGA 15420

TAGCCATACC CATCGGATTA AGTAATATGA TCTCATCATC GTCTTCACGT CCTGGTATGT 15480 

CACCTGTCAC AAGTTGTCCT AGTTCAGCAT GAAGAGCTTC TTTGCTGAAT TTACCTTCTA 15540

ACACCAATTG GTTAATAGTT TTCTTTTCTC GATTACATTG TGACCAGTCA TCTACTOGA 15600

CTTTGTCAGC TTTAATAAAG ACTTCTTTAT GCACATCCAT GATAGAAATG TTGCTAATAA 15660

ATGCACCCTT TTGTAACCAA TCATATTCAA TGTATGGTTG ATCCGTTACG GTACATGTAA 15720 

TGACTACTTC ACCATTTGAT ACTGCTTCTT TAGCATTTTC TGTCGCAATA AAATTAATTT 15780 

CCGGACGCTG TTGTTGCCAT CTATCAACAA AGCGTGCACA TGCTTCAGAG AATTGATCGT 15840 

AAACAAACAC GCGTTCAATA TGATCGAATT GCTCTAACAT ACTTTGTAAT TGCTTGTCTC 15900 

CGATTAGCCC GCATCCAATG ATTGTTAAGT CTTTAAATCC TTTTTTAGCC AAATGCTTTG 15960 

CTGCAATCAC TGAAACTGCT GCAGTACGCA TACTACTAff TAAACTTGCT TCCATAACTG 16020

CAATTGGATA ATTCGTTTCT GGATCATTCA AAATAATGAC GCCACTTGCA CGCTCCATAT 16080 

TACGTTTCGA TGGATTGTCG TGCTTACTAC CTATCCACTT AATACCTGAA ATTGCGTGTT 16140 

CACCACCGAT ATGACTTGGC ATTGCAATAA TTCGATCTGC GATGTGTCCA TTTTCAGGAT 16200

CCtGTCTTAA ATACGGCTTA AGCGGTTGTA CAAAATCATT GTGCGCATGG GCTGTTAATG 16260

CTTCTGTTAA TGCGTCCACA TAAACTTGTG AATGATTACC TCCCGCTTGT TCAATATCTG 16320
ATCTATTTAA ATACAACATC TCTCTatTCa TTCTGaTTTA ACTCCTTGTC TTGATTTCAT 16380

TTTTTCTAAC CATGTATCTG AATAAACTAA ATCTAAGTAA CGATCGCCTC GATCTGGTAA 16440 

AATCGTGACA ATTGTTGCAC CTTCTTCAAT TGACGTTATC AACTGCTCAA TCGCTGCAAT 16500

AATCGAACCT GTTGAAcCTC CGGCAAATAT GCCTTCATAA TCAATCAGTT TTCGACAGCC 16560 

CAAAGCAGAT TGATAATCAT CTACATGGAT CACTTGATTA ATTTCEATC TATTCAATAT 16620 

TTCGGGTACA CGACTAGCAC CGATACCAGG TAATTCTCTA TTAATAGGTT TGTCACCAAA 16680

AATGACTGAC CCTTTCGCAT CAACAGCAAC AATTTGTGCG TTTGGATGCA CTTCTTTTAT 16740 

TTTTCTACTC ATACCCATAA TGCTACCTGT CGTGCTGACT GGCGCGACAA AATAATCTAT 16800 

AGGTTGCTTA ATTGTTTCAA CAATCTCTGT GCCTGCACCA TGATAATGGG ATTGCCAATT 16860 

TAACTCATTC GCATATTGAT TAATCCAATA TGCATCGTCA ATAGTGGCTA ACAGTTCTTG 16920

CACCTTTGCA ATACGAGTCA TTAAATAACC CCCATGTGCA TCAGGTTCTT CAACCATTTC 16980

TACATTGGCA CCATAACTTT TAATAATTTT CAAATTTGTT GGTGATATTT TAGGATCAAC 17040 

AACACACGTG AGTTTTAATC CCTTGATTTT AGCTATCATT GCCAACGCAA TGCCTAAATT 17100 

ACCAGAAGTA CTTTCAATTA AATGTGTATT CTCAGTGATT AAACCATGTT TAATACCATG 17160

TTCAATGATG TACTTGGCAG GTCGATCTTT CATGCTGCCT CCAGGATTCA TASCTCTAA 17220 

CTTTGCAAAC ACTTCATGTT TCGGAAATAG TTGATGAAGT TGAACCATAG GTGTTTGCCC 17280

TACAGAATCT AACAATGAAT CGTGCACATG 17310
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5423 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

ATACTAGTAA GCGCATCGGT TATTGACATC GAATTCAACT TTAACAGTTT TCATGTTCGG 60

TGATGTTTCa ATAGAATGTG TGTGTTGTAC TTGCGCATTT ATATTTCCAC CTAAATTACT 120

TAAGTTTCCT GTAATACTAG AAATGTCAGG TGCGTTTAAT GTAGGTTGAA ATGCATCAAC 180

TACTTTATCT GCAACATTAG AAACATTACG GATAACTTTA CTTGAATGAT TATCTATACC 240

TTTAACGAAA CCTAACATTG AATACATACC AA:ATCCATG AATTCACGTG AAGGTGAGTG 300 

AATACCTAGC GCTCTTTTGG CTGCATTTAA AGCACCTTTT GCTACACTAG CTGCTTTTTC 360 

AGCTAAGTCT CTAGCCATAT TACCAATACC TCTCATCAAA CCACGGATCA TATCAGCACC 420

TGCTGATACA AAGTCATCCA CAAAGCTTTT AACTTTATTT ACTGCATTTG TCATACC2G 480
ACTAACTTTG TTTACAACAT TAACGAATCC TTGAACAACT CTATTAACAA rGTTAATTAG 540

CGTACtTGTt ATAGTAGATA CCCaTnGCAT ACCTTTAGTG ACmATGAAGT TCCAAGCTTG 600 

AGACATTTTG TCTGATATAG TTGAAACAAC TTGTGTGAAT ATGCTTACAA CTTTATTCCA 660 

AATTGTCGTT AATATACCAG ATAAGAAACT CCAAATCGTA TTCCATATAT TAGAAATAAA 720

ACTCCATGCC GCTTGTAACG CAGTAGATAT AGCTGTAGTG ATAGCGTTCC AAACCTTAGT 780

TGCCACAGTA ACTATAGTGT TCCACAACGT TTGTAAGAAC GTCCAAATAG CGTTCCAAAT 840

TGTTATTGCG ATAGTCATAA TTGTGGTAAA CACTGTAGTT ATTACAGTGA CTAACAAATT 900 

CCAAATCGTA GTAGCGATTG TAATTATCGT ATTCCAGATT GTACTTAAGA ACGTCCAAAT 960

AGCTGTCCAT ATCGTCATAA CTATTGTCAT TATCGTCGTG AAAACAGTTG TAATGATTGT 1020 

AACTAAAAGG TTCCATACTG TTGTTGCAAT AGCGATAATT CCATTCCATA GCCCTTGTAA 1080

ATAAGCGACT ATTTGATTCC AAACAATCAT TATAAAATTG TAAACATTCG ATACTGCTGT 1140

AGTGATAGCT GTTAAAATAG CATTCCATAC AACCGAAGCT ACAGCTTTTA ATACATTCCA 1200

AACATTAACC ATAAACGTTT TTATCGCATT CCAAGCATTT ATAATAAAGT TTCTGAATCC 1260

TTCATTTTTA TTCCACAATA iAACGAATAT AGCTATTAAT GCAGCAATTA CACCAATTAC 1320

TATTGTTATT GGACCGCCTA AAATACCAAA CACAGTTACT AGTCCTGTGA TAGCATTTCT 1380

AATTAATCCA ATCTTACCGA ATAACAATTG GAATATAACT GATATAATTT TTAATGGTCC 1440

TTTTAATAAC ATGAACGCAC CTTTTAAAAT TGTTAATCCC GCTCTTATA AACCGAACTT 1500 

ACTTACTAAT GCAATGrTTC TACCTATTAA TCCGCCACCC ATAAAGTTAG ATACAGCAAG 1560 

AATAATCGGT ATTAAAAATC TAAATGCACC AACTAAAGTT ATAATGACAC CAACTAATTG 1620

TGCTGTAGCT GGATGCGCCT CAAACAAGTT AGCTATCCAA CCAGTTATTG CAACTGCAAC 1680

GCGTAATACT GCACTAGCTA TAGGAGCCAT TGCTGTTGCG AATGCArmTA ATCCTCTTGC 1740

GATGTTTCCA ATCAATTGCA TTATTAGTGG TCCATTTGTT TGTATATAAC TGACAAAGTC 1800 

TTTAAACCCT TGAGATTGTC CTACTTGTTC AGACCATTCC CTAAACTTAG CTGTCATTTG 1860

TTCAAGAGAT TGGAATATGC CAGTTGATGV TCCGCTGAAT GCATTCATCA AATTGTTAAT 1920

TCCAACGAAA ACATTTTTGA AAATATTACC AATGATAGGT AAGTTTGTTT TTGTGTATTC 1980

AATAAAACGA GTTATCGAAT TTTCTCCAGC TGCACTATTA GCCCAGTTAG AGAAAGATTG 2040

ACCTAATCTA TCCAACCAAT CAGCCGACCA TTGAAACAGT GGTGCTAATT GCGGAATAC 2100

ATTGACTAAT CCGTCACCAA AACCACCTGC AGCACTTAAT AGCTTGTTAA ATACCGAAAC 2160 

ACCCGTTGTA TTCATCATAT TAAAGAATCT TGAAGCTACA CTGCTATTTT CAGCCCATTT 2220

AAGCACGCTT TGAGACGCTT CTTCCATTCC TCTTGAAATA CCACTAAAAA ACGGTTGTAA 2280
GCTCTGCATT GCAGTTTTAA CAGTATTTAA ACCATTTGCA AGAGTTGTGA AGATAGCGGA 2340

TTGATTTTGC TTTATAATAT CAGTCCATGC TGACTTTACG CCATCTAACG CTTTTTTGTA 2400 

TTCGTTTGTT GCTGAGCTAG CTTGTAAAGT GCCATCATTA AGCATCTTTA TAGCGCTGAT 2460

AGCCATTGCG CCAAACGCTA CAAATCCTGC TCCCGCTATT GCTACGGCAC CACCTAAAGC 2520 

AAGTACACCA CCAGTTAACA CTTTGATAGC GTTTAATAGC GCAAATACTA CAGGTACTAC 2580

GCTCGCTATT ACAGGTATTA AGATACTAAA AGATGATGTA AGTAATCCAC CAACCATATT 2640

AGAACCTACA GTACCGAACA CACGGAACAT ATTAGCTAAA TTCCCCATCT GTCTTTGAAA 2700

ATTGTCATTT GCTTTTATTA TGTAGGCATA AGCTTTCTTT AAACCATTAG TATCGACATC 2760

TACCTTTGTT GTTTTTTTGT TCGGCAATGC GTCTAATGAT TTTTTAAACG CATAAATAGT 2820 

TGGTATAGAA AGCCCTGTAT CTACATCAAG TCGAGATCTA GTTTTGTTCG GAATACTTTT 2880 

AAGTTCTTCT TTAGTACGTT TGATTTTAGA GTTAGCAACA CCATTGTCCA CGTCTATAAT 2940

AGCTTTGGCT TTAGACCTAT TTAATGCTTC GAGACTAGCT TTAGATACTT TTAACACTCG 3000

ATTGAATTTA CTGTTATCTG CATTGACGTC AATATTGACA CGTTTCTTTT CTAATTCTGA 3060

TAATTTAGCT TCTGTTTCAG CGATATCTTT AATCAACTTT TGTTTTGCA ACTTAACTTC 3120 

TGGTGTAACT TCTTTAGAGT TTAGTTTGTC TAGTTCAAAA TTCGATTCTA GTACCTTTTG 3180 

TTGTAAATCT TGTATACTAG CATCTAATTT AGCTTTTACA TTTTTGTTAC TAAAGGCATC 3240

TAAAGACTTT TTAGCAACTT TGATAGTTTT TTGTAAATTT TTATCGTTAG CGTTTAATTC 3300 

AACATCTTTA GTTTGATCTG CTACTCGTTT AAATCTTTGC ACAGACTTAA CCGCACTATC 3360

AATTTGCCTT TTGAATTTGG CTACACTAGC TTCAATAGTC GCTTTAATTT TATATTCCGT 3420 

CACATTAACA CCTCTCTTTC TATTGCTTAT TAAATTCTGC TATAACTTTA AAGAATTCAT 3480

TATTTTGTGG TTCGTATTCA TCACCTTCGC TACTAAATCT TATATCTTTA CCTTCGTTAA 3540

GCCGTTGGAT ATTTTCTTCA TAAGGCAATA CGTCGTTTGC ATTGTTAAAA ACATATTCCT 3600 

CTTTAGGTTT ATTTTCTGTC CCAACATTTT TAGTAGCTGC AGCATCACGA ATAGCAAACG 3660

CAAGTTTGTA ACGTTCGAAT TCTTGGGTTA GCATTTCATA CTCTTTCGCA TACATTCGAT 3720

AGTTATATTC TGTTAATGTC ATTTGCTCAA TAACGTTCAA ATCTGTAATA CCAAGTGTTG 3780

ACATACAAGT TATAACGATT CTGTCGTAAG TTATTAGGcT TCCGCTGGTT TTTCTTCCGT 3840 

TTCCACTACT TCGACTAGGT TTCGGGTCAT AGGTCGCTTT CCCAAcTCCG TTAAAATATC 3900 

CGAACCGAAT TCTTCTAGTC CGATATTTTC TGCGATTTCA TCTAATGCTT CATCAATGTT 3960

ATTAATAGTA ATTGCTTGTT TTTTTAAGTG AGATGTAGCT GCGATTAAAA cTTCGCCAAT 4020

CACAACCGGA TTTCCACTTT CTAAACCTAC AGGCAACATT GATACACCTT GACCGATAGA 4080

AGCTTGTTCA ACTTTTAAAC CTAATCGGTT ATJGATTTCT CTTAAAAATT TAAAACCAAA 4140
ACTTAATTCT AATGACTTTC CGTTAATTTC TACATTCATA ACTTAAAATC TCCATTCATA 4200

ATTAATTTAA ACAAAATAAA mArGCTTAAC GCCCTATTTT TATACCTCTC TTGGTGCAAC 4260

CGGTGGTGAA TCTACTTTAG GTTGTGGAAT TGCTGTTAAA TCTTCGCCAG TTAATGCAC 4320

TGCTTTTGTA GTGTCGTGGA ATCTGTATcC AGTCGCCTTA AGTTTCTTTG TTACAGCCTC 4380 

AGGTAGTGTT GCAAATCCAC GTTGGAAACG ACCATTCACT CCATATTCAT ATTCATATTC 4440

ATCAATACCG TTAGCTTCTG CTTTTAATTC AAATTTATTG TGGAAACCTT GGAAATATTT 4500

CGCTTTAAAT TTAGCGGAAT CCCCATTTTT GCCTGGTATT CTACTTTCAA CTTCCCAAGC 4560

TTCATACAAT ACGCGATCTA CAACTGCATC TTCAATTTCA TCTGCAAAAT CGTCAGCATA 4620

AAACATTTTA GCAGTACCAG ACATTGTTGA CTCAACAGAA CCACCAGTGT TATAAGAACC 4680

GTCCATTGTA TCCTCTGTAT CTGTATCAGC TTCATGTGAT AAGCCGTATT CAGTTAAAAA 4740 

AAGCATTTTA GTAGCATCTA CTTTTTCGCC AGCTTTTCTA AATAAAATAA TACGATCATT 4800

ACTATTTTTC ATATTTGCCA TTCAATATTC CTCCGTTTTT TAAAATGTTT TGTAAGATAT 4860

CGTTACTGAT GTGTGTAGCA ATTCTTGATT GGTAGTATCA TCAACTAACT GTGTGATGTT 4920

AGTATCTTCT TCTTCAAAGT CATAATCGTT TGTTTTAACG CTAGGTGTTA AATCATCAAT 4980

ACATCTTTTA ACAAGTCCGT CATGATGTCC TAAATCATCG CTTACACTCC AAATATCAAT 5040

AACTAAATTC GTATCGCCAG AATAACTATC AAACGTGTAC TTACTTCTAT TTGACTCCGG 5100 

CATTTTTATT ACAAAAAAAG GYTACGGAAT CTCTTGTTGC ATCTCTTTAC GAGAAATAAC 5160 

AGGGAATCCA TATCCTTGTA GCGTTTCATA CGCTTTATTA TAAAGTTGTA AGTTCGGTGT 5220 

CATGCTTTTA TCTCCTATTC AAACAACGCT TTCAATTCTT CTACAGTTGA TTTCCTAATC 5280 

ACTTCGTATA CCGGCCACAT AAAAGGTTCA GCCTCCATGT ATCGAGffiCC AAATTCTAAG 5340

AAACCACTAT AAGCTGCGTG CGATGTGATA GTGTATTGCA AATCGCCAGT TTTTTTATAT 5400

CTGATATTGC GTGATaAATT ACC 5423 



(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6251 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
AAACGCAGAT GTTCAATTAG AACCAGTCTA TCGTATTAAG GAAGGTATTA AACAAAAGCA 60 
AATACGAGAC CAAATTAGAC AAGCGTTAAA TGATGTGACA ATTCATGAAT GGTTAACTGA 120
TGAACTAAGA GAAAAATATA AATTAGAGAC CTTGGACTTT ACTTTGAACA CATTACATCA 180

TCCTAAAAGT AAAGAGGATT TATTACGTGC TCGTAGAACC TATGCATTTA CTGAACTGTT 240

TTTATTCGAA TTACGTATGC AATGGCTAAA TAGATTAGAA AAGTCATCTG ACGAAGCAAT 300

TGAAATTGAT TATGACATAG ACCAAGTTAA ATCATTTATT GATCGTTTAC CTTTTGAACT 360

AACTGAAGCA CAGAAATCCA GTGTTAATGA AATTTTTAGA GATTTAAAAG CACCAATACG 420

TATGCATCGA TTACTTCAAG GTGATGTAGG TTCAGGAAAA ACAGTAGTTG CGCAATTTG 480

TATGTATGCG TTAAAAACTG CTGGTTATCA ATCAGCATTG ATGGTACCAA CTGAAATTTT 540 

AGCAGAGCAA CATGCTGAAA GTTTAATGGC TTTATTTGGA GATTCTATGA ACGTTGCATT 600 

GTTAACTGGG TCAGTAAAAG GTAAGAAACG AAAGATACTT TTAGAACAAC TTGAAAATGG 660

TACGATTGAT TGTTTAATTG GAACCCATGC TTTGATTCAA GATGATGTGA TTTTCCATAA 720

TGTTGGTTTA GTAATTACAG ATGAACAACA TCGATTTGGT GTGAATCAAC GCCAGCTTTT 780

AAGAGAAAAA GGTGCAATGA CGAATGTGTT ATTTATGACA GCAACGCCGA TACCAAGAAC 840

ACTAGCAATA TCAGTTTTTG GTGAGATGGA TGTSTCTTCA ATTAAACAAT TACCAAAAGG 900 

TCGTAAACCT ATCATTACTA CTTGGGCAAA GCATGAGCAA TACGATAAAG TTTTGATGCA 960

AATGACCTCA GAGTTGAAAA AAGGTCGTCA AGCATATGTC ATTTGCCCGC TAATAGAAAG 1020 

TTCTGAGCAT CTCGAAGATG TTCAAAATGT TGTCGCATTG TACGAGTCTT TACAACAGT 1080

TTATGGTGTT TCCCGTGTAG GGTTATTGCA TGGTAAGTTA TCTGCCGATG AAAAAGATGA 1140

GGTCATGCAA AAGTTTAGTA ATCATGAGAT AAATGTTTTA GTTTCTACTA CTGTTGTTGA 1200

AGTAGGTGTT AATGTACCGA ATGCAACTTT TATGATGATT TATGATGCGG ATCGCTTTGG 1260

ATTATCAACT TTACATCAGT TACGCGGTCG TGTAGGTAGA AGTGACCAGC AAAGTTACTG 1320 

TGTTTTAATT GCATCCCCTA AAACAGAAAC AGGAATTGAA AGAATGACAA TTATGACACA 1380

AACAACGGAT GGATTTGAAT TGAGTGAACG AGACTTAGAA ATGCGTGGTC CTGGAGATTT 1440

CTTTGGTGTT AAACAAAGTG GaTTGCCAGA TTTCTTAGTT GCCAATTTAG TTGAAGATTA 1500 

TCGTATGTTA GAAGTTGCTC GTGATGAAGC AGCTGAACTT ATTCAATCTG GCGTATTCTT 1560

TGAAAATACG TATCAACATT TACGTCATTT TGTTGAAGAA AATTTATTAC ATCGTAGTTT 1620

TGACTAATTG CCATGCTGAT TTGTCAATTT GAGTGCAACa CTTCGTTAAT TGAGTGATAT 1680

GACACTTGAA CTATTTAAAT GTAAAGTGGT ATTTTAACAA TTTATAAATT TTCGACTAAA 1740

TAATAGCTAA ATATTACAGT TATTTGTTGA GTCGGTTAAA TAGAAAGTGT TATGATATGT 1800

GAGGAATGTT TAAGACTAGG TACTAAAAAA TGAGGGGTGA GACGTTGAAA CTAAAGAAAG 1860

ATAAACGTAG AGAAGCAATC AGVCAACAAA TTGATAGCAA TCCCTTCATC ACAGACCATG 1920

AACTAAGCGA CTTATTTCAA GTGAGTATAC AAACAATTCG TTtAGaTCGC ACTTATTTAA 1980
ACATACCAGA ATTAAGGAAG CGTATTAAAT TAGTTGCTGA AAAGAATTAT GACCAAATAA 2040

GTTCTATTGA AGAACAAGAA TTTATTGGTG ATTTGATTCA AGTCAATCa AATGTTAAAG 2100 

CGCAATCAAT TTTAGATATT ACATCGGATT CTGTTTTTCA TAAAACTGGA ATTGCGCGTG 2160 

GTCATGTGCT GTTTGCTCAG GCAAATTCGT TATGTGTTGC GCTAATTAAG CAACCAACAG 2220 

TTTTAACTCA TGAGAGTAGC ATTCAATTTA TTGAAAAAGT AAAATTAAAT GATACGGTAA 2280 

GAGCAGAAGC ACGAGTTGTA AATCAAACTG CAAAACATTA TTACGTCGAA GTAAAGTCAT 2340

ATGTTAAACA TACATTAGTT TTCAAAGGAA ATTTTAAAAT GTTTTATGAT AAGCGAGGAT 2400

AAAATTATGG TTAAATTAGC AATTGATATG ATGGGTGGCG ACAATGCGCC TGATATCGTA 2460

TTAGAAGCCG TACAAAAGGC TGTTGAAGAC TTTAAAGATC TAGAAATTAT ACTTTTCGGT 2520

GACGAAAAAA AGTATAATCT GAACCATGAA CGAATCGAAT TTAGACATTG TTCTGAAAAG 2580

ATTGAAATGG AAGATGAGCC TGTTAGAGCG ATTAAACGTA AAAAAGATAG CTCAATGGTA 2640

AAAATGGCTG AAGCTGTGAA ATCTGGTGAA GCAGATGGAT GTGTGTCAGC AGGTMACT 2700 

GGTGCTTTAA TGTCAGCTGG TTTATTCATT GTTGGACGTA TTAAAGGTGT AGCTAGACCG 2760

GCTTTAGTAG TAACATTGCC AACGATTGAT GGAAAAGGTT TTGTCTTTTT AGACGTTGGT 2820 

GCAAATGCTG ATGCTAAACC TGAACACTTA TTACAGTATG CGCAACTAGG GGATATTTAT 2880

GCTCAAAAAA TTAGAGGTAT TGATAATCCG AAAATCTCAT TATTAAATAT AGGAACCGAG 2940

CCAGCTAAAG GTAATAGTTT AACGAAAAAA TCATATGAGT TATTAAATCA TGATCATTCA 3000

TTGAATTTTG TTGGGAATAT TGAAGCGAAG ACATTAATGG ATGGCGATAC AGATGTTGTA 3060

GTTACCGATG GCTATACTGG GAACATGGTC CTTAAAATT TAGAAGGTAC TGCAAAATCA 3120 

ATCGGTAAAA TGTTAAAAGA TACGATTATG AGTAGTACTA AAAATAAATT AGCAGGTGCA 3180

ATATTGAAGA AAGATTTAGC TGAATTCGCT AAAAAGATGG ATTACTCAGA ATACGGTGGT 3240

TCCGTATTAT TAGGATTGGA AGGTACTGTA GTTAAAGCAC ACGGTAGTTC AAATGCTAAA 3300 

GCTTTTTATT CTGCAATTAG ACAAGCGAAA ATCGCAGGAG AACAAAATAT TGTACAAACA 3360

ATGAAAGAGA CTGTAGGTGA AtCAAATGaG TaAAACAGCA ATTATTTTTC CGGGACAAGG 3420

TGCCCAAAAA GTTGGTATGG CGCAAGATTT GTTTAACAAC AATGATCAAG CAACTGAAAT 3480

TTTAACTTCA GCAGCGAACA CATTAGACTT TGATATTTTA GAGACAATGT TTACTGATGA 3540

AGAAGGTAAA TTGGGTGAAA CTGAAAACAC ACAACCAGCT TTaTTGaCGC aTAGTTCGGC 3600

ATTATTAGCA GCGCTAAAAA ATTTGAATCC TGATTTTACT ATGGGGCATA GTTTAGGTGA 3660

ATATTCAAGT TTAGTTGCAG CTGACGTATT ATCATTTGAA GATGAGTTA AAATTGTTAG 3720

AAAACGTGGT CAATTAATGG CGCAAGCATT TCCTACTGGT GTAGGAAGCA TGGCTGCAGT 3780
ATTGGGATTA GATTTTGATA AAGTCGATGA AATTTGTAAG TCATTATCAT CTGATGACAA 3840

AATAATTGAA CCAGCAAACA TTAATTGCCC AGGTCAAATT GTTGTTTCAG GTCACAAAGC 3900

TTTAATTGAT GAGCTAGTAG AAAAAGGTAA ATCATTAGGT GCAAAACGTG TCATGCCTTT 3960 

AGCAGTATCT GGACCATTCC ATTCATCGCT AATGAAAGTG ATTGAAGAAG ATTTTTCAAG 4020

TTACATTAAT CAATTTGAAT GGCGTGATGC TAAGTTTCCT GTAGTTCAAA ATGTAAATGC 4080

GCAAGGTGAA ACTGACAAAG AAGTAATTAA ATCTAATATG GTCAAGCAAT TATATTCACC 4140

AGTACAATTC ATTAACTCAA CAGAATGGCT AATAGACCAA GGTGTTGATC ATTTTATTGA 4200

AATTGGTCCT GGAAAAGTTT TATCTGGCTT AATTAAAAAA ATAAATAGAG ATGTTAAGTT 4260

AACATCAATT CAAACTTTAG AAGATGTGAA AGGATGGAAT GAAAATGACT AGAGTGCTT 4320

TAGTAACAGG TGCATCAAGA GGAATTGGAC GTAGTATTGC GTTACAATTA GCAGAAGAAG 4380

GATATAATGT AGCAGTAAAC TATGCAGGCA GCAAAGAGAA AGCTGAAGcA GTAGTCGAAG 4440

AAATCAAAGC TAAAGGTGTT GACAGTTTTG CGATTCAAGC AAATGTTGCC GATGCTGATG 4500 

AAGTTAAAGC AATGATTAAA GAAGTAGTTA GCCAATTTGG TTCTTTAGAT GTTTTAGTAA 4560

ATAATGCAGG TATTACTCGC GATAATTTAT TAATGCGTAT GAAAGAACAA GAGTGGGATG 4620

ATGTTATTGA CACAAACTTA AAAGGTGTAT TTAACTGTAT CCAAAAAGCA ACACCACAAA 4680

TGTTAAGACA ACGTAGTGGT GCTATCATCA ATTTATCAAG TGTTGTTGGA GCAGTAGGTA 4740

ATCCGGGACA AGCAAACTAT GTTGCAACAA AAGCAGGTGT TATTGGTTTA ACTAAATCTG 4800

CGGCGCGTGA ATTAGCATCT CGTGGTATCA CTGTAAATGC AGTTGCACCT GGTTTTATTG 4860

TTTCTGATAT GACAGATGCT TTAAGTGATG AGCTTAAAGA ACAAATGTTG AC TC AAA TO 1 4920

CGTTAGCACG TTTTGGTCAA GACACAGATA TTGCTAATAC AGTAGCGTTC TTAGCATCAG 4980

ACAAAGCAAA ATATATTACA GGTCAAACAA TCCATGTAAA TGGTGGAATG TACATGTAAT 5040

ATATTTGAGC TAAAGCTCAT TGACGCAGTG GTTGACTGGT CATCCAATGG AGAATTGTCT 5100 

GACCTAGTCA ACTT1GCGGG GGAAATTCTA AGCAACCTAG ATAAGGTTCC AGAATTTCTC 5160

CCTAAGAAAC ACTAATCAAT aAATTGwTAA GTGTTTCTAA AATTTCTACT TGTTTTTTAG 5220 

AATTTAAAAT GGGAAAATAT AGTAGTCTAT GTATAGGCAT TTTTAAAGGA GGTGAATCGA 5280

CGTGGAAAAT TTCGATAAAG TAAAAGATAT CATCGTTGAC CgTTTAGGTG TAGACGCTGA 5340

TAAAGTAACT GAAGATGCAT CTTTCAAAGA TGATTTAGGC GCTGACTCAC TTGATATCGC 5400

TGAATTAGTA ATGGAATTAG AAGACGAGTT TGGTACTGAA ATTCCTGATG AAGAnGCTGA 5460

AAAAATCAAC ACTGTTGGTG ATGCTGTTAA ATTTATTAAC AGTCTTGAAA AATAATAAAT 5520

CTTACATCTG GGTCGTCAGT ATTGTCGACT CAGTTTTTTT CTTTAATTAT CAATAGTTTT 5580 

AACGTAAAAT TAAAGATGAT TCAAGAGCAA CACATAAAGG AGATAAAATA ATGTCTAAAC 5640
AAAAGAAAAG TGAGATAGTT AATCGTTTTA GAAAGCGCTT TGATACTAAA ATGACAGAGT 5700

TAGGCTTTAC TTATCAAAAT ATTGATTTAT ACCAACAAGC ATTTTCGCAT TCGAGTTTTA 5760

TTAATGATTT TAATATGAAT CGTTTAGACC ATAATGAGCG TTTAGAGTTT TTGGGTGATG 5820

CGGTATTAGA ATTGACGGTT TCACGATATT TATTTGATAa ACATCCCAAC TTGCCAGAAG 5880

GGAATTTAAC AAAAATGCGT GCCaCTATTG TATGTGAGCC CtCACTkEA ATATTTGCGA 5940

ATAAAATTGG ATTGAACGAA ATGATTTTAC TTGGTAAAGG TGAAGAGAAA ACAGGGGGAC 6000

GTACAAGACC ATCATTAATA TCAGATGCAT TCGAAGCATT TATTGGGGCA TTGTATTTGG 6060

ATCAAGGACT AGATATAGTT TGGAAATTTG CTGAGAAAGT CATTTTCCCA CATGTAGAAC 6120 

AAAATGAGTT ATTAGGCGTG GTAGATTTTA AAACACAATT CCAAGAATAT GTGCACCAGC 6180 

AAAATAAAGG TGATGTAACC TATAATTTAA TAAAAGAAGA GGGACCGGCA CATCATCGTC 6240

TATTCACTTC A 6251 
(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4920 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

ACCTACTGAA GTTGCTAATT TTTTGGA^CA ACTAAGCACT GAAATTGAAC GTCTTAAAGA 60 

AGATAAAAAA CAACTTGAAA AAGTAATCGA AGAGAGaGAT ACTAATATTA AGTCTTATCA 120 

AGACGTGgCA TCAATCTGTA AGTGaTGCTT TGATACAAGC TCAAAAAGCT GGTGAAGAAA 180 

CTAAGCAAGC TGCAGAGAAA CAAGCTGAAG CGATTATAGC TAAGGCAGAA GGCAAgcTA 240 

ATcAAATGGT TGGTGACGCG GTAGAAAAAG CACGCCGTTT AGCATTCCAG ACTGAAGATA 300

TGAAACGTCA ATCAAAAGTA TTTAGATCGC GTTTCCGTAT GTTAGTTGAA GCGCAATTAG 360

ACTTATTAAA AAACGAAGAT TGGGATTACT TGTTGAATTA TGATTTAGAC GCTGAACAAG 420

TGACGCTTGA AAATATTCAT CATTTGCATG AAAATGATTT AAAGCCAGAT GAAGTTGCAG 480

CAAATGCACA AAATAATGCA TCAAATACAC CAGACAATAA TCAACAATCC AATGATTCAG 540

AAACAACTAA GAAGTAAGAA TTAAATAAAG ACAGACGCGT AATATACATT TAACTTTTCA 600

CAGCGAATTA GGTAATGGTG AGAGCCTAGT AAAffiCATGT ATGTTATATC ACTGGCTTTT 660

TAATATTTAA ATAATGTAAT GAGAGAACTC TAAGTTGAGT TAATAAGGGT GGTACCGCGA 720

GCAATCGTCC CTTTTAATTT AACTTAGAGT TTTTTAAATT TTTAAGGAGT GAAAAAAATG 780
GATTACAAAG AAACGTTATT AATGCCTAAA ACAGATTTCC CAATGCGAGG TGGTTTACCA 840

AACAAGGAAC CGCAAATTCA AGAAAAATGG GATGCAGAAG ATCAATACCA TAAAGCGTTA 900

GAAAAAAATA AAGGTAACGA AACATTCATT TTACATGATG GCCCACCATA CGCGAATGGT 960 

AACTTACATA TGGGACATGC CTTGAACAAA ATTTTAAAAG ACTTTATTGT ACGTTATAAA 1020 

ACTATGCAAG GGTTCmTGC ACCATACGTA CCAGGTTGGG ATACACATGG TTTACCAATT 1080

GAACAAGCAT TAACGAAAAA AGGTGTTGAC CGAAAGAAAA TGTCAACAGC TGAATTCCGT 1140

GAGAAATGTA AAGAATTTGC TTTAGAACAA ATTGAATTAC AGAAAAAAGA TTTTAGACGT 1200

TTAGGTGTTC GTGGTGACTT TAATGATCCA TATATTACAT 2LAAACCTGA ATACGAAGCT 1260

GCACAAATTC GTATTTTTGG AGAAATGGCA GATAAAGGTT TAATTTATAA AGGTAAAAAG 1320

CCAGTTTATT GGTCTCCTTC AAGTGAGTCT TCATTAGCAG AAGCAGAAAT TGAATATCAC 1380

GATAAACGTT CAGCATCAAT TTACGTTGCA TTTGACGTTA AAGATGACAA AGGTGTCGTT 1440

GATGCAGATG CTAAATTTAT TATCTGGACA ACAACGCCAT GGACAATTCC ATCAAATGTT 1500

GCGATTACCG TTCATCCTGA ATTAAAATAT GGTCAATACA ATGTAAATGG cGAAAAATAT 1560

ATTATTGCAG AAGCCTTGTC TGACGCTGTA GCAGAAGCAC TGGaTTGGGA TAAAGCATCA 1620 

ATCAAATTAG AAAAAGAATA CACAGGTAAA GAATTAGAGT ATGTTGTAGC ACAACATCCA 1680

TTCTTAGACA GAGAATCGTT AGTGATTAAT GGTGATCATG TTACTACAGA TGCTGGTACA 1740

GgTTGTGTAC ATACAGCACC AGGTCACGGG GAAGATGACT ATATTGTTGG TCAAAAATAT 1800

GAATTGCCAG TAATTAGTCC AATCGATGAT AAAGGTGTAT TTACTGAAS AGGCGGCCAA 1860

TTTGAAGGGA TGTTCTATGA TAAAGCTAAT AAAGCCGTTA CTGATTTATT AACAGAAAAA 1920 

GGTGCACTAT TAAAATTAGA CTTTATTACA CATAGCTATC CACACGACTG GAGAACAAAA 1980

AAACCTGTAA TCTTCCGTGC TACACCACAA TGGTTTGCCT CAATCAGTAA AGTAAGACAA 2040

GATATTTTAG ATGCAATCGA AAATACAAAC TTCAAAGTAA ATTGGGGTAA AACACGTATT 2100 

TACAATATGG TTCGTGACCG TGGCGAATGG GTTATTTCTC GTCAACGTGT GTGGGGTGTA 2160 

CCGTTACCAG TATTTTATGC TGAAAATGGC GAAATTATCA TGACGAAAGA AACAGTGAAT 2220

CATGTTGCTG ATTTATTTGC AGAACACGGT TCAAATATTT GGTTTGAAAG AGAAGCGAAA 2280 

GACTTACTAC CAGAAGGATT TACACATCCA GGCAGCCCTA ACGGTACATT TACTAAAGAA 2340

ACAGACATTA TGGACGTTTG GTTTGATTCT GGTTCATCAC ACCGTGGCGT GTTGGAAACA 2400

AGACCGGAAT TAAGTTTCCC AGCGGATATG TATTTAGAAG GTAGTGACCA ATATCEGGT 2460

TGGTTCAACT CTTCTATCAC AACTTCAGTT GCTACAAGAG GAGTATCACC TTATAAATTC 2520

TTACTTTCTC ATGGTTTTGT TATGGACGGT GAAGGTAAGA AAATGAGTAA ATCTTTAGGT 2580

AATGTGATTG TACCTGACCA AGTGGTTAAA CAAAAAGGTG CTGATATTGC GAGACTTTGG 2640
GTAAGTAGTA CGGACTATTT AGCTGATGTT AGAATTTCTG ATGAAATTTT AAAACAAACA 2700

TCTGATGTTT ATCGTAAAAT CAGAAATACA TTAAGATTTA TGTTAGGTAA CATTAACGAT 2760

TTCAATCCTG ACACAGATAG CATTCCTGAA TCAGAGTTAT TAGAAGTGGA TCGTTACTTG 2820 

CTAAATCGTT TACGTGAATT TACTGCAAGT ACGATTAKA ACTATGAAAA CTTTGACTAC 2880

TTAAATATTT ATCAAGAAGT TCAAAACTTT ATCAATGTTG AGTTAAGTAA TTTCTATTTG 2940

GATTACGGTA AAGATATTTT ATATATTGAA CAACGTGATT CTCATATCCG TCGTAGTATG 3000 

CAAACAGTGT TATATCAAAT TTTAGTTGAT ATGACGAAGT TGTTAGCACC AATCTTAGTG 3060

CATACAGCTG AAGAAGTTTG GTCTCATACA CCACATGTTA AAGAAGAAAG TGTTCACTTA 3120 

GCAGACATGC CTAAAGTTGT AGAAGTAGAT CAAGCTTTAT TGGATAAATG GCGTACATTT 3180

ATGAATTTAC GTGATGATGT GAACCGTGCA TTAGAAACTG CTCGTAATGA AAAAGTTATT 3240

GGTAAATCAT TAGAAGCTAA AGTTACGATT GCTAGTAACG ATAAATTTAA TGCATCTGAA 3300 

TTCTTAACTT CATTTGATGC ATTACATCAA TTATTTATCG TGTCACAAGT TAAAGTTGTA 3360 

GATAAGTTAG ACGATCAGGC AACAGCTTAT GAACATGGTG ATATTGTCAT CGAACATGCA 3420 

GATGGTGAAA AATGTGAAAG ATGTTGGAAC TATTCAGAGG ATCTGGTGC TGTTGATGAA 3480

TTGACGCATC TATGTCCACG ATGCCAACAA GTTGTAAAAT CACTTGTATA ATTGAAATTG 3540

TATAAAGTAC TCATACAGAT GATATAAATT AAAGCTCTCT TCATAATCAT GTTGTAGTTT 3600

TTGTTGACAT GATGAAGAGA GTTTTTTTGT GAATAAAAAA ATGACCAAGT TACCGGTCAT 3660

ATATGTAAAA AATGTGCGAT TTACTAAAAT AAAAATTATT CAGGAATGGT ACAAATTCTC 3720

TGAGGCATAT AAATGCGTTA TAGTTGCTAT TCTCAATTAT GTTCGCGATA ATTTTAAGTA 3780 

AAAGTAAGCA CAGATATTGA ATTTGATAGG AGTTAATTGA ATGTATCATA ACAGTAACGC 3840

AAACTTTGTC AATGGTATCA CTTTAASTGT GAGAGATAAG AATGAATTAA AGCCATTTTA 3900

TGAGGACATA TTAGGATTAA ATATTATAAA TGAGACATTA ACATCGATAC AATATGAAGT 3960

AGGTCAAAAT AATCATGTCA TTACACTTGT TGAATTACAA AATGGACGTG AACCTTTAAT 4020

GTCCGAAGCG GGACTGTTTC ATATCGCAAT TAAACTACCT CAAATTAGTG ATTAGCTAA 4080 

TTTACTAATT CATTTAAGCG AATATGATAT TCCAGTTAAC GGAGGTATAC AGCCTGCTTC 4140

GTTATCATTA TTTTTTGAAG ACCCGGAAGG AAACGGTTTT AAATTTTATG TTGATAAAGA 4200

CGAAGCGCAA TGGACGAGGC AAAATAATTT AGTAAAAATT GATATTAGAC CATTAAATGT 4260

ACCGAGATTA GTGAGTCATG CAACAAAATT GTTATGGTTA GGTATTCCAG ATGACGCTAT 4320

TATAGGTGCA TTGCATATTA AGACAATTCA TTTATCAGAG GTAAAAGAGT ACTACCTCGA 4380

TTATTTTGGA TTAGAGCAAT CGGCATATAT GGATGATTAT TCAATATTTT TAGCATCGAA 4440
TGGCTATTAT CAACATTTGG CCATGAATGA TTGGSTATCA GCAACGAAAC GTGTAGAAAA 4500

TTTTGATACG TATGGATTAG CAATTGTTGA CTTTCATTAT CCTGAAACAA CACATTTAAA 4560

TTTACAAGGT CCGGATGGTA TCTATTATCG CTTTAATCAT ATCGAAGTTG AAGATTAGTA 4620

TATACTTTGA ATGGACGAAC CATATAATGA ATCGTTTTTA ATGATCTTTT TATACAAGTT 4680

ATGAAGGAGG CTGGGACATT AAGTTCTTAG GCAATGTAAA AAGCTGATTT CTATTAATTA 4740 

TTTGATAGAA ATCAGCTTTT TTGATATGTA TTTTATAATG TACAGCTCGT TGAGCTGCTA 4800

TTTTCCTTAT ATTAATTGCC ATTAATACAA AACCTAGCTC TCGTTTAACT TTATTTATTC 4860

CTCGAACTGA CATTCGnGTG AACTCAAAAT nGCCTACTTn CTTAAATTAC CAATATCTAT 4920
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 626 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

TGGATTGCCA TTACATGGAC AAGATTTAAC TGAATCAATT ACACCATATG AAGGTGGTAT 60

CGCTTTTGCA AGTAAACCAT TAATTGATGC TGATTTTATT GGTAAATCTG TATTAAAAGA 120 

TCAAAAAGAA AATGGTGCAC CAAGAAGAAC AGTGGGATTA GAATTACTTG AAAAAGGAAT 180

TGCAAGAACT GGTTATGAAG TTATGGATTT AGATGGAAAT ATTATTGGAG AAGTAACTTC 240

AGGAACACAG TCTCCATCAT CAGGAAAATC AATTGCACTT GCAATGATAA AAAGAGATGA 300

GTTTGAAATG GGTAGAGAGT TGCTTGTTCA AGTTCGTAAG CGTCAATTAA AAGCGAAAAT 360

TGTTAAGAAA AATCAAATTG ATAAATAATT AAAAAGGGGT GTGCATTGTG AGTCATCGTT 420 

ATATACCTTT AACTGAAAAA GACAAGCAAG AAATGTTACA AACAATTGGT GCAAAATCTA 4 80 

TAGGAGAATT ATTCGGTGAT GTACCAAGTG ACATTTTATT AAATAGAGAT TTAAATATTG 54 0 

CTGAAGGCGA AC GGAGAAC A ATGTTACTTA GAAGATTnAA TCGCATTGCA AGCAAGAGTA 60 0 

TCACTAGAGG AACGCGTACA TCGTTT 62 6 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1126 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

nGGAAGTGGT GTATATATTT GTAATGAGTG TATTGAATTA TGCTCAGAAA TCGTCGAAGA 60

AGAATTAGCT CAAAACACTT CTGAAGCGAT GACAGAATTA CCTETCCTA AAGAAATTAT 120

GGATCATTTA AACGAATATG TTATTGGTCA AGAAAAAGCT AAAAAATCTT TAGCTGTAGC 180

TGTTTATAAC CACTATAAGC GTATTCAACA ATTAGGACCA AAAGAAGATG ATGTTGAATT 240

ACAAAAAAGT AACATTGCAT TAATTGGGCC AACAGGTAGT GGTAAAACAT TATTAGCTCA 300

AACCTTAGCC AAGACGTTGA ATGTACCATT TGCAATTGCA GATGCGACAA GTTTAACTGA 360

AGCTGGTTAT GTAGGCGATG ATGTTGAAAA TATCTTGTTG AGATTAATTC AAGCAGCTGA 420

CTTTGACATT GATAAAGCCG AAAAAGGTAT TATTTATGTA GATGAAATTG ATAAAATTGC 480

ACGTAAATCT GAAAACACAT CTATA3CACG TGACGTTTCA GGTGAAGGTG TTCAACAAGC 540

ATTGCTTAAA ATCTTAGAAG GTACGACTGC AAGTGTTCCG CCACAAGGTG GACGCAAACA 600

TCCAAACCAA GAAATGATTC AAATTGATAC AACAAATATC TTATTTATTC TTGGTGGTGC 660

CTTTGATGGT ATTGAAGAAG TGATTAAGCG CCGTCTTGGT GAAAAAGTTA TGGTTTCTC 720

AAGCAATGAA GCTGATAAAT ATGACGAACA AGCATTATTA GCACAAATTC GCCCAGAAGA 780

TTTGCAAGCC TATGGTTTGA TTCCTGAATT TATCGGACGT GTGCCAATTG TAGCTAATTT 840

AGAAACATTA GATGTAACTG CGTTGAAAAA CATCTTAACG CAACCTAAAA ATGCACTTGT 900

GAAACAATAT ACTAAAATGC TGGAATTAGA TGATGTGGAT TTAGAGTTCA CTGAAGAAGC 960

TTTATCAGCA ATTAGTGAAA AAGCAATTGA AAGAAAAACA GGTGCGCGTG GTTTACGTTC 1020

AATCATAGAA GAATCGTTAA TCGATATTAT GTTTGATGTG CCTTCTAACG AAAATGTAAC 1080

GAaGGTAGTT ATTACAGCAC AAACmATTAA TGAGaACTG AACCAG 1126
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4392 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

ATTGACTTCT TAGCAATnAA TaTGAGTGAA GAACGTACTG TTGAAGTACC AGTTCAATTA 60 

GTTGGTGAAG CAGTAGGCGC TAAAGAAGGC GGCGTAGTTG AACAACCATT ATTCAACTTA 12 0 

GAAGTAACTG CTACTCCAGA CAATATTCCA GAAGCAATCG AAG TAG AC AT TACTGBTTA 180 

AACATTAACG ACAGCTTAAC TGTTGCTGAT GTTAAAGTAA CTGGCGACTT CAAAATCGAA 24 0 

AACGATTCAG CTGAATCAGT AG T AAC AG T A GTTGCTCCAA C T G AAG AAC C AAC T G AAG AA 300 

GAAATCGAAG CTATGGAAGG CGAACAACAA ACTGAAGAAC CAGAAGTTGT TGGCGAAAGC 360

AAAGAAGACG AAGAAAAAAC TGAAGAGTAA TTTTAATCTG T T AC AT T AAA GTTTTTATAC 420 

TTTGTTTAAC AAGCACTGTG CTTATTTTAA TATAAGCATG GTGCTTTTTG TGTTATTATA 480 

AAGCTTAATT AAACTTTATT ACTTTGTACT AAAGTTTAAT TAATTTTAGT GAGTAAAAGA 54 0 

CATTAAACTC AACAATGATA CATCATAAAA ATTTTAAET ACTCGATTTT AAAATACATA 600 

CTTACTAAGC TAAAGAATAA TGATAAT TGA TGGCAATGGC GGAAAATGGA TGTTGTCATT 660 

ATAATAATAA ATGAAACAAT TATGTTGGAG GTAAACACGC AT GAAAT G T A TTGTAGGTCT 720 

AGGTAATATA GGTAAACGTT TTGAACTTAC AAGACATAAT ATCGGCTTTG AAGTCGTTGA 7 80 

TTATATTTTA GAGAAAAATA ATTTTTCATT AGATAAACAA AAGTTTAAAG GTGCATATAC 84 0 

AATTGAACGA ATGAACGGCG ATAAAGTGTT ATTTATCGAA CCAATGACAA TGATGAATTT 900 

GTCAGGTGAA GCaGTTGCAC CGATTATGGA TTAT TACAAT GTTAATCCAG AAGATTTAAT 960 

TGTCTTATAT GATGATTTAG ATTTAGAACA AGGACAAGTT CGCTTAAGAC AAAAAGGAAG 1020 

TGCGGGCGGT CACAATGGTA TGAAATCAAT TATTAAAATG CTTGGTACAG ACCAATTTAA 1080 

ACGTATTCGT ATTGGTGTGG GAAGACCAAC GAATGGTATG ACGGTACCTG ATTATGTTTT 114 0 

ACAACGCTTT TCAAATGATG AAATGGTAAC GATGGAAAAA GTTATGAAC ACGCAGCACG 12 00 

CGCAATTGAA AAGTTTGTTG AAAC AT C AC G ATTTGACCAT GTTATGAATG AATTTAATGG 12 60 

TGAAGTGAAA TAATGACAAT ATTGACAACG CTTATAAAAG AAGATAATCA TTTTCAAGAC 1320 

CTTAATCAGG TATTTGGACA AGCAAACACA CTAGTAACTG GTCTTTCCCC GTCAGCTAAA 1380 

GTGACGATGA TTGCTGAAAA ATATGCACAA AGTAATCAAC AGTTATTATT AATTACCAAT 14 4 0 

AATTTATACC AAGCAGATAA ATTAGAAACA GATTTACTTC AATTTATAGA TGCTGAAGAA 1500 

TTGTATAAGT ATCCTGTGCA AGATATTATG ACCGAAGAGT TTTCAACACA AAGCCCTCAA 15 60 

CTGATGAGTG AACGTATTAG AACTTTi&CT GCGTTAGCTC AAGGTAAGAA AGGGTTATTT 162 0 

ATCGTTCCTT TAAATGGTTT GAAAAAGTGG TTAACTCCTG TTGAAATGTG GCAAAATCAC 1680 

CAAATGACAT TGCGTGTTGG TGAGGATATC GATGTGGACC AATTTCTTAA CAAATTAGTT 17 4 0 

AATATGGGGT ACAAACGGGA ATCCGTGGTA TCGCATATTG GTGAATTCTC ATGCGAGGA 1800 

GGTATTATCG ATATCTTTCC GCTAATTGGG GAACcAATCA GAATTGAGCT ATTTGATACC 18 60 

GAAAT T GAT T CTATTCGGGA TTTTGATGTT GAAACGCAGC GTTCCAAAGA TAATGTTGAA 192 0 

GAAGTCGATA TCACAACTGC AAGTGATTAT ATCATTACTG AAGAAGT GAT CAGCCATCTT 1980 

AAAGAAGAGT TAAAAACTGC ATATGAAAAT ACAAGACCCA AAATAGATAA ATCAGTGCGC 204 0 

AATGATTTGA AAGAAACGTA TGAAAGCTTT AAATTATTCG AAAG T AC AT A CTTTGATCAT 2100 

CAAATACTAC GTCGCTTAGT AGCGTTTATG TATGAAACAC CTTCGACAAT TATTGAGTAT 2160 

TTCCAAAAAG ATGCAATCAT TGCAGTTGAT GAATTAATC GTATTAAAGA AACTGAAGAA 2220

AGTTTAACAG TAGAGTCTGA TTCGTTTATT AGCAATATTA TTGAAAGTGG TAATGGATTT 2280 

ATAGGACAAA GTTTTATAAA ATATGATGAT TTTGAAACAT TGATTGAAGG CTATCCTGTC 2340 

ACTTATTTTT CATTATTCGC TACAACAATG CC GAT AAAAC TAAATCATAT TATTAAATTT 24 00 

TCATGTAAAC CTGTCCAACA ATTTTATGGG CAATATGACA TTATGCGTTC TGAATTTCAA 24 60 

CGATATGTTA AT C AAAAC T A TCATATCGTG GTTTTGGTCG AAACCGAAAC TAAAGTTGAA 2520 

CGTATGCAAG CGATGTTAAG TGAAAtGCAT ATTCCATCAA TAACAAAATT GCATCGCTCA 2580 

ATGTCATCGG GGCAAGCAGT GAT T AT T G AA GGCAGTTTAT CTGAAGGATT TGAACTACCT 2 64 0 

GATATGGGAT TAGTTGTCAT TACTGAGCGT GAgcTTTTTA AAT C AAAAC A G AAAAAG CAA 27 00 

CGAAAACGTA CGAAAGCTAT CTCAAATGCT G AAAAAAT T A AGTCTTACCA AGATTTAAAT 27 60 

G T GG GAG AT T ATATTGTTCA TGTGCATCAT GGTGTTGGTA QTATTTAGG TGTTGAGACG 2 82 0 

CTCGAAGTGG GGCAAACGCA TCGTGATTAT ATTAAATTGC AATATAAAGG T ACGGAT CAA 28 8 0 

CTATTTGTTC CAGTAGATCA AATGGATCAA GTTCAAAAAT ATGTAGCTTC GGAAGATAAG 294 0 

ACGCCAAAAT TAAATAAACT CGGTGGCAGT GAATGGAAAA AAACAAAAGC TAAAGTTCAA 3000

CAAAGTGTTG AAGATATTGC TGAAGAGTTG ATTGATTTAT ATAAAGAAAG AGAAATGGCA 30 60 

GAAGGTTATC AATATGGGGA AGACACAGCT GAGCAAACAA CATTTGAATT AGATTTTCCA 3120 

TATGAACTTA CGCCTGACCA AGCTAAATCT ATCGATGAAA TTAAAGATGA CATGCAAAAA 3180 

TCGCGTCCAA TGGATCGCTT GCTATGTGGT GATGTTGGTT AT GGT AAAAC TGAAGTTGCA 32 4 0 

GTGAGAGCAG CATTCAAAGC TGTAATGGAA GGAAAGCAGG TTGCATTTTT AGTTCCTACA 3300 

ACTATTTTAG CTCAGCAACA TTATGAGACG TTAATTGAGC GTATGCAAGA TTTTCCTGTT 33 60 

GAAATTCAAT TAATGAGTCG TTTTAGAACG CCTAAAGAGA TAAAACAAS TAAGGAAGGA 34 2 0 

CTTAAAACTG GATTTGTTGA CATAGTTGTT GGTACACACA AATTACTTAG TAAAGATATA 3480 

CAGTATAAAG ATTTAGGGCT GTTGATTGTA GATGAAGAAC AACGATTTGG TGTACGCCAT 3 54 0 

AAAGAGCGTA T T AAAAC AT T AAAACATAAT GTAGATGTAC TAACATTGAC TGCAACCCCA 3 600 

ATACCTAGAA CATTGCATAT GAGTATGCTA GGTGTGCGGG ATTTGTCAGT GATTGAAACG 3660 

CCGCCAGAAA ATCGTTTCCC AGTTCAAACA TATGTATTAG AACAGAACAT GAGTTTTATC 3720 

AAAGAAGCTT TAGAAAGAGA ACTATCCCGT GATGGCCAAG TGTTTTATCT TTATAATAAA 37 8 0 

GTGCAATCCA TTTATGaAAA ACGAGAACAA CTCCAGATGT TAATGCCAGA TGCTAACATT 38 4 0 

GCAGTTGCTC ATGGACAAAT GACAGAGCGC GATTTAGAAG AAACGATGTT AAGTTTTATC 3900

AATAATgAAT ATGATATTTT AGTAACGACG ACGATTATTG AAACAGGTGT CGATGTCCCA 3 9 60 

AATGCAAATA CTTTGATCAT TGAAGATGCA GATCGCTTTG GATTGAGTCA GTTGTACAA 4020

TTAAGAGGTC GTGTTGGTCG TTCAAGTCGT ATTGGTTATG CATACTTCTT ACATCCAGCA 4080

AATAAGGTAC TAACTGAGAC TGCAGAAGAT CGATTACAAG CGATTAAAGA ATTTACGGAG 4140

TTAGGCTCAG GATTTAAGAT TGCGATGCGT GATTTGAACA TTCGTGGTGC TGGTAATTTG 4200

TTAGGTAAAC AACAGCACGG CTTTATTGAT ACAGTTGGAT TTGATTTGTA CAGTCAAATG 4260

TTAGAAGAAG CTGTAAATGA AAAACGTGGT ATTAAGGAAC CAGAATCTGA GGTGCCAGAA 4320

GTCGAAGTTG ATTTAAACTT GGATGCATAT TTGCCAACAG AATATATTGC AAATGAACAA 4380

GCTAAAATTG AA 4392
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 729 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

TTTCTTTTGA ATCTATATCG AGGTGGTTGG TAGGTTCATC TAAAATAAGT ACATTGTCAC 60

GTTGCAACAT AAGTAGTGCT AGTTGTAAAC GTGCTTTTTC ACCACCAGAT AAATCATTAA 120

TTATCTTTTT AACATCGTCT TGTACAAATA AGAAACGTCC AAGAACTGCT CGAATATCTT 180

TTTCATTCAT TAACGGATAT TGATCCCACA CATAATCTAA AATCGTTTTA CTAGATTTAA 240

ATTCTGCTTG CTTTTGATCA TAATAACCAA TTTGTAAATT TGCGCCGAAA GTAATATCGC 300

CATTAAGCGC TTTTTGTTGA TTAGCAATAG TTTTAATTAA GGTCGATTTT CCAATACCAT 360

TTGGCCCAAT GATTGCTATA TGATCGCCTT TAGAGACCTC TATACTCATA GGTTTGGTAA 420

TTGCAGTTTG ATAACCGATT TCTAAATTTT TTACATGCAT GACGTCATTA CCTGTATTCC 480

GGTCAAAGCC AAATTGAATA TTTGCACTTT TGGCATCTAA CATTGGTTTA TCAATGCGTT 540

CCATTTTTTC TAAAATCTTA CGTCTACTTT TTGCCATTCC ACTGTTGAA GCACGGGTAA 600

TATTTTTCTC AACAAAAGTT TCTAATCGTT TTATTTCTGC TTGTTGACTT TCATATTCTT 660

GCATTCGTTT TTGATAATAT AAATCCCGTT GCTGTATAAA TTCCTCGTAA TTACCAACAT 720

AGCGTTTGA 729
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13856 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

AATGCCATCG TTTTATCATC TAGTACTATT GAATGTGCAT CATCTTGCCC ATGTGCATCT 60 

TGATGTTTCG ATACATTTGT TGCACCTTGT GGATATACTT TAAAGGTTGT GTCGTATGTT 120 

TCCTTACTAT CTTTAGCTTC AGATTCCTGT GATTCAACCG TTTTATATTT TTCAAGTGCA 18 0 

TGTCCTTCAA TATCAACTCG TGGAATAATG CGATTCAACC ATGCTGGTAA ATACCACGAA 24 0 

CCTTTtCCAA ACAATTTCGt TAATGCAGGA ATTAACATCA TtCTGACTAC GAAGGCATCA 300 

AAG AG T AC AC CAAACGCTAA TGCCATACCC ATTGATTTAA TCATGACATC TTCTTGGAAT 3 60 

ACAAACGCAA AGAAGACACT AAACATAATT AATGCAGCTG CTACAAffiAC AGGACCGCTT 420 

TCTTTCAATC CTACTTTGAT AGAATAATCA TTATCCCCTG TTTTACTATm yyCTTCATGr 4 80 

ATTCGCGACA TAAGGAAGAC TTCATAATCC ATCGCTAATC CAAATAAGAT ACCTATAGTA 54 0 

ATAACCGGTA AAAATGCTAG CATTGGTCCT GTCGTTTCAA TACCAAACAG ACCTTTCATA 600 

AAACCATCTT GCATTACTAA TGTTGTAAAT CCTAATGTTG CCATTAATGA CAAGACGAAT 660 

CCTAAAACTG CTTTTAATGG TATTAGAATT GAACGGAAGA CAATCATTAA TAAGAAAAAT 72 0 

GCTAATACAA CAATGACTGA GGCAAATAAA GGTATCGCCT CATTTAACTT TTTAGACATA 780 

TCAATATTAA TGACACTTTG TCCCGAAAX TCCGTTTTGA ACCCATATTT ATCTTGTGCA 84 0 

TCTTTATGAT AATCTCGTAA ATCATGCACT AAATCATTTG TACTCTCTGC ATTAGGCCCT 900 

TGCTTAGGTA TCACGACCAT CAAAGCGTAA TCATTATCTT TACTCATTTG TGGTGGCGTA 960 

ACGATATCTA CATTTTTCTT ATCTTTAATA TCTTTATATA CAGACTGTAA ATCTGTTGT 1020 

AATCCTTGTG GATCATCCTT TTTATCTTTC ACATTTATCA ACATCGGTAT TTGGCCATTA 1080 

AATCCTTCAC CAAATTTATC CGAGATAATA TCGTAAGCTT TTTTCTGTGT AGAATCTGCT 114 0 

GGTTTAACAC CGTCATCTGG AATACCAAGT CGC ATATGAC TAACTGGTAT TGCAGCTGCT 12 00 

ACTAATATGA TTAAACCTAG TAATACTGCC GCAAGTGCAT TTCCTGTAAT AAATTTAGAC 12 60 

CATGGCGTAT CAATATCTTT TTTGAATTTA GACTGTAATT TATTCACTTT AATGCGTTtA 1320 

TGGAAAATGC TTATTAATGC AGGTAATAAA GTTAAAGCGC TAAGTACTGC AAAAACAACA 138 0 

CTAATTGCCG AAGCAAATCC CATTACCGCT AAGAAG'CAA TGCCTACTAA TGATAAACCA 14 4 0 

CATACTGCAA TTACAACTGT TACACCAGCA AAAACAACTG CACTACCTGC TGTTCCTATT 1500 

GCAAGACCAA TGCCTTTAAT GTAATCTGTT TCAGTTTTCA TAACTTGTCG ATATCTGAAT 15 60 

AAAATAAATA ATGCATAATC GATACCAACT GCTAGTCCAA TCATTACGGC TAATGTCAGT 162 0 

GTGACATTTG GTATATCGAA TGCATAAGTT AACAAACTGA TAATACCTAC ACCAGAGGCT 1680 

AGACCAATCA ATGCACTTAT AATTGGTAAT CCTGCAGCAA TGACTGAACC GAATGTGATT 1740

AACAGTACAA CAAATGCAAC AATAATACCA ACTAGTTCAG AATTACCGCC TACTTCTGTA 18 00 

GATGTCATTC CTGTACCAGT TAACTCTGTT TGTACATTAT GATTGTCACG TAAGTCTTTC 18 60 

AAATGACTTT TAACATTATC TCTAGAGCCA TCTTTTAAAG ATGTTTGACT AACGTCATAT 1920 

GTGATATCTG CAAATGCAGT TGTTTTATCT TTACTAATTT GCTTATTTTC ATAAGGATCT 198 0 

GATATTTTAT CAATGTGCTT GTCATCTTTT TTAATATCAT CTACGTTTT CTTAATATCT 204 0 

TTAGTAATGT TCGGTTGCAC AATACCATCA TCTTTAGTCG TCTTAAAGAC AACACGTATT 2100 

TGTGCCTTTT CACTATCTTG ATTAAAATGT TTTTCAATCT TTTTATTCGT AT C T AAC G AC 2160 

TCTAATCCTG TCATTTTAAT ATCATTGTCA AATTTCGGTG CATTTGTAGC AAGTGGTATC 2220 

AATATTGCAG CTACAATCAC TATCCATGCA ATGACCGCGG ACCATTTATG TTTTGCGATG 2280 

AATGTCCCCA TCTTATATAA AAATTTTGCC AAAGT AT AT T GCCTCCTTTT AAAATCAACG 234 0 

TTATAGTTTA AAT AT AC AG T GTAGATTATT GTTCGATTAT AGTATCTATC CCCGACCTCT 24 00 

TAAAGAATCA ATTGGAAAAT TTTGTATATT AAAC T AC AC A CAAAGGAGAA ATGTAGATGA 2 4 60 

AAG AG AC T G A TTTACGAGTT ATAAAGACAA AAAAAGCATT GTCGAGTAGC TTGCTACAAT 2 52 0 

TGTTAGAACA GCAATTATTC CAAACGATTA CTGTCAATCA AATTTGCGAC AACGCACTCG 2580 

TACACCGTAC AACATTTTAT AAACATTTTT AT G AT AAAT A T GAT CTTCTAGAG TACT TGT 2 64 0 

TCAATCAATT GACTAAAGAC TACTTTGCTA GAGATATCAG TGACCGTCTT AATCAT CCAT 27 00 

TCCAAAC GAT GAGTGATACG ATTAATAATA AAGAGGATTT GAG AG AAAT C GCAGAATTCC 27 60 

AAGAAGAAGA CGCTGAATTT AATAAAGTAT TAAAAAATGT CTGCATTAAA ATTATGCATA 2820 

ACGATATCAA AAAT AAT AGA GACCGTATCG AT AT T GAC AG CGACATCCCA GATAATCTCA 28 8 0 

TATTTTATAT TTATGACTCG TTGATTGAAG GTTTTATACA TTGGATAAAA GATGAAAAAA 2 94 0 

TTGATTGGCC TGGCGAAGAT ATTGATAACA TTTTCCATAG ATTAATCAAT ATTAAGATTA 3000 

AATAGTAGAT GAG AAAC T C A TGAGCGTTAC CSkCATTCAT AATAAAAACG ATAGTGkACA 30 60 

CGTTAATGAA TTCGTGTACT ACTATCGTTT TTTATTTTTA TCGTGCTTAT CGCTATTAAA 3120 

ACAACTGATA CACAACACAT AAAC TAT G AA GAAAAAAATA AATCCGCTAT C T AAAT GAC T 318 0 

TTGACTCAGT TGTTTAAATG ACCAAATTGC T AAT AC AAT T CCCATTATTA TTGAAAT&C 3240 

GTATCTCACA TTCTTATACC TATAATCCTT TTCTAAAAAT ATGGTTGCTA TTACTTAATT 3300 

TTTAAAGTTA TAAATAAAAA G AG C C AAC C G CAATGGATGG CCCTTGTTCA TTATGAAGCA 3360 

T TAG AAC ATT TCTGAAACAA CCTTTTGTTC TAAGAAGTGT AAT AAG TAG T CTGGACTACC 34 2 0 

TGTTTTAGCG TCCGTACCTG AC AT T T T G AA ACCACCAAAT GGATGGTATC C AAC AAC T GC 34 80 

TGAAGTACAG CCTCTGTTAA GGTATAAATT GCCTACATCA AATTCGTTTA CCGCTTTAAT 35 4 0 

CCAATGCTCG CGATTATTTG TAATCACTGC ACCAGTTAAA CCGTAATCTG TATCATTTGC 3600

AACCTCAATT GCTTCATCAA AATCGTTAAC T T T C AC AAAG C C AAC AAC T G GACCAAAAAT 3660 

TTCTTCTTGC ATGATTCTAT CTTTAGATTT AAGTCCTGAA ATGATTGTTG GTTCTACAAA 3720 

GTAACCTTTT GAATCATCAG TGCCGCCACC TTGTTCTAAT TTACCTTCTT CTTTACCAAT 3780 

CTCAATATAA TTTTTAATCT TATCAAATTG TTTTTTATTA ATAACTGGGC CCATATACGT 3840

ATTGTCTACA GTATTGCCCA ACGTTAATTC TTTTGTTAAT TTGATTGATT TCTCTAATAC 3900 

TTCGTCATAA ACGTCTTTAT GCACAATTGC ACGTGAACAT GCTGAACATT TTTGACCAGA 3 960 

AAAACCAAAT GCTGACGTTA CAATAGCTTC TGCTGCCATA TCTGTATCAA TATTTTCATC 4 02 0 

AACTACAATG GCATCTTTAC CACCCATTTC AGCGATAACA CGTTTCAAGA AGTTTTGACC 4 08 0 

TTCTTGAACA ACGGCACTAC GTTCATAAAT TCTAGTACCT GTCGCACGTG ATCCTGTAAA 4 14 0 

TGTAACGAAA TGCGTATCTT TATGATCAAC TAAGTAATCA CCAATTTCTT TCGGATCACC 4200 

AGGAACAAAG TTAACTACGC CTTTTGGTAA TCCTGCTTCT TCTAAAffiTT CCATTAATTT 42 60 

ATAAGCGATA TAAGGTGTAT CCTCAGCAGG TTTCAATAAC ACTGTATTAC CTGCCACAAC 4 320 

TGGTGCTAAA GTTGTACCAG CCATAATCGC AAACGGGAAG TTCCACGGCG GAATTGTAAC 4 380 

ACCTGTACCA ATTGATTTAT AGAAATATTT ATTGTGTTCA CCTTCACGAT CAAGTACTGG 4 440 

CTTACCTTGA GCCAAGTCCA TCATTGAACG TGCATAGTAT TCAATAAAAT CAATACCTTC 4 50 0 

AGCTGCATCA CCAACTGCTT CATCCCATGG CTTACCTGCT TCATAAACCA TAATTGCTGC 4 5 60 

AATTTCCGCT TTTCGACGAC GAATAATTGC CGAAACACGT AACATAAGCT CTGCACGATC 4 620 

ATTTGCTGAC CATGTTTTCC AAGATTTA 1 ^ AGCTTCGTTT GCTGCTTTAA ACGCATCTTC 4 68 0 

AACATCTTGT TTTGTTGCCT TTGATGCATT TGCAATCACT TGTGATGTGT CTGCAGGATT 474 0 

GATTGATTTA ATTTTGTCAT CTTTGAAAAT CTTCTCTCCA TTAATCACTA ATGGTATGTC 4800 

TTGACCTAAT TCTTTTTCCA CGTCTTTCAA TGCTTTCTTA AACATATCCA CATTTCTTG 4 8 60 

G AC T G AAAAA TCGTAACCAG GTTCATTTTT AAATTCTACT ACCATGTACA CTTACCCCCT 4 92 0 

ATAAATTTTG AAAGTGGTTT AACCCTTTGA TTTAATGATA TAACATCATT T AAAC T CAT T 4 98 0 

TTACTATGAT TAAGGTTAGT TTTGCAATCG CTTTCATTTT TATGTTTTAT CACTTATTCT 5040 

CAAGTATTTT GAAATTGATT GGTTACTTTT TAAAATTTAT ATGGGTCGCA ACTGCTACTT 5100 

TATCGTTTCG TCATTTAATG TTTCGGATGG TAGGTCATTA TCAATTTTAC GAACGACTTT 5160 

ACAAGGGTTT CCAACCGCTA AGCTGTGTGG CGGAATATCT TTAGTGACAA CACTACCAGC 5220 

ACCAATCACA CTGCCTTCTC CAATCGTCAC CCCTGOAAC ACGGCTACAT GACCGCCAAA 5280 

CCAAGTATTA CTGCCAATAT GAATGGGTCC GGCTTTTTCA AAACCTTCAT TTCTATGATG 534 0 

GAAATTAAGT GGATGTGTCG CTGTGTAGAA TCCACAATTA GGTCCTATAA AAACATTATC 5400

GCCAATTGTA ATCTGTCCAC CATCCATAAA GTAACAATTG GTGTTTACAT AG AC AT T T T T 54 60 

TCCTAGTTTA ACGTTCCAAC CATAATCTGT ATCAAAAGGA ATCGAAATAC TTACATTGTC 5520 

TGTTGTTGTT TGAAATAATT GATCAATTAA TTCCTTTCTT TTATTTGTAG CACTCGGTCT 5580 

TGTATGATTT AATTCAAAGC AAATATCTTT CGCTCGTGCA CGTTCATTGA TTAAGTATTG 5640 

ATCAAAGTTT GCATCGTACC ATTTTTCTGC TAACATTTTT TCTTTTTCAG TCATTACACC 5700 

TTTCAACTCC TAATAACTTA TTTACTTGTT TAAAAGTTAA TCAAATAAAC CTTCGCCTAT 57 60 

GCAACTAATA CGCTATAACA TTATGAAATC ATGACCTTAT CACCCTTATC TATACAATTC 58 20 

TCGCATCAAA TACTGCTAAA GTAGTAGATA AATTCAATAC TASGACGCA TTCATTTTTT 588 0 

AATCTATTAA CGTACAATGT GAGTAAGAGA AATATAAAGG AGTATGATAG CGATGAGAAT 594 0 

ATTAATTACA GGCACAGTTG CTATCTTAAT CATTCTAGGT TTGGTCAAAA C GAT AC AAG A 6000 

TTACGAAATG ACAAACGACA CGAGTCGTcA GTTGTCAGAC AACAAAGATG ATGATAAAGT 60 60 

CATCCATCTT AATAATTTTA AAAATTTACA TGCGAAAGAA TTTAACCCAT CTGATTTCTT 6120 

TTAAGTCACC TAAGAATTGC AAATCCAGAA GTCATTTAAG TTTTACCTTT CATTCATACA 6180 

TCCTTTAATA TTAATTACGA CTTCTTTTAT AT AG AT GC T A AGTAGAGAGA TTGTTGTGCA 62 4 0 

ATGTTTGCAC GGCAATCTCT CTTTTTCTTT TTAAAATTGG TAAAAGTAAA AC G C AAC GAT 6300 

TGACTTATAT ACCTATAGGG GGTACATTAG ACGTGTAACA ATGAATCACA GGGAGGCAAT 6360 

AATGTGGCTA AT AC G AAAAA AACAACATTA GATATCACTG GTATGACTTG TGCCGCATGT 64 20 

TCAAATCGTA TCGAAAAGAA ACTGAATAAA CTTGATGACG TTAATGCCCA AGTGAATTTA 6480

AC T AC AGAGA AAGCAACTGT TGAGTATAAC CC TG AT C AAC ATGATGTCCA AGAATTTATT 654 0 

AATACGATTC AACATTTAGG TTACGGTGTC GCTGTAGAAA CTGTCGAATT AGACATTACA 6600 

GGTATGACTT GTGCTGCATG CTCAAGCCGT ATT G AAAAAG TGTTAAATAA AATGGACGGC 6660 

GTTCAAAATG CAACGGTCAA T T T AAC AAC A GAGCAAGCTA AAGTTGACTA TTATCCTGAA 672 0 

GAAACAGATG CTGATAAACT TGTCACTCGC ATTCAAAAAT TAGGTTATGA CGCGTCTATT 67 8 0 

AAAGATAACA AT AAAG AT C A AACGTCACGC AAAGCTGAAG CGCTACAACA TAAATTGATT 684 0 

AAGCTTATCA TATCAGCAGT ATTATCTTTA CG\CTATTAA TGTTAATGTT TGTACATCTT 6 90 0 

TTCAATATGC ATATACCAGC ACTATTTACG AATCCATGGT TCCAATTTAT TTTAGCTACA 6960 

CCTGTACAAT TTATTATTGG ATGGCAATTT TATGTAGGTG C T TAT AAAAA C T T AAG AAA T 7 02 0 

GGTGGCGCCA ATATGGATGT ACTTGTTGCT GTTGGTACAA GTGCAGCATA TTTTTAO&T 70 80 

ATTTATGAAA TGGTTCGTTG GCTAAATGGC TCAACAACGC AACCGCATTT ATACTTTGAA 714 0 

ACAAGCGCCG TACTAATTAC CTTAATCTTA TTCGGTAAGT ATTTAGAAGC TAGAGCGAAG 7200 

TCTCAAACAA CCAATGCGCT TGGCGAATTA TTAAGTTTAC AAGCTAAAGA AGCACGCATT 7260

TTAAAAGATG GTAATGAAGT GATGATTCCT CTAAATGAAG TACATGTTGG AG AT AC AC T T 7320 

ATCGTTAAAC CAGGTGAAAA GATACCTGTT GATGGCAAAA TTATTAAAGG TATGACTGCC 7 38 0 

ATCGACGAAT CTATGTTAAC AGGTGAATCT ATCCCTGTTG AGAAGAATGT TGATGATACT 7440 

GTAATTGGTT CAACGATGAA CAAAAACGGT ACTATTACTA TGACAGCAAC AAAAGTTGGC 7500

GGGGACACTG CGTTGGCAAA TATTATTAAA GTTGTCGAAG AAGCTCAAAG TTCTAAAGCG 7560 

CCGATTCAAC GATTGGCAGA TATTATTTCT GGTTATTTCG TTCCTATCGT TGTTGGTATC 7 620 

GCACTATTAA CATTTATCGT GTGGATTACT TTAGTTACAC CAGGTACATT TGAACCTGCA 7680

CTTGTTGCGA GTATTTCCGT TCTCGTCATT GCTTGTCCAT GCGCATTGGG ACTTGCTACA 77 4 0 

CCAACTTCTA TTATGGTAGG TACTGGTCGC GCTGCTGaAA ATGGTATTTT ATTTAAAGGT 7800 

GGCGAGTTTG TTGAACGCAC ACATCAAATT GATACCATCG T T T T AG AT AA GACGGGTACC 7 8 60 

ATTACAAATG GTCGTCCAGT CGTGACAGAT TATCATGGTG ACAATCAAAC GCTACAACTA 7 920 

CTTGCTACTG CTGAAAAAGA TTCTGAACAC CCATTGGCAG AAGCCATTGT CAATTATGCA 7 980 

AAAGAAAAGC AATTAATATT AACTGAGACA ACAACATTTA AAGCAGTACC TGGCCATGGT 8 04 0 

AT TGAAGCAA CGATTGATCA TC AC CAT AT A TTGGTTGGTA ACCGTAATT AATGGCTGAC 8100 

AATGATATTA GCTTGCCTAA GCATATTTCT GATGATTTAA C AC AT TAT G A ACGAGATGGT 8160 

AAAACTGCTA TGCTCATTGC TGTTAATTAT TCATTAACTG GTATCATCGC AGTGGCAGAT 8220 

ACTGTCAAAG ATCATGCCAA AGATGCTATA AAACAATTGC ATGATATGGG CATTGAAGTT 82 80 

GCCATGTTAA CTGGCGATAA T AAAAAC AC T GCTCAAGCCA TTGCAAAACA AGTAGGCATA 834 0 

GATACTGTTA TTGCAGATAT TTTACCAGAA GAAAAAGCTG CACAAATTGC GAAAC T AC AG 8 4 00 

CAACAAGGTA AGAAGGTTGC GATGGTTGGT GACGGTGTAA ATGATGCACC TGCATTAGTT 8 4 60 

AAAGCTGATA TCGGTATCGC CATTGGTAG^ GGTACAGAAG TTGCCATTGA AGCAGCTGAT 8 52 0 

ATTACTATTC TTGGTGGCGA CTTGATGCTT ATTCCTAAAG CCATTTATGC AAGTAAAGCA 8580 

ACCATTCGTA ATATTCGTCA AAATCTATTT TGGGCATTCG GCTATAATAT TGCCGGTATC 8 64 0 

CCTATAGCTG CATTGGGCTT ACTTGCGCCA TGGGTTGCTG GTGCTGCAAT GGC&TAAGT 87 00 

TCAGTAAGTG TTGTCACAAA CGCACTTAGA TTGAAAAAGA TGCGATTAGA ACCACGCCGT 8 7 60 

AAAGATGCCT AGATTCCTTA ATAATGAAGG ATTCGTTGGT GATTCTGAGA TAGGCTAGTG 8820 

ATTGGCTCTA TAATGTCGCG GTTTAyaGTt GGATCTTCGC TCCAACTGCA TATATAGTnA 8880 

CACTTTTCGC TTGGCGAATT AGTGTATCTT ACCTAATAGc TCCGCCTATT AGGTTCCATC 8 94 0 

ATT ATT AT AA ATAATAAGTA CACTACGGtT TACAGTTGGA TCTTCGCTCC AACTGCATAA 9000 



GAGCCACTAA T TAT TAT AAA TAATAAGTGG CyCTAAAAGG AGGTGAATGC TATGTCACAA 9060 

GAAATTTTAA ATGTTGAAGG TATGAGCTGT GGTCACTGCA AAAGTGCTGT TGAATCTGCA 9120 

TTAAATAATA TTGACGGTGT CACTTCAGCT GACGTTAACC TTGAAAATGG TCAAGTAAGT 9180 

GTTCAATATG ATGACAGTAA AGTTGCTGTA TCTCAAATGA AAGACGCAAT TGAAGATCAA 92 4 0 

GGTTACGATG TCGTTTAATT AGGCAATATT CAACGTCATC AACACCAAAT TAAAAAATCG 9300 

AACTGATGAG AATCCCAACA ATCCAAATTA TCTCATCAGT TCGATTTTTA ATTTACTCGT 9360 

AACCTAGTAT CTCCAGTCTG CAATACATCT AATGTTGCAT CTAATGCATC G AC AAT TAG A 94 20 

TTTTTAACTG CAGCTTCAGT ATAAAACGCA ATATGTGGTG TTAATATGAC ATCTTCCCTG 94 80 

TCAATCAACG ATTCTAACAA TGGATCGTTC AGTGTTTTGC CCCTTTGATC AC T T GGGAAA 954 0 

AGTTTGCGTT CAAATTCATA CGTATCAAGT GCTGCACCTT TAATCACACC ATTGTCTAAT 9600 

GCGTCTAATA ACGCCTTAGT AT C TACT AAA GAACCTCTCG C AC AAT T G AC AAATACTGCG 9660 

CCCTTTTTAA AATGTTTAAA TAATTCAGCA TTAAATAGAT AAEATTATA TTTCGTTGCA 97 2 0 

GGTACATGTA ATGTCACGAT ATCAGCACCT TCAACCGCTT CCTCAATCGT ATCTTTGTAA 9780 

TCGACATACG TTGCAATTTT AGCATTAGGA AACGGtCGTA TGCGACCACA TCACTTTGAT 98 4 0 

AACCATTGGC AAATATATCG GCTACTACAC GGCCAATTCG ACCTGTACCA ATAACAGCTA 9900 

CTTTTAAATC TTTAATGGAT TTCGATAAAA TAGTAGGTTC CCATCTAAAA TCATGcTCCC 9960 

GCACTTTCGT TTGAATTTGA TTAAAATGAC GAACCACATT AATAGCCTGG TTCACAGCAA 10020 

ACTCCGCAAT TGAATTCGGA GAGTATGACG GCACATTTGA CACAATAAAG TTATACTTGT 10080 

TTGCTAACTC CAAATCATAT GTATOAAATC CAGCACTACG TTGTGCGATT TGTTTAATAC 1014 0 

CTAGTTCATT TAATCGTTTA TAAACATGCT CTGATAATGG TATTTGTTGT GATAGCGATA 10200 

AGCCATCATA ACCAGCGACA CCTTCAACAT TGTCATCAGT TAATGCTTCT TTAGTAATAT 10260 

CTACCTCAAC ATGATGTTTC TCTGCCCACG CCTTGATATA AGGCATATCT TCATCACGTA 10320

CACTCATGAT TTTAATTTTT GTCATTTTAA CATCACCCTT AACTTTATTA TTCATATAAA 10380 

TATGCTAGTT CTGTTAATCT TATTGCAGCT TCGTCTAATT TCTGGTCATC TAACGCCAAT 10440 

GAAATTCTCA CATAACGATT ACCATTCTCT CCAAATGGTT TCCCTGGAGC AACAAGTATT 10500 

GACTTCTCTT GCACTAAAAA TTGCTCAAAT TGCTCGCTGT CATAACCAGG CGGTGTTTCC 10560 

AACCATACAT AT AT GC C AC C TTTAGCATGA ACAAATGGCA AATCAGCTTT TGCAAGCATG 10 620 

GCTTCGAATC GGTCACGACG TGTTTTAAAT ACATTGCTTT GTTCTTCTAA AAAAT CAT C A 10680 

TAATGATTCA AAGCATATAT TGCGGCATCT TOAATGCAC CAAACATCCC AGCATTTGTG 10740 

TGCGTTTGGT ACTTTTTCAA AGCTTGAATC ATATCTTTAT TACCAACTGC AAAACCGACT 10800 

CTAAAACCTG ACATGTTATA ACCTTTAGAC AAGGAATAAA TTTCAATCGC AACATCTTTC 10860 

CCATTTTCCG AAGCAAGTAT ACTAGGATTT TTAGCGTCGA AACCGAAAGC ACCATAAGA. 10920

AAATCATGCA CGATTTTAGT GTCTGTACCT TTAAATTTAG cTATCGCTTC AT C AAAAAC T 10980 

TCTTTCGTAG CTGTCGATCC AGTTGGATTA TTTGGATACG TTAAATAAAT GAGTTTTGTT 1104 0 

TTATCTATTA TTTGTGAATC AACTTTGGAC CAATCTGGCA AATAATGTGG CGGTTCTAAA 11100 

TTAAGCGGGA CTGGCTTGCC ATCAGC TAAA AGTACACCTG CTAAATAATC CGTGTAGCCT 11160 

GGATCAGGTA GTAATACATA GTCTCCTGGA TTGATAACAC ATGTTGGTAC TGCCACTAAT 11220 

CCATTTTTTG TACCATATAA AATGCATACT TCATCTTCTT TATCTAACGT CACATTATAT 11280 

TGTCTTTGAT AAAAATCTAC AATAGCTTGC TTGAACGCTT CTTTACCATG AAAAGCACCA 1134 0 

TATTTTTGAT TTTCAGGAAT AGTTAGTGCT TTTTGAAAAT GATCAATAAT ACCTTGTGGC 11400 

GTGGGCCCAT CAGGGATTCC AACTGCCATA TTAATTAATG GCAATGGTCC ATGTTCGATT 114 60 

TTACGTCCCA TCGTTTTCCC GAAATAACTA TCAGGGATAT TTGCTAATTT GTTAGAGATC 11520

ATCAAATTCC TCCTCTATCA T T AAAC AT AG CCTGGGCGAC TATCATAATC CTAACAACTT 11580 

GTATCACTCT CAT T T AGATG GTTACAATGA CATCGCCATT CACCGTTATG TTCAACAGAA 11640 

CTTATGACAC ACGTTGTATT GAATGAATTT ATTTTCATTT TAGGTAGGTA TAATATTATT 117 00 

GTCAATATTA GGAATTTTCA GVTTAATATG CACTCAATCG TTATGATTTA ACTGTCATGC 117 60 

ATATCCGCAT GCGCAACCAG TTAGATATGC TTATATAAAG TATAACGCCC ATCAAGGTAC 11820 

GTATTCAAAC GTGAACCTTA ACAGGCGTCA TTCATTGTTA AATAAAACTT CTTAAGCACA 118 80 

TACTTATTTC ACTATGCCTT TTACGTTCCC CTTATACTTT TCTCACffiCT TTCTCTTAGA 11940 

CTACTCCCTT ATACGCCCCG CTCAATATCT TTAATCATTT CATCTACAGT TATTTTCGCA 12000 

CTCGTTAAGA CAATAGGAAC GCCTGCACCT GGATGCGTAC TTGCACCTGC AAAATATAAA 12060 

TCTTTATAAT CTCGCGATAC ATTTTGTGGA CGATAATAAT TACTTTGCGC TAAAGTTGGC 12120 

ATTAAACCGA ATGCCGAACC AAATTTCGCA TGATACGTTT GCTCAAAATC ATTTGGCGTA 12180 

AAGATTGTTT CTGAAACAAT ATGCGATTTT ATATCTTCAA ATACTTCAAT CGTTGCTAAT 1224 0 

TTACGATAAA TAATTTCCTT TATTTGTTGC GTCAAAGCTT CATCTGACCA ATCGATTCCG 12300 

CTACCTGTTT TAAGTTCCGG CGTCGGCATf AGCACATAAA TACCAGTTTT GCCTTCTGGC 12360 

GCAAGTGATT TATCAGCGAC CGCTGGTACA T AC AC AT AAA TAGAAGGATC AT AT GAT AAA 12 4 20 

CGTCCCTCAA ATATTTCTTC AATATTGCCT CTAAAGTCAT CTGAAAAAAT AACATTATGA 124 80 

AGTCTCACTT GATCTGTCAC ATCAATATCT ATACCGATAT AC AT T AAAAA TGCCTAACAA 12 54 0 

GAGTAATCTA AGTCTGCAAT TTTATGTGGT GGATACTTTT TAATAGGTGC AAAATCTGGC 12 600 

ATTAATGATT CCGCAACACT AGGGAAATCA GCCGTACATA AAATTTTATC AAATTTTCTT 12 660 

ATGTCACCAT TCACTTTTAT CGCATCGGCC CGTTTGAATT TAGGATCAAT AATAATTTGC 12720

TCAAT T T C AG CATTTAGTTC AATATTAACG CCTAAGTCTT TATTTAATTG CGCTAGcCCT 12780 

TGAGCCATGC CATACATACC GCCTTTAATA AAATGCACAC CAAACATCAT TTCAATCATA 1284 0 

GGAATAATTG AATATAGTGA CGGGCCTCGT TTTGGATCAA TTCCTATGTA TAACGTTTGA 12 900 

AACGCTAAAA GCTTTTGTAT CTTTTCGTTA TCAATAAAT GTTCAATTAG CTGATCTGCA 12 9 60 

TGATTTAACG TTTTTAACTT AGCACCTTGC ACAAGTGACG TCAT AT TAT A AAAGTCACTC 13 020 

GGTTTGCGAT ACGTTCTTTC TAAGAAATAG CGACGTGCAA TTTCATATTT TTTATAAACA 13080 

TCCGTTAAAA AGGACATAAA ACCATGCGTT GAACCAGGTT CTATACTTTC TAGCATTTGC 1314 0 

TGTAATTCAG CTAAATCTGT AGGCACCGTT ATACGATCAT CGTGGTCAAA ATACACATCG 13200 

TAAATATAAC GTAATTGTCT CAATTCAATA TAATCTTCAT AATTTTTACC ACACGCTGTA 132 60 

AAAACATCTT TATAAACATC TGGCATCATG ACAATTGTGG GACCCATATC AAATGTAAAG 1332 0 

CCGTCTTTCT TTAATTGATT CATACGCCCG CCTACATTAT TATTTTTTTC AAATATCGTC 13380 

ACTTCATGAC CTTGAGAAGC AATACGGGCT GCCGCTGCTA ATCCTGTGAC ACCTGCACCA 13440 

ATTACTGCAA TCTTCATTAT TCAACCACCT ATATTCTATG ATATTTACTA TTTATTTCAT 13500 

GAAACAACTT TGCCTTTTTC CTCTTATCCA CAAAAACACG TTS.TGTAAT G TAT AG T TAG 135 60 

CCTGTCTCAC TTCGTCCAGT ATTTCAATAT ATATACGTGC TGCTAATTCT ATGATTGGTT 13 620 

GTGCTTCAAT ACTAAATACT TTGATTTGAT CCATAACATC TTGAAAATCT TTTTCTGCGA 13680 

TAGCTGCATA ATATTCCCAT AAGTCAATAT AATGATTATT AACACCATTT TGGTACACTT 137 4 0 

CAGCAATATC AACTTCATAT TGCTTTAATC GTTGCTTACT AAAATATATC CGTTCATTGT 13800 

CAAAATCTTC ACCGACATCT CTTAATATAT TAAnGGGATC CTCTAGAGTC GACCTG 13856
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10088 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

AT AT AT AAA T ATAGATTAAG TATATAGATT AATCAACTTT TTTGGAAGAG CAAATCACGC 60 

AATCAACAAA TAATATAAGA ASTTTTTGCG ATAGTTTTAA AATAGCTGTA ATAGAATACT 12 0 

AAATGTGACA AACTTAGAAC TAATATCAAG TGTTGATGTT TTGAATATAA AAATGCTAAT 18 0 

AGATGGCATC AC GAG AT T T A TGACTATCTG TTAAATCAAC CTAACACGAT TATAATTTGA 24 0 

ATAATTGGTT AATATATGAG T AAT T AG AAA ATAGACAAAG GATGACBTT TATGTATATC 300 

AATATGAAAG ATTATGGGTT AACAGGCATA AACAAAACTA AAGATACTCG AGCAATACAA 360

CGTGCGTTAA ATCGTGGAAG ATGTAAACCA ACGACAGTTT ATATACCGAA AGGGACGTAT 4 20 

GATATTTGCA AACCATTAAC GATATATGGC AATACAACAC TTTTGTTAGA TAATGAAACT 4 80 

ATTTTACGCC GATGTCATTC TGGTCCTTTA TTAAAAAATG GTCGTCGCTT TGGTTTTTaT 54 0 

CGTGGTTATA AT GG AC AC AG TCATATTCAT ATTAAAGGCG GCAAGTTTGA TATGAATGGT 600 

GTATCGTATC CTTATAACAA TACAGCTATG TGCATTGGGC ATGCTGAAGA TATTCAATTA 660 

ATAGGTGTGA CCATTAAGAA TGTAGTGAG 1 GGTCATGCAA TTGATGCTTG TGGGATTAAC 720 

GGACTCTATA TTAAAAGCTG TTCATTTGAA GGATTCATAG ACTATAGTGG CGAACcTTTT 780

ATTCTGAAGC AATACAATTA GACATTCAAG TACCTGGTGC TTTTCCAAAA TTCGGAACgA 840 

CAGATGGTAC G AT AAC G AAA AATGTCATTA TCGAAGATTG TTATTTTGGA CCTOAGAAT 900 

TGCCCGAAAT GGGAAGTTGG AATCGTGCTA TTGGCTCACA TGCAAGTAGA CATAATCGAT 960 

AC TAT G AGAA TATTCATATT AGAAATAATA TATTTGAAGA TATACAAGGT TATGCATTAA 102 0 

CTCCCTTGaA GTATAAAGAT GCTTTCATTA TTAATAATAA GTTTATTAAC TGTGaGGGTG 108 0 

GCATTAGATA TTTAGGAGTT AGAGATGGTA AAAATGCAGC AGATGTGaTG ACAGGaAAAG 114 0 

ACTTAGGTTC CCAAGCAGGC AT AAAT AT GA ATATAATTGG AAATGAATTT AAAGGATCAA 12 00 

TGTCTAAAGA TGCGATACAT GTACGTAATT ATAATAATGT TAAACATAAA GATGTATTAA 12 60 

TCGTTGGGAA TACATTCAAT AATTCGACTC AATCAiTTCA TTTAGAAGAT ATTGATACAG 1320 

TGTTTTTAAG TCCTGTTGAA GCGGGTATTC AAGTTACTAC AATCAATGTA GAT G AAAT AA 1380 

AAAAGTAAAA AGTTTCGCAT GACATTAGGA TTAAGAATAG TAGATAATTT TTGAAAGCGC 14 4 0 

ATTCATAAAA CGGTATAAAT ATGCTATAAT AAACCCAATT AT C T GAT AAA AGGGGTATTT 15 00 

TGACGGTAAT GATAATACAA GATAGACAAC TTTCTATACT CTAATATAGT GAGTTGAAGT 15 60 

AGCTTGTCAT AATCATCATG AGGGGGAAAT TTATGGCTTA TTTCAATCAA CATCAATCAA 162 0 

TGATATCGAA AAGGTATTTA ACATTCTTTT CAAAATCAAA GAAAAAGAAA CCGTTTAGTG 168 0 

CGGGACAACT TAT T G G ACT A ATATTAGGTC CATTACTTTT CCTATTAACA TTATTATTCT 17 4 0 

TTCATCCACA AGACTTACCT TGGAAAGGCG TCTATGTTTT AGCGATTACT TTATGGATTG 1800 

CGACTTGGTG GATTACTGAA GCAATTCCTA TTGCAGCAAC GAGCTTATTA CCAATTGTGT 18 60 

TATTACCATT AGGTCATATA CTTACACCAG AACAAGTATC ATCGAATAT GGCAATGATA 1920 

TTATCTTTTT GTTTTTAGGT GGATTTATTT TGGCAATTGC AATGGAAAGA TGGAATTTAC 198 0 

AT AC G AG AG T AGCGTTGACA ATTATTAATT TGATTGGTGC AAGCACTTCA AAAAT AT T AC 2 04 0 

TTGGATTCAT GGTGGCAACA GGATTCTTAT CTATGTTTGT AT CG AAC ACT GCAGCTGTAA 2100 

TGATTATGAT TCCGATTGGT TTAGCAATTA TTAAGGAAGC ACATGATTTA CAAGAAGCCA 2160

ATACGAATCA AACAAGTATT CAAAAGTTTG AAAAATCTCT AGTTTTAGCA ATTGGCTATG 22 20 

CAGGTACGAT TGGTGGCTTG GGTACATTAA TCGGAACCCC GCCATTAATT ATTTTAAAAG 22 8 0 

GACAATACAT GCAACATTTT G G AC AT G AAA TTAGTTTTGC TAAATGGATG ATTGTAGGGA 2 34 0 

TTCCAACGGT CATTGTTTTG TTAGGTATTA CTTGGCTCTA TTTAAGATAT GTTGCGTTTA 24 00 

GACATGATTT GAAATATTTa CCTGGTGGTC AGACGTTAAT TAAACAAAAG TTAGACGAGC 24 60 

TTGGCAAAAT GAAGTATGAA GAAAAGGTAG TACAAACTAT CTTTGTACTT GCTAGCTTAT 2520

TATGGATTAC AAGAGAGTTT CTTCTGAAAA AATGGGAAGT TACGTCATCT GTTGCAGATG 2580 

GTACGATTGC TATTTTTATA TCAATATTAT TATTTATTAT TCCAGCTAAA AAT AC T G AAA 2 64 0 

AACATCGCCG TATCATTGAC TGGGAAGTTG CAAAAGAGCT CCCTTGGGGT GTATTAATTT 2 7 00 

TATTTGGTGG CGGTTTAGCA TTAGCGAAAG GTATTTCTGA AAGTGGTTTA GCAAAATGGT 2 7 60 

TAGGCGAACA GTTGAAATCA TTAAATGGTG TTAGTCCGAT TCTTATTGTA ATTGTCATAA 2820 

CAATCTTTGT CTTATTTTTA ACTGAAGTGA CAT CT AAT AC TGCAACTGCA AC GAT GAT TT 2 8 80 

TACCGATTTT AGCAACGTTG TCTGTTGCTG TTSGAGTGCA TCCATTACTA CTTATGGCAC 2 94 0 

CTGCAGCTAT GGCGGCTAAC TGTGCATACA TGTTACCAGT AGGGACACCA CCGAATGCAA 30 0 0 

TTATCTTTGG TTCTGGTAAA ATATCTATCA AACAAATGGC ATCAGTAGGA TTCTGGGTAA 30 60 

ACTTAATCAG TGCAATAATT ATTATTTTAG TCGTGTATTA TGTAATGCCT ATAGTTTAG 3120 

GTATTGATAT AAATCAACCA CTGCCATTGA AATAGTAATT GCAGATTAGA ACGAAAAATA 318 0 

AAAGGTTACA TTAGCAATTG CTTGGACGAG TGGTAACGAA ACGTATACCG CAGCATCGTG 32 4 0 

TAAAAACAAT ACAAACAAAA GAAAGTCAAC CAAGGATGGA TTCCTATTTT AATCCTTGGT 3300 

TGACTCTTTA TTTTATTTAA ATTGTAGAAC C T AG AAAAT A AAGTTTAATT AAAAGCACCA 3360 

ATCATTTCTA CTTTGAAATC TAAGGTTTCT AAAAT AGCAA TGACTTTCTT TATATCGGTT 3420 

GTAATTGCAG AATCAGCCTG AACGAAAAAT CG AT AC AT AC CTAATTGTGT TTTTAAAGGA 34 8 0 

CGAGACTCAA TCCAGGATAA ATTAATATTA AACAAAGCAA ATGTATTAAG CACACTTGCT 3540

AACAACCCAG GTTTATCATG CATTGGTGTA ATTAAAAACA TCAATGATGT CGCATTTTGA 3600 

TCAAATTGCT GCTGATTTTT TATAACTAAA AAACGTGTCA CGTTATGTGG ATAGTCTTCA 3660 

ATATGTGTAT CAATAGGTGT AAAACCATAA GctTCGCCAC TACCTAAAGG TGCAATTGCT 3720

GCAACGCCAT TTTCAATTTT AGTCAAACTT TGAATTGTAC TGTCGACATA ATCATAGTCA 37 80 

AATTGATGTT GATGTATGTA ATTTGTTGTT TGACTGATTG CTGGTGCAAT AGAATACACT 38 4 0 

TTTTTAATAT CAGAAATGGA ATCTGTTCCA TTACCATATA ATGCAAAGTT AAT AT C T AAA 3900 

CGTATTTCAC CGTGTGCAAA GiCATCTTGC TGTGCAAGTG CATCTGCCAC AATGTTGATT 3960 
GTTCCTTCTA TAGAATTTTC AATAGGGACA ACACCAATCG ATGTGTCATC ATCTGCAACT 4020

GCCTTGATGA CTTCAAATAA ATTTGACTTT GGTTGAAAAG TTGCTTCATT TTCAGAAAAA 4080 

TACTGACGAC AAGCCAAATA TGAAAATGTA CCTTTAGGGC CTAAATATA TAATTGCATA 414 0 

TGCTACACCT CTACTAACTT AATGATGGAA AGGGCACTGG TTAGCATTTG ATTCTTTCTT 4 2 00 

TTTATAGAAA AAGTTTGGAT CTTTTACTGT ATTGTCATAT CCGTGATGAT AATTTGACGT 4 2 60 

CAATGTTGGA GATAATGGCG GTGCTAGCCA AG AC CAT T T T CCGGTAACTT GACGACCTTG 4 320 

TTGTGCTTCG TTACGTTCGA ATAGTTCGAA TTGCTTTGCA GCGGTCAAAT GATCGACAAT 4 380 

TGATACGCCT TCTTTTTTAA AGGAATGATA CACAGCATAG TTCAATTCAA CAAGTGCTCG 4440 

ATCTTTATTA AATGAAT TAT TTTTAAGTGT ATCAAATTCA AACGCATCTG CAACTTTTTC 4 500 

TAGTAAATTG TAACGGTAAT CATCAATAJA GTTACGTACG CCAATTTCAG TTACCATATA 4560 

CCAACCGTTA AAGGGTGCAG TT GGATAT AC AATGCCACCG ATTTTTAAGT C CAT AT T G G A 4 620 

AATGATAGGG ACTGCATACC ATTTTAAGTT CAATTTTCTT AATTTTGGAT AATGATTATG 4 680 

TTCAATAGGT ACTTCTTTAA TTAATGAAGT AGGATATTCG TAAAATTTAA CTGSTCATT 47 4 0 

AGGTAATTGG TAAATCAGTG GTAACACGTC AAAATTAGTA CCTTTTCCTT TCCAACCTAA 4 800 

GTGATTTGCT AAGCGTGTAA CTTCTTTTTC AGCAGGATCA CCACAATTGT CATAGCCAGC 4 8 60 

ATAGCGAATT AATTGATTGT TGAAAATTTT AGGTCCATCC TTTGGAGCAT ATATAGTAAT 4 92 0 

ATACGGCTTT AATTTACCTT CATTTGTAGC CTGTGTAATA TGATAAGTAA TTGATGATAA 4 980 

GAACGATGCT TCGTCAGTAA CATCTCTTGC ATCAATGACA TTTAACGAAT CCCAAAATAA 504 0 

ACGACCAATG CAACGATTTG AATTACGCCA AGCCATTTTA GCACCATAAA TAAGTTCTTC 5100 

TTCTGTATGT GTATATGTCC CAGTTTCTTT TATTTfTAGT TCAATGTCAT GTAAACGTTT 5160 

ATTGATAATT TGCGTTTCAT AATGACACTC TTTATACATG TTTTCTATGA AAGCTTGAGC 5220 

CTCTTTAAAT AACATTAACA ACACCTCGCT T TAT AT TATA GTCTACATTA TTAAAATACT 52 8 0 

CTTAAAAATT ATGTATATGT CAT T AAAT T G TTGGTTGATT TTAATTAAAA GTATGGAAAT 534 0 

TAAGGGGCTC TTATGTATAT AAAAAAATGA AT TAT GAT AA AATGTAAGAA AATATTTAGG 54 00 

TCGATTGGAG AG AT AC AAG T GTACCAATTA GAAGACGACA GTTTAATGTT ACATAATGAC 54 60 

T TAT AT C AAA TAAATATGGC TGAAAGTTAT TGGAATGATA ATATTCATGA AAAAATGGCT 5520 

GTATTTGATT TGTATTTTAG AAAAATGCCA TTTAATAGTG GCTATGCTGT TTTTAATGGT 5580 

TTAAAACGCG TCATTGATTT TATAGAACAT TTTGGATTTT CAGAATCTGA TTTGGAATAT 5 64 0 

TTAAAGTCTA TTGGCTACAA GGATGATTTC TTATCATATT TAAAAGATTT AAAATTCACA 57 00 

GGCAGCATCC GTTCGATGCA AGAAGGCGAA TTATGCTTTG GTACGAACC ATTGTTACGC 57 60 
GTAGAAGCAC CATTGATTCA AGCGCAATTA ATAGAAACAA TTTTATTAAA CATTGTAAAT 5820

T T CC AT AC AT TAATTACAAC AAAGGCTAGC AGAATTCGTC AAATTGCATC AAATGATAAA 58 8 0 

TTAATGGAGT TTGGTACACG TCGTGCGCAA GAAATTGATG CAGGATTGTG GGGCGCTAGA 594 0 

GCTGCTTACA TCGGGGGCTT TGATTCTACA AGTAATGTTA GGGCGGGGAA ATTATTTGGT 6000 

ATACCTGTGT CTGGTACACA TGCACATGCA TTTGTCCAAA CTTATGGAGA CGAATATGTT 60 60 

GCCTTCAAAA AATATGCTGA AAGACATAAA AATTGTGTGT TCCTAGTAGA TACATTCCAT 6120 

ACTTTAAAAT CTGGCGTGCC AAATGCAATA AAAGTTGCAA AAGAATTAGG TGACAAAATT 6180 

AACTTTGTAG GTATTCGATT AGATTCTGGA GATATCGCTT ATTTATCTAA AGAGGCAAGA 62 4 0 

CGTATGCTTG ATGAAGCAGG ATT T AC T G AA ACTAAAATTA TCGCGTCTAA TGATTTGGAT 6300 

GAAGAAACGA TTACGAGTTT GAAAGCACAA GGTGCAAAAG TAGATTCTTG GGGCGTTGGT 6360

ACAAAGCTGA TTACAGGATA CGATCAACCA GCATTAGGTG CAGTATATAA ACTTGTAGCT 64 2 0 

ATTGAAAATG AAGATGGTTC ATATAGTGAT CGTAT TAAAT TATCAAATAA CGCTGAAAAG 64 80 

GTTACGACGC CAGGTAAGAA AAATGTATAT CGCATTATAA ACAAGAAAAC AGGTAAGGCA 654 0 

GAAGGCGATT ATATTACTTT GGAAAATGAA AATCCATACG ATGAACAACC TTTAAAATTA 6600 

TTCCATCCAG TGCATACTTA TAAAATGAAA TTTATAAAAT CTTTCGAAGC CATTGATTTG 6660 

CATCATAATA TTTATGAAAA TGGTAAATTA GTATATCAAA TGCCAACAGA AGATGAATCA 67 20 

CGTGAATATT TAGCACTAGG ATTACAATCT ATTGGGATG AAAATAAGCG TTTCCTGAAT 67 8 0 

CCACAAGAAT ATCCAGTCGA TTTAAGCAAG GCATGTTGGG ATAATAAACA TAAACGTATT 68 4 0 

TTTGAAGTTG CGGAACACGT TAAGGAGATG GAAGAAGATA ATGAGTAAAT TACAAGACGT 6 900 

TATTGTACAA GAAATGAAAG TGAAAAAGCG TATCGATAGT GCTGAAGAAA TTATGGART 6 960 

AAAGCAATTT ATAAAAAATT ATGTACAATC ACATTCATTT ATAAAATCTT TAGTGTTAGG 7020 

TATTTCAGGA GGACAGGATT CTACATTAGT TGGAAAACTA GTACAAATGT CTGTTAACGA 7080 

ATTACGTGAA GAAGGCATTG ATTGTACGTT TATTGCAGTT AAATTACCTT ATGGAGTTCA 7140 

AAAAGATGCT GATGAAGTTG AGCAAGCTTT GCGATTCATT GAACCAGATG AAATAGTAAC 7 2 00 

AGTCAATATT AAGCCTGCAG TTGATCAAAG TGTGCAATCA TTAAAAGAAG CCGGTATTGT 7260

TCTTACAGAT TTCCAAAAAG GAAATGAAAA AGCGCGTGAA CGTATGAAAG TACAATTTTC 7 32 0 

AATTGCTTCA AACCGACAAG GTATTGTAGT AGGAACAGAT CATTCAGCTG AAAATATAAC 7 38 0 

TGGGTTTTAT ACGAAGTACG GTGATGGTGC TGCAGATATC GCACCTATAT TTGGTTTGAA 74 40 

TAAACGACAA GGTCGTCAAT TATTAGCGTA TCTTGGTGCG CCAAAGGAAT TATATGAAAA 7 5 00 

AACGCCAACT GCTGATTTAG AAGATGATAA ACCACAGCTT CCAGATGAAG ATGCATTAGG 7560

TGTAACTTAT GAGGCGATTG ATAATTATTT AGAAGGTAAG CCAGTTACGC CAGAAGAACA 7 620 
AAAAGTAATT GAAAATCATT ATATACGAAA TGCACACAAA CGTGAACTTG CATATACAAG 7680

ATACACGTGG CCAAAATCCT AATTTAATTT TTTCTTCTAA CGTGTGACTT AAATTAAATA 77 4 0 

TGAGTTAGAA TTAATAACAT TAAACCACAT TCAGCTAGAC TACTTCAGTG TATAAATTGA 7 800 

AAGTGTATGA ACTAAAGTAA GTATGTTCAT TTGAGAATAA ATTTTTATTT AT G AC AAAT T 7 8 60 

CGCTATTTAT TTATGAGAGT TTTCGTACTA TATTATATTA ATATGCATTC ATTAAGGTTA 7 920 

GGTTGAAGCA GTTTGGTATT TAAAGTGTAA TTGAAAGAGA GTGGGGGCC TTATGTCATT 7 98 0 

CGTAACAGAA AATCCATGGT TAATGGTACT AACTATATTT ATCATTAACG TTTGTTATGT 804 0 

AACGTTTTTA ACGATGCGAA CAATTTTAAC GTTGAAAGGT TATCGTTATA TTGCTGCATC 8100 

AGTTAGTTTT TTAGAAGTAT TAGTTTATAT CGTTGGTTTA GGTTTGGTTA TGTCTAATTT 8160 

AGACCATATT CAAAAT AT TA TTGCCTACGC ATTTGGTTTT TCAATAGGTA TCATTGTTGG 82 20 

TATGAAAATA GAAGAAAAAC TGGCATTAGG TTATACAGTT GTAAATGTAA CTTCAGCAGA 82 8 0 

ATATGAGTTA GATTTACCGA ATGAACTTCG AAATTTAGGA TATGGCGTTA CGCACTATGC 8340 

TGCGTTTGGT AGAGATGGTA GTCGTATGQ? GAT GC AAAT T TTAACACCAA GAAAATATGA 8 4 00 

ACGTAAATTG ATGGATACGA TAAAAAATTT AG AT C C G AAA GCATTTATCA TTGCGTATGA 8 4 60 

ACCTCGAAAC ATACATGGTG GATTCTGGAC TAAAGGCATT CGTCGTAGAA AGCTTAAAGA 8520 

TTATGAACCA GAAGAACTGG AAaGTGTAGT AG Aa CAT G AA aTTCmAAGTA AaTSGAaTG 8 58 0 

AAmCAATtGC TGATTGTTTG TCACGAATGA AAtGCAAGGG TATATGCCGG TAAAACGTAT 8 64 0 

TGAAAAACCC GTGTTTCAAG AG C AAAAAG A TGGCACGGTT GAAGTATCAC AT C AAG AAAT 8 7 00 

CGTTTTTGTA GGTAAGAAAA TCCAATAACA TAATCCAATT TAAATAAAGA CTATTTGAAG 87 60 

AGGAAAGGCT ATTCAAAGTT TGAGTAATTT TACTTTGAAT AGCCTATTTG TTTATACATG 8820 

CAAGATGCTC GATCCATATT G TAT GAG AAA CCCCCAGCAA GCTATATAAA GCATATGCTG 88 8 0 

GGGGTTCTTA ATATTTTAAA AATTATTGTT AGATTATATA TATCGTCGCT TTTTCTAAAA 8 94 0 

CAATCTCATC GCATGAAATT TTTTCTTCCT AGAGACTTT AATAAGATTA ATAGTTTACT 9000 

TAATCATATC TAGATAGTCT TATGACTTAT GCTTAATGAA AGTCATTCTA GGAGAAGTTC 90 60 

CCAAAGCTTC TGTGTTCATA ATTGTTAGTA GTATTTTATT ATCATTTGGT AT AAAT AT T T 9120 

CAATAACAAT TGAGCTATTA TTTTTATTAT ATAATGTGAG TTGTTTGTGT TCTGTATTTA 9180 

TATATTtGCA AATGTAATCA GAATCGTCTG AGGTACTGAG AATAAATTTT AAAGGACTAt 92 4 0 

CATTTAAATC TTGAGGATGC CATTCTCCCT CAATAATATT AAGATAATAC TTAGCCTCTG 93 00 

AATTACATTT GAATTTATCA ATACTAAATA ATTCAATTTG TTCCATAATA TTATTTACCT 93 60 

TTCTAAAATA CAAATTTTAA TAACCATAAA TAGATGAATA CCATCGATAA TGGTCGCCAT 94 2 0 
TGGATACTGG AATAACATTG TTTTTAGCAT CTTGAGTCAT AAAACCATTA TCCCATGGAT 9480

TCCATATAAT TATAACCTCT TGTCCATTAT CTAATTTAGC GTTCCCAACA ACTGCCATGG 954 0 

CATGCCCTGC GTGCATACCA TTTCTTGATT CTACTCTACT AC 0? AAAAC A GCAATTCCTT 9600 

TATTATTTTT AGTAAGATTG TCAACTTCAT TATATGTAGT CATTCTATTA AGAAGTTGTG 9660 

GACTTCTTCC CTGAGTTTGT CCAAAATAAA TCATCTCTCT TGGCGTTAAA CCAGTAAATT 9720 

GGAATCGTTG TCCTTGTAAG TTTGGGTGTA AAAAT C T CAT CACAGCTTCT GCATGATATT 97 80 

TGTTAGTATT ATAAGTCGCA TTTAGTAATT CAGACATCGT ATAGCCTGCA CACCAACCAT 98 4 0 

TGTTACCTTG AGTTTCTCTT ATCTTGAAAT TCTCAAGTTT ATTTATATAT TGsTCGTTGT 9900 

AAGTATAATT ATTACTTTTA AATTGACTAG TTGGCATAGT GACAGAAGCT TTTTGCTTTA 9 9 60 

GTTGCGTTAC ATTATTGCCA GTAGGTATAC TCTCAGTCTT TnTnAACTnT nTATCTTCTA 10020 

GACGTGGTGT TTTTAGTACT AGTTTAGCTT TATGATTTTG AGTACCACAT AGTAACCTTT 10080 

TGAGTTGT 10088 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7563 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

CGGAAACGnA CCCnATGCGT ATGCTTGACG TGCCAAAATT AAATACGAG TTCATAGCTT 6 0 

TGAGGTACCA GAAGAACATT TATCTGGTCA AGAAGTCGCA GnACTCATAC AAGCAAATGT 12 0 

TAAAACAGTA TTTAAAACGC TTGTTCTAGA AAATACAAAA CATGAACATT TTGTATTTGT 180 

TATCCCAGTA AGTGAAACTT TAGATATGAA AAAGGCAGCT GCTTTGGTTG GAGAGAAGAA 240 

ATTGCAGCTT ATGCCTTTAG ATAATTTGAA AAATGTAACG GGATACATTC GTGGTGGGTG 300 

TTCGCCTGTT GGTATGAAAA CATTGTTTCC AACAGTCGTT GACAAATCGT GTGAAAATTA 3 60 

TAGTCATATC AGTGTGAGTG GTGGGCTTCG AACAATGCAA ATCACAATAG CTGTTGAGGA 4 20 

TTTGATTACA ATAACTAAAG GCAAAATTGG AGCAGTTATC CATGAATGAT TAATAACAAC 4 80 

AAAAAGTATG GGGCAAGATT AGGAGTGTGA C AG AG AT G AA ATTTTTATAA AATTCATTTC 54 0 

TGCCACACTC CTTTTTGATT GAATTAGCAT T T T AC GAT C A TAAACAGTCA TTATAATTGA 600 

GTATTTGAAC ATAAAAATGT AATTTTATCG TAACAATTTG AGTGTTTGTG ATTGTTTTG 660 

GTAATTTATG ATTGAAAAGT GAAAGCGTAC TCATTATAAT ACAAAGT GAG ATGGGGTGAT 720 

GATGATAATT AC TG aAAAAA GACACGAGTT AAT AT T AG AA GAACTTTCGC AC AAAG AT T T 7 80 
TTTGACTTTA CAAGAATTAA TAGATCGAAC TGGTTGCAGT GCTTCAACAA TACGArGAGA 840

TTTATCTAAA CTACAACAAT TAGGGAAATT GCAACGTGTG CATGGTGGTG CAATGTTAAA 900 

AG AAAAT C G T ATGGTTGAGG CGAATTTAAC TGAAAAATTA GCAACGAATC TTGATGAAAA 9 60 

GAAAATGATT GCTAAAATAG CAGCTAATCA AATCAACGAT AATGAATGCT TATTTATCGA 1020 

TGCTGGTTCA TCTACATTGG AGCTAATTAA ATATATEAA GC GAAAGAT A TCATTGTGGT 108 0 

AACCAATGGT T T AAC AC AT G TAGAAGCTTT ACTTAAAAAA GGTATTAAAA CAATTATGCT 114 0 

AGGTGGTCAA G T T AAAG AAA ATACACTTGC TACGATTGGT TCTAGTGCTA T GGAGAT AT T 12 00 

AAGACGATAT TGTTTCGATA AAGCTTTTAT CGGGATGAAT GGATTAGATA TTGAACTTGG 1260 

ATTAACTACT CCCGATGAGC AAGAGGCATT AGTTAAACAA ACAGCAATGT CATTAGCCAA 1320 

TCAATCATTT GT ACT TAT AG ATCATTCTAA GTTTAATAAA GTATATTTTG CTCGTGTACC 13 8 0 

TTTGCTAGAA AGTACGACAA T CAT C AC AT C TGAAAAAGCA TTAAATCAAG AATCGTTAAA 14 4 0 

AGAATACCAA CAAAAGTA1C ACTTTATAGG AGGGACTTTA TGATTTATAC AGTGACTTTC 1500 

AATCCTTCAA TTGACTATGT CATTTTTACG AATGATTTTA AAATTGATGG TTTGAACAGA 15 60 

GCAACAGCAA CATATAAATT CGCTGGGGGG AAAGGTATTA ATGTCTCGCG CGTCTTAAAG 1620 

ACATTGGATG TTGAGTCAAC TGCCTTGGGA TTTGCAGGTG GATTCCTGG GAAATTCATT 168 0 

ATAGATACAT TAAATAACAG TGCAATTCAA TCGAATTTTA TTGAAGTTGA TGAAGATACA 17 4 0 

CGTATTAATG T GAAAT T AAA AAC AG G AC AA GAAACAGAAA TCAATGCACC GGGTCCTCAT 18 00 

ATAACGTCAA CACAATTTGA AC AAC T G T T A CAACAAATTA AAAAT ACAAC AAGCGAAGAT 18 60 

ATAGTTATTG TTGCTGGAAG TGTACCAAGT AGTATTCCAA GCGATGCGTA TGCGCAAATT 192 0 

GCACAAATTA CAGCACAGAC AGGTGCTAAA TTAGTAGTCG ACGCTGAAAA AGAATTGGCT 198 0 

GAAAgCGTTT TACCATATCA TCCACTATTT ATTAAACCTA AT AAAG AT G A ATTAGAAGTG 204 0 

ATGTTTAATA C AAC AG T G AA CTCAGJCACA GATGTTATTA AATATGGTCG TTTGTTAGTT 2100 

GATAAAGGTG CGCAATCTGT TATTGTCTCG CTTGGCGGTG ATGGTGCTAT TTATATTGAT 2160 

AAAG AAAT C A GTATTAAAGC AGTTAATCCA CAAGGGAAAG TGGTTAATAC AGTTGGCTCT 222 0 

GGTGATAGTA CAGTTGCAGG CATGGTGGCT GGAATTGCTT CAGGTTTAAC STTGAAAAA 2280 

GCATTCCAAC AAGCAGTCGC ATGCGGTACT GCCACGGCAT TTGATGAGGA CTTAGCAACA 234 0 

CGGGACGCTA TAGAAAAAAT AAAAT C AC AA GTTACGATTA GCGTACTTGA TGGGGAGTGA 2400 

AAATAATGAG AGTAACAGAG T T AT T AAC AA AAGATACAAT AGCAATGGAT TTAATGGCAA 2 4 60 

AT GACAAAAA TGGTGTTATT GATGAGTTAG TAAATCAATT AGACAAAGCA GGTAAATTAA 2 52 0 

GTGATGTCGC GTCATTTAAG GAAGCGATTC ACAATCGAGA ATCACAAAGT ACAACTGGTA 2 58 0 
TCGGCGAAGG TATTGCCATT CCACATGCCA AAGTGGCCGC AGTTAAGTCA CCAGCTATTG 2640

CGTTTGGTAA ATCTAAAGCA GGCGTAGATT ATCAAAGTTT GGATATGCAA CCAGCACACT 2700 

TATTCTTTAT GATTGcAGcG CCAGAAGGTG GCGCCCAAAC ACATCTAGAT GCTTTAGCTA 27 60 

AGTTGTCTGG TATTTTAATG GATGAAAATG T AC G T GAG AA AT TAT T AC AT GCTTCATCAC 282 0 

CTGAAGAAGT ACTAGCGATC ATAGATGAGG CTGATGATGA AGTGACAAAA GAAGAAGAK 2880 

CAGAAGCTGA AGCACAACAA GTTGCAACTG CAGAACAATC ATCTAAACAA TCTAATGAGC 2 94 0 

CATATGTGTT AGCAGTAACT GCTTGTCCAA CAGGTATTGC ACACACATAT ATGGCACGTG 300 0 

ATGCATTGAA AAAGCAAGCG GATAAAATGG GTATTAAAAT TAAAGTAGAA ACGAATGGTT 3060 

CAAGCGGCAT TAAAAA.CCAT TTAACTGAAC AAG AT AT T G A AAATGCAACA GGTATCATTG 3120 

TTGCTGCTGA TGTTCATGTT GAGACGGATC GCTTCGATGG TAAAAATGTC GTAGAAGTAC 318 0 

CAGTAGCAGA TGGTATTAAA C G C C C AG AAG AAT TAATTAA T AAAGC AT T A GATACAAGTC 324 0 

GTAAACCTTT TGTTGCCCGT GATGGTCAAA GAAAAGGTAA CTCAAATGAC AGTCAAGAAA 3300

AAT TAAGCCC AGGTAAAGCA TTCTATAAAC ACTTAATGAA CGGTGTTTCT AACATGTTGC 3360 

CACTTGTAAT ATCTGGTGGT ATTTTAATGG CAATTGTATT TTTATTTGGA GC AAAT T CAT 34 20 

TTAATCCAAA AAGCTCAGAG TACAATGCGT TTGCAGAGCA GCTTTGGAAC ATTGGTAGTA 3480

AAAGTGCATT CGCGTTAATC ATTCCAATTT TATCTGGATT CATTGCACGT AGTATTGCGG 3540 

ATAAACCTGG TTTCGCTTCA GGTCTTGTAG GTGGTATGTT AGCAATTTCA GGTGGTTCAG 3600 

GATTTATTGG TGGTATTATT GCAGGTTTCT TAGCAGGTTA CTTAACACAA GGTGTTAAAG 3660 

CCATGACACG TAAGTTACCA C3YGCATTAG AGGGAT T AAA GCCAACATTA ATTTATCCAC 3720 

TATTAACAGT GACGGCTACA GGCTTATTGA TGATTTATGC CTTTAATCCA CCAGCATCTT 3780 

GGTTAAATCA TTTGTTATTA GATGGATTAA ACAATTTATC AGGTTCTAAT ATTGTATTAT 384 0 

TAGGTTTAGT TATTGGCGCT ATGATGGCGA TTGATATGGG CGGTCCATC AACAAAGCGG 3 900 

CATATGTTTT TGCAACAGGT GCGTTGATTG AAGGTAATGC AGCACCAATT ACAGCTGCAA 3 960 

TGATTGGTGG TATGATTCCA CCGTTAGCAA TTGCGACAGC GATGTTAATT TTTAGACGTA 4 02 0 

AAT T T AC AAA AGAACAACGT GGTTCAATTA TCCCTAACTA TGTGATGGGT ATGTCATTTA 4 08 0 

TTACAGAAGG TGCGATTCCA TTTGCAGCTG CCGATCCATT ACGTGTTATT CCTTCAATGA 414 0 

TGATTGGTTC AGGTATAGGT GGCGCAATTG CTTTAGGCTT AGGTTCACGA ATTACTGCGC 4 200 

CACATGGTGG TATTATTGTA ATTGTTGGTA CTGATGGTGC ACACTTACTT CAAACTCTTA 4 2 60 

TTGCACTTCT AG T T GGC AC A TTAGTTTCAG CATTAATTTA CGGTTTAATC AAACCAAAGT 4320 

TAACTGAAAC AGAAATCGAA GCTTCAAAAT CAATGGACGA GTAGTTTTAA TGATGTAAAA 4 38 0 

TGATTGTTAG CAAAGAGCTT CATATTAAGT TGTATGTTCA AT G AAT AT AT GTTAGTTTTA 4 44 0 
TATATCGTGT TAACGGTAGC TTATACAAAG CTGTAAAAAC ACTTTCTATT AATTSGTTT 4500
TTATGAATTG AT AT G AAAG T GTTTTTATTT TTAGATAAAT GAATGAAGAA ATAGACACCA 4 5 60 

CAAATGTATA GACTTTTTTA ATATTTTGCA AAAAGTTATG CCAAACGAAG C AG AT AT AG T 4 620 

AAAAT AT GAG TGTCTTAAAG TGAAAATTTA TAAATAAAGA AGGGTTTATA CGTGTCAGAA 4 680 

T TAATTAT AT ATAACGGCAA AGTTTATACT GAAGATGGCA AAATCGATAA TGGTTACATT 474 0 
CATGTGAAAG ATGGACAGAT TGTTGCAATT GGAGAAGTGG ATGATAAAGC AGCAATTGAT 4 800 

AATGATACGA CAAAT AAAAT TCAAGTGATT GAT GCT AAAG GTCATCATGT ATTACCAGGT 4 8 60 

TTTATTGATA TACATATTCA TGGTGGTTAT GGTCAA&TG CAATGGATGG GTCATACGAT 4 92 0 
GGCTT AAAAT ATCTATCCGA AAATTTGTTG TCTGAAGGGA CGACATCATA CTTGGCCACT 4 98 0 

ACAATGACGC AATCGACTGA T AAAAT AGAT AATGCACTTA CAAATATTGC TAAATATGAA 504 0 

GCGGAgCAAG ATGTTCACAA TGCAGCGGAA ATTGTAGGTA TACATTTAGA AGGACCATTT 5100 
ATATCTGAAA ATAAAGTTGG TGCTCAACAT CCGCAATACG TTGTACGCCC ATTTATCGAT 5160 
AAAATTAAAC ATTTTCAAGA GACTGCTAAC GGATTAATAA AGATTATGAC GTTTGCACCT 522 0 

GAAATTGAAG GTGCAAAAGA AGCGCTTGAA ACGTATAAAG ATGACATTAT TTTTTCAATT 5280 

GGTCATACAG TAGCAACATA CGAAGAAGCA GTTGAAGCTG TTGAGCGAGG AGCTAAACAT 534 0 
GTCACGCATT TATATAATGC AGCGACGCCA TTCCAACATA GAGAACCAGG TGTTTTTGGA 5400 
GCAGCATGGT TGAATGATGC TCTACATACC GAAATGATTG TTGATGGCAC TCATTCTCAT 54 60 

CCGGCATCGG TTGCAATTGC TTACCGTATG AAAGGTAATG AACGTTTTA TTTAATTACC 5520 
GATGCAATGC GTGCAAAAGG TATGCCTGAA GGAGAATATG ATTTGGGTGG ACAAAAAGTA 558 0 

ACTGTTCAAT CGCAACAAGC ACGTCTTGCA AATGGTGCGC TTGCTGGTAG TATTTTAAAA 5 64 0 

ATGAATCATG GGTTACGTAA CTTAATATCA TTTACAGGTG ATACATTAGA TCATTTATGG 57 0 0 

CGAGTAACAA GTTTAAATCA AGCCATTGCA TTAGGTATCG AT GAT AG AAA AGGTAGTATT 57 6 0 

AAAGTAAATA AGGATGCAGA TCTTGTTATT CTAGATGATG ATATGAATGT AAAAT C T AC A 582 0 

ATAAAACAAG GCAAGGTTCA CACATTTAGC TAATAAATAA TCATAATTAA ATGTATGCAA 58 8 0 

TAGATTTAAT CTGTTAACAT AAGCACTTTA TATTATGATA AAATAGAAGC AATAACATTT 5 94 0 
TTTTCTGGGG GTGTCTAAAT GGGAAGGCGA TAACATGTAG TTGTAATTTA AGTCATAGTG 6000 
ATAAATTTGA ATGCGTGTTA CCCATGAGTG ACACATATAA CATGGAGGTG AATCCCTAGA 6060 

AATAGGGAAT TAATTGGAAA CTTCGAC CAT AATTAGTTTG ATTATATTTA TCTATTAAT 6120 
TGCATTAACC ACTGTATTTG TTGGTTCAGA ATTTGCATTA GTAAAAAT T A GAGCAACAAG 6180 
AAT TGAACAG CTAGCAGATG AAGGAAATAA ACCTGCTAAA AT AG T AAAAA AGATGATTGC 624 0 
TAATCTAGAT TATTATCTTT CTGCTTGTCA GTTAGGTATA ACAGTAACAT CTTTAGGGTT 6300

AGGTTGGCTT GGTGAACCAA CGTTTGAAAA GCTATTACAC CCAATATTTG AAGCAATCAA 63 60 

TTTACCAACT GCATTAACGA CGACGATTTC GTTTGCAGTG TCATTTATAA TCGTTACGTA 64 20 

TTTGGATGTA GTACTTGGTG AATTAGCGCC TAAAT CTATA GCTATTCAAC ATACTGAAAA 64 8 0 

GCTTGCTTTA GTATATGCAA GACCATTGTT CTATTCGGT AACATTATGA AACCATTGAT 654 0 

TTGGCTGATG AATGGTTCTG CACGTGTTAT TATTAGAATG TTTGGTGTAA ATCCTGATGC 6600 

CCAAACTGAT GCAATGTCAG AAGAAGAAAT CAAAATTATT ATTAACAATA GTTATAATGG 66 60 

TGGAGAAATC AACCAAACTG AATTGGCATA TATGCAAAAT ATCTTTTCAT TCGATGAAA 67 2 0 

AC AT G C AAAA GATATAATGG TACCTAGAAC TCAAATGATT ACACTAAATG AACCTTTTAA 67 8 0 

TGTAGACGAA TTACTAGAAA CAATAAAAGA ACATCAATTT ACGCGTTATC CAATTACTGA 68 4 0 

TGATGGTGAT AAAGACCACA TTAAAGGATT TATTAACGTC AAAGAATTTT TAACTGAATA 6900 

CGCTTCTGGA AAAA CG AT T A AAATAGCAAA C TAT AT a CAT GAGTTGCCAA T GAT T T C AG A 6960 

GACAACACGT ATCAGTGATG CATTAATTAG AATGCAACGT GAACATGTAC ATATGAGTCT 7 020 

TATTATAGAT GAATATGGTG GAACGGCAGG TATTTTAACG ATGGAAGATA TTTTAGAAGA 7080 

AATCGTTGGA GAAATTCGTG ATGAATTTGA T GAT GAT G AA G T GAAT GAT A TCGTTAAAAT 714 0 

TGATAATAAG ACATTCCAAG TAAATGGCAG AGTACTATTG GAT GAT T T AA CTGAAGAGTT 7 2 00 

CGGTATAGAA TTTGATGACT CTGAGGATAT TGATACGATA GGTGGATGGT TACAATCTCG 72 60 

TAATACCAAT TTACAAAAAG ATGATTACGT GGATACAACT TATGATCGCT GGGTTGTTTC 7320

AGAAATCGAT AACCACCAAA TTATTTGGGT GATATTAAAC TATGAATTTA ATGAAGCGAG 7380 

ACCTACTATC GGACAGTCTG ATGAAGATGA AAAATCAGAA TAGATATTAA TATATAAACC 74 4 0 

AACTAAGAAT GATTTAATTC ATTTTTGGTT GGTTATTTTT T T G AC T AAAA TTAAnGAAAA 7500 

GTGAAAATAG TATTGGAACT CffiTATCTTT AATGATTTAA TGAATAAnTT TTATTGAAAG 7 5 60 

CGA 7563
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3492 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

TTATATCAAC TTCATGGCGG AACCATTGAT GACCCATTAG ACGAAACAAT AAGCGCATTT 60 

sATGAATTGA AACAAGAAGG AATTATACGT GCTTACGGTA TTTCTCTAT TCGCCCAAAT 12 0 
GTAATTGATT ATTATTTAAA ACATAGTCAA ATCGAAACGA TAATGTCTCA ATTCAATTTG 180

ATTGATAATC GTCCAGAATC ATTATTAGAT GCAATTCACA ACAATGATGT TAAAGTATTG 24 0 

GCAAGAGGAC CTGTGTCTAA AGGATTATTA ACTTCAAACA GTGTTAATGT GCTCGACAAT 300 

AAATTTAAAG ATGGTATTTT TGATTATTCT CATGATGAAT TGGGTGAAAC AATAGCCTCT 3 60 

AT T AAAG AAA TTGAAAGTAA TTTATCTGCA TTGACATTTA GTTATTTAAC AT C AC AT G AC 420 

GTGCTTGGTT CCATCATTGT AGGTGCAAGT AGCGTCGACC AATTAAAAGA AAATATTGAA 4 80 

AACTATCATA CTAAAGTTAG TTTAGAXAG AT T AAAAC AG CAAGAGCTCG TGTAAAGGAT 54 0 

TTGGAATATA CCAATCATTT AGTGTAGAAG TCATTTTCAG TAATAAAAAC AGCAGCATGA 600 

GGCGTTTCAT TATAAAAATG CCTTACTGCT GTTGTTTATG TACAATTCGC TATAATTTAT 6 60 

GATTATGATT ACTCACTTAT GATAGAAATT AAAGCGTTGT CCTCACGCAT. CfiTATTTAG 720 

TAATTTCGCC TTGCGGCATT GCCTTAAGCA AACTTCTGCC ACTTCATCTC TTAATAATTT 7 80 

TATTAAAACA TCTTTCTATA TTTCACTTCG CATGTTGATT CATCATTATT AGTTATTATT 840 

TGTACACCCA GCACATTTCC TTGCAACACA AGTAGTTTGA ATTTTTCACA AGTATAATAT 900 

AATGTACCGT CTGAAATTTG GTCTACAGAA ATATCGCCTA AAATATCCAG CACTGTAAAT 9 60 

TCTTCAAATA CTGATAGTTG TTCCGCATAT CGTACACAAA GTCTTACCAC ACTCTCCGAT 1020 

TGACAGTTCA TTGCCATCCC ACCTATTTAT GCTTTATTTT TAAATAATTT AGGGAAACAT 1080 

CGTTCAAAAA ATCTAGGCGC AATTTGATAC ATTTCAACG CATGaTGCAT CCATTTAGGC 114 0 

CGATTAATTT CCAATTGTTT TGTTTTAATG CCATAAATGA TATCTTCTGC AAGCTGATTA 1200 

GCATCAAGCA TAATTTCCCC CATCTTTTTA gCATACTTCA TTGATGGGTC GGCTTTTTGA 12 60 

TGAAAAGGTG TATCAATCGG GCCAACATTA ACTGTCATGA TATGTAAGTT TGGTGACTCT 132 0 

AGTCTTAAAG CATTCATTAA TGCATAAAAC CCTGCTTTCG ATGCCCCATA ATGTGCAGCA 1380 

TTTGCTTGTG TGGAAAATGC AGCTTGACTT GAAAT AC C T A CAATATGTGC GTTAGATGTT 14 4 0 

AAATATGGTC TCAACACAGT ATATAAAACA TTAAAACTAA TTAAATTAAG CTGATACGTT 1500 

TCAATCATTT CTGAAAAACT ATGGTCTGAA ATAGATTTGA AATAACCTAA ACCTGCACTA 15 60 

TAAATGAATC CATCGAATGA TGTATTGTCT TCAAATTGCA GTGCCTGTAT CGACTTCAAA 1620 

TCATTTAAGT CACAAGGAAT AACATTTATA GTTTTCCCCA ATTCCTGTTC AAAGATTCTA 168 0 

GTTGCTTTAT CAACATCACG CACCAACAAC GTTACATGCA CTTTATTTTC TAGTAACTTT 17 4 0 

CGGACAATCG ATAAACCTAA ACCACTCGTA CCACCAGTCA CTATAAAATG TTGTCCTTTC 1800 

ATCAATTAAC CTTCCTTTTC AATTATATAG AATGCAATTT ATCAACTTTA CATAATTGAG 18 60 

ACAAGTTGAT TATCTTTCCT AATATATATA CAATAATAAG AAAATATAAC ATACAAATCA 1920

AAAACTAAAG GGATGTGaCG TTAATGrAAC TCGTATTTTA TGGAGCTGGT AATATGGCAC 1980

AAGCTATATT TACAGGrATT ATTAACTCmA GCAACTTAGA TGCCAATGAT ATATATTTAA 204 0 

CAAATAAATC TAATGAACAA GCTTTAAAAG CATTCGCTGA AAAACTAGGT GTTAACTATA 2100 

GTTATGAtGA TGCGACATTA TTmAAGATG CAGAyTATGT ATTTTTAGGT ACCAAACCAC 2160 

ATGACTTTGA TGCTCTAGCA ACACGCATCA AACCACATAT TACAAAAGwC AATTGCTTCA 22 2 0 

TTTCAATTAT GGCAGGTATT CCGATTGATT ATATTAAACA ACAATTAGAA TGCCAAAATC 22 8 0 

CaGTTGCTAG AATTATGCCA AACACAAATG CGCAAGTTGG ACACTCTGT ACTGGCATTA 2 34 0 

GTTTTTCAAA CAACTTTGAC CCTAAATCTA AAG AT GAAAT TAACGATTTA GTTAAAGCAT 24 00 

TTGGTTCTGT AATTGAAGTA T C AG AAG AT C ATTTACATCA AGTAACAGCT ATCACCGGAA 24 60 

GCGGCCCAGC ATTTTTATAT CATGTATTCG AGCAATATGT TAAAGCTGGT aCsAAACTTG 2 52 0 

GTCTAGAAAA AGAACAAGTT GAAGAATCTA TACGCAACCT TATTATAGGT ACAAGTAAGA 25 8 0 

TGATTGAACG TTCAGAtTTG AGCATGGCTC AATTAAGAAA AAAT AT T AC C TCTAAAGGTG 2 64 0 

GTACGACACA AGCTGGCCTT GATACATTGT CACAATATGA TTTAGTATCT ATTTTCGAAG 27 00 

ATTGTCTAAA CGCTGCCGTC GACCGTAGTA TTGAACTTTC TAATATAGAA G AC C AAT AAA 27 60 

AACAAACCCG CCAACACATG TATGCATCAT CGCAAGCACT GTGTTTGACG GGTTATTTTT 28 20 

ATAATTTATT GTTATTTGGC AAGCATTGTT TATTACTTTG TC AT TAG ATT TTAAAACTAT 28 8 0 

CAAAATCTTT TACAAAATTA AAATTAGGTG TATCTTCATT TTGTATCAAT GTTTGAAAA 2 94 0 

TTTCATTTAT ATCTTCTGTA TTATAGCGAT TGCTCAAATG TGTAATCAAC GTACGTTTAA 3000 

CATTGGCTTC TTTTATCAAT GCAAATACGT CTTCAATATG GCTATGATGA TAATTGTTGG 30 60 

CTAAATGCTT TTCACCATCT ATATAGGTCG CTTCATGTAC CAT C AC AT C A GCATCTCTAG 312 0 

AAATCACACG TTCATTAGAA CATGGTTTTG TATCACCAAA AATTGCTACA ACTGGACCCT 3180 

GTTTGGACTC ACCTCTAAAA TCTTTTGATT GATAAACTTG ACCATTATGT TCAAATGTAT 32 4 0 

CAT GAG AT T T TACTTCTTGA TATTTAGGAC CTGGTTCAAG ACCAATGTTT TTTAACGCTT 3300 

CAACATTGAT TGTACCTGTA GTTTCAGGTG CCATTACET ATATCCATAT GAT GG AAT AC 3360 

CAT GAT T AAG TAAATGCGCC T C T AC AG T AA AACCATCCAT GATGATATGT C AG AT GAT C A 34 2 0 

TCGATTTCAA TATATGtAAT TGGATAGTTT AAATGTGACT CTGATAAATT CATAGACATT 34 80 

TCCACATATG CT 3492
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1973 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

ATCTAGCGGT ACAAGCGTCT TGGAGGCTAG TATGTTGAAC ATTGTAAACC CTGAAGATCA 60 

CTTCGTTGTC ATTGTTTCAG GTGCCTTTGG TAACCGATTT AAACAAATTG CACAAACTTA 120 

T T AC AAAAAT GTGCATATTT ATGACGTAAC ATGGGGAGAA GCTGTAGATG TCAAAGATTT 180 

CATCAATTTC CTTTCAACTT TAAATGTTGA AGTTAAAGCA GTATTTAGTC AATATTGCGA 24 0 

AACATCTACG ACAGTGCTAC ACCCTATTCA CGAGTTAGGA AATGCCATTA ATCAATTTAA 300 

TAGTAATATT TATTTTGTAG TTGACGGCGT AAGTtGCATT GGTGCTGTTG AT G T T G AC AT 3 60 

TAACAAAGAT AAAATTGATG TACTTGTTTC TGGTAGTCAA AAGCAATTA TGTTACCTCC 4 20 

AGGATTAGCT TTTGTAGCTT ATAGCCACCG TGCAAAAGAA CATTTCAAAG AAGTAACTAC 4 80 

GCCAAAATTT TATCTAGACT TAAATAAATA CATTTCGTCA CAAGCTGACA ATTCTACACC 54 0 

GTTCACACCA AATGTGTCTT TATTTAGAGG TGTAAATGCA TACGTTGAAA CCGTAAAAGC 600 

AGAAGGTTTC AATCACGTAA TAGCACGACA CTATGCAATT AGAAATGCAT TAAGAAGCGC 660 

CTTAAAAGCA TTAGATTTAA CTTTATTAGT C AAT GAT AAA GATGCATCTC CAACGGTTAC 720 

AGCATTCAAA CCTAATACAA ATGATGAAGT GAAAATAATC itiAAG AT G AAC TTAAAAATnG 7 80 

CTTTAAAATA ACAATTGCnG GTGGTCAAGG CCATCTTAAA GGTCAAATTT TnAGAATTGG 84 0 

TCATATGGGG AAAATTAGTC CTTTCGATAT TTTATCGGTA GTATCTGCTT TAGAAATTAT 900 

TTTAACTGAA CACCGTAAAG TTAACTATAT CGGTAAAGGT AT AT C AAAAT ATATGGAGGT 960 

TATTCATGAA GCAATTTAAT GTACTCGTTG CAGATCCCAT ATCAAAAGAT GGTATCAAAG 1020

CAT TAT TAG A T C AC G AAC AA TTCAATGTAG ATATTCAAAC TGGCTTGTCC GAAGAAGCAT 108 0 

TAATC AAAAT TATACCTTCA TACCATGCTT TAATCGTTCG TAGTCAAACT ACGGTTACTG 114 0 

AAAAT ATCAT AAATGCTGCT GATTCTTTAA AAGTAATCGC ACGCGCCGGT GTTGGTGTAG 1200 

AT AAT ATT AA TATTGATGCT GCAACATTAA AAGGTATTTT AGTTATTAAT GCCCCAGATG 12 60 

GTAATACGAT TTCAGCTACT GAACATACAC TGGCAATGTT ATTATCAATG GCACGAAATA 1320 

TTCCGCAAGC ACACCAATCA CTTACAAATA AAGAATGGAA TCGAAATGCA TTTAAAGGTA 1380 

CTGAGCTTTA T C AT AAAACA TTAGGTGTCA TTGGTGCTGG TAGAATTGGT TTAGGTGTTG 14 4 0 

CTAAACGTGC GCAAAGTTTC GGAATGAAAA TACTAGCTTT TGACCCTTAC TTAACGGATG 1500 

AAAAAGCAAA ATCTTTAAGC ATTACGAAGG CAACAGTTGA TGAGATTGCC CAACATTCTG 15 60 

ATTTCGTTAC ATTACATACA CCACTAACAC CTAAAACAAA AGGCTTAATT AATGCTBCT 1620 

TTTTTGCCAA AGCAAAACCT AGTTTGCAAA TAATCAATGT GGCACGTGGT GGTATTATTG 168 0 



ATGAAAAGGC GCTAATAAAA GCATTAGACG AAGGACAAAT TAGTCGGGCA GCTATCGATG 17 4 0 

TGTTTGAACA TGAACCTGCA ACTGACTCGC CTCTTGTTGC ACATGATAAA ATTATTGTTA 1800 

CACCTCATTT GGGTGCTTCA ACAGTCGAAG CTCAAGAAAA AGTGGCAATT TCTGTTTCAA 18 60 

ATGAAAT CAT CGAAATTTTA ATTGATGGTA CTGTAACGCA TGCAgTGAAT GCACCTAAAA 192 0 

TGGACTTAAG CAATATAGAT GATACTGTAA AATCATTCAT CAATTTAAGC CAA 1973
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

GGTGTTTCAG ATGTCACTGG TTGATTTTTA ATTGT&ACG GGTATTTTGG GCTTTCGCCA 60 

TATTTATTTG CCGGCTTACT GTCAAAGCAT AGGAATACTA TCATAACAAT TGTTAGGCCT 120 

AAATAAACAA AATAAAGAAG T AC T AAC AAA AT AT T AAGAC CCATCGGCAT TAATGTAAAA 180 

TCACTGTCAT AATAACTATC GATAATCTGT AATACTATAT AAAATATAAT ACTGAATACT 240 

GTCATAATCA TTGGAAATAA CATTGTTCTT GATATATCGT GAAATCTTCG AACGCACAAC 300 

GCTAAATTTG GAATAAACGT TGCCAAACTA TAGACAAAAG TATACACAGA TGTAAGGATA 3 60 

AT CAT C AAT A TACTCATAAC TATTAATGTT TCGTTATCCG CCGCTATAGA AATAAAGAAT 4 20 

AGAAATAGGT TTATTATTAG CACACACACA GCTGGAACCA TAAGTATCAA ATGCCATAGT 480 

GCCATATACC AATATTCACT ACGTCTTGAT CTCCCCTTAA AATTTACATA ATTTTTCCAA 54 0 

AATAAAACGA ATGATTTCAT AAAACCTACT TGAGGTAATT GTTCCATTGT AATCTCCCTT 60 0 

TCGTTAATCA TATTTATATT TTTAATTATT GTTACCGTTA TAfflTTACAA GATTCATTAT 6 60 

CAAGCCACTT CATTTCAATC GTCAATCACT TTGGAAAATG AACTTTATAT TTCATCACTA 720 

GTAAAATGAA AACCCGCTAC AAGTACACAT CTATATGGAG ACTCATTTGA AAGTCAACGC 780 

TTCGTTAACT AT AC T AAAAA TATGTCATAC TGCAATGTTC ACGTTTAAAA GAGTCTCAAT 840 

CTATGCAAAT AAAATATTCC ATAACAAAGT ATATACTTTA CATTTTTATA ATTCTTAACA 900 

ATACTATTTT ATCAAACATT TACCACAATA AAAATATCTT TTTCATTTTT ATTTAAATTA 960 

ATCATATAAT TGCGAGGAGA ATATTATGGA TTTCGTTAAT AAT GAT AC AA GACAAATTGC 102 0 

TAAAAACTTA TTAGGTGTCA AAGTGVTTTA TCAGGATACC ACTCAAACGT ATACAGGCTA 108 0 

CATCGTGGAA ACGGAAGCTT ACTTAGGTTT GAATGATCGT GCGGCTCATG GCTATGGCGG 114 0 

TAAAATAACA CCTAAAGTCA CGTCATTATA TAAACGTGGT GGTACAATTT ATGCACATGT 1200 
CATGCATACG CATTTACTCA TTAATTTTGT AACAAAATCT GAAGGTATAC CTGAAGGCGT 1260

ACTTATCCGC GCAATTGAAC CAGAAGAAGG TTTATCCGCT ATGTTCCGTA ACAGAGGTAA 132 0 

GAAAGGCTAC GAGGTAACGA ATGGCCCAGG AAAATGGACT AAGGCATTTA ACATTCCACG 138 0 

GGCTATCGAT GGCGCTACGT TAAATGACTG TAGATTGTCT ATTGATACTA AGAATCGTAA 14 4 0 

ATATCCTAAA GATATTATTG CTAGTCCACG AATCGGTATT CCAAATAAAG GTGATTGGAC 15 00 

ACATAAATCT TTACGTTACA CAGTGAAAGG TAATCCATTT GTGTCTCGCA TGCGTAAATC 15 60 

AGATTGTATG TTTCCCGAAG ATACTTGGAA ATAAATGCCA TCTTTCATTG ATTACTATCA 1620 

TGAAAATGAA ATCTATCTCC TTATAAGTCA ATJAATCGTG CCGTCAACAT GCGGATGGGT 168 0 

TGATTGTTTT TCTTTGTATC CATCATATTT TTTGATTCAT CTCCTCTTAT TGAACTTGTT 17 4 0 

CTTAATTATA AAATATAACA ATAGAATTAT TTATAATTAT TAAATTTAGA TGCAT TAATA 1800 

TTATTGATAT TATTTTCAAA AACTAGAAAT ATTGATTTGT TGCATGTATA ATGTTAAS4G 18 60 

CGCCCTTTTA TAACGCTTAC ATATAAAAGC TTATTTAGGG AGAGGGATAT TCAACAAGGG 192 0 

GGATTTGAAA AT G AT AG AAC TTAATGCAAT TACAACATTA TGTTTAGCTT GTATCCTTTA 1980 

TTTACTTGGT AAGGCTATCG TTAATCACGT TAATTTTTTA AAACGTATTT GTATACCAGC 2 04 0 

AC CAGTGATT GGCGGCTTAA TCTTTGCTAT TTTAGTTGCG GCTTTGGATT CATTTGGCAT 2100 

GGTTAAGATT AAATTAGATG CTTCATTCAT TCAAGATTTC TTCATGTTAG CATTCTTTAC 2160 

GACAATCGGT CTTGGTGCAT CATTGAAATT ATTTAAATTA GGTGGCAAAG TCTTGCTATT 2220 

ATACTTTATG TTTTGTGCTA TCATTTCAGT CATTCAAAAC ATAGTTGGTG TATCACTAGC 2280 

AAAAGTATTA AATATTAAAC CTTTGTTAGG ATTAACAGCA GGTTCCATGT CTATGGAAGG 2 34 0 

CGGTCATGGT AATGCTGCTG CTTATGGTAA GACAATTCAA GATTTAGGTA TTGATTCGGC 24 00 

ACTGACAGCG GCTCTTGCAG CTGCAACTTT AGGTCTTGTA TTTGGAGGGC TTATCGGTGG 2460

TCCAGTTGTA AAATTTTTAA TCAAACGTTA TAACTTAAAG CCTCAACATA GTGACGATAC 2 52 0 

ATTTAAAGAT TATAGCCAAG TAGCATATAA CGAACATTTA CATAGTAAAT TTAATGCCAC 2 58 0 

TGAAGTATTC TTCATTCAAT TTACAATCGT TGTATTCTGT ATGGCAGTTG GAAGTTATTT 2 64 0 

CAGTCATTTG TTTACAGCTC MACAGGGAT TAATGTTCCA ATTTACGTTG GCTCATTATT 27 00 

TGTAGCTGTT ATTGTCCGAA ATATCTCTGA AAGTTTTAAT TTTAATATTG TAGATTTAAA 27 60 

AATTACTAAT CAAATTGGCG ATGTCGCATT AGGTATTTTC TTATCTCTTG CGCTAATGAG 2820 

CATTCAATTA ATCGAAATTT ATAAACTTGC TATACCTCTT ATTATTffiCG TTTTAGTTCA 2880 

AGTTGTCGTT ATGATTTTAT TTGCTGTTTT AATTTTATTT AG AG G T T T AG GAAAAGATTA 2 94 0 

TGATGCTGCA GTAATGGTAG GTGGTTTTAT CGGTCATGGG CTTGGTGCAc GCCAAATGCC 3000 
ATGGCAAATT TAGATGTTAT TACTAAAAAA TATGGAAACT CACCTAAAGC ATATTTAGTT 3060

GTACCTATTG TTGGTGCATT CTTAATCGAT TTAATTGGTG TTATAGTCAT TATGGGATTC 312 0 

ATACAATGGT TTAGTTAAAC ACCAAACTCA TAAATAAAAG AGGAGGCCTT CGCCTCcTcT 3180 

TTTATTTATC CTCGATGTAT ATTCAAGTTA CGTTGTTCTA TCCATGACAA TATTTCCGGA 324 0 

CTAAATACGA TTTGTTTTTG TGTTAAGTCE TCAATATTTT TAGCATCTAA CATCGTCATT 33 0 0 

ATTGATTTCA TGTGTTCAAT AAATGATTCT ACATAAGCTA CTGTATGTGC AATGCCATTA 3360 

TTTTCAACTT GATTTAAAAA CGGACGTGAC ATACCAGTTG CCTTTGCACC AAGTGCTAAA 3420 

CTTTTAATTG CATCGAGTGG TGTACGTAAA CCACCACTCG CGAAAACTGA AATTCGCTT 34 8 0 

TGATAAGCCG TTGTTTCAAG TAATGACTCA ACTGTAGACT GTCCCCATGA TGATAAGTAA 35 4 0 

TCCATATCTT TATTTGCACG ACGTTCATTT TCAATATCTA CAAAGT TAGT ACCACCTTTG 3600 

CCACTAACAT CGACATACTT GACGCCTATT TGTTGTAAGT CATGCATTAA TTCTTTGCTC 3660 

ATACCAAATC CAACTTCTTT TATAATGACT G G AAC AG AC A CTCGTGATAC AATCGACGCT 3720 

ATATTATCTA ACCAAGTCAC AAATTCACGA TTCCCTTCAG GCATAACTAA TTCTTGAGGA 3780 

GAATTAACAT GGATTTGTAA CGCTTGTGCC TCAAGTAATT CAACTGCTTC CAAAGCCTTT 38 4 0 

TCTACTGGTA CGTCCGCACC AACATTGCTA AAAAT(ATGC CTTCAGGATT CATTTTTCGC 3 900 

GCAATCGTAA ACGTCTCAGC CATGCGTGGA TTTCTCAATG CCGCATGTGT T GAT C C AAC T 3 9 60 

GCCATCGCTA AGCCAGTTTC TCTTGCAACT ACAGCTAGCT TTTCATTGAT GTTTTTCGTC 4020 

CACTCGCTAC CACCCGTCAT TGCATTAATA TAAACCGGAT ATGCCATCGT TAAGTCAGGC 4 08 0 

GTCTGTGATG TCAAATCGAT ATCATTTACA TTAATTGATG GGATAGAATG ATGCACAAAA 414 0 

CGCATCTTAT CAAAATCTGA ATGCATTGCG TCAGATTGGG CCATTGCTAT T T C AAC AT G T 4200 

TCATTTTTTC TCTGTTCTCT TTGAAAATCA CTCATGATTA AACCTACCTT TTCGTCATTT 4260 

CTAATTTAAA ACCATTTTTT ACGTCTGAAA AAGATGATTA ATACGACAGC TATAACAAAC 4 320 

ATTACAGCTA AGCAAATATA ATATCCATAA TGTAAATGTA ATGCCGGCAT AT T T AC AAAG 4 38 0 

TTCATACCAT AAATCCCAGC TATGAATGTT AACGGTGAAA ATATAACTGA TACTAATGTC 4440 

AGTACTTGCA TAATACTATT CATTCTAAAT GACGTGTATG ACCAAAATT TTCTCGTATT 4 500 

TCGTTTGTCA TTTCTTGAGC AGTACGAATG ATATTACGTT GCTTAATCAA GTGGTCATCG 4 5 60 

ATATGTTGAA TGTATAGCGA ATGTTTATTA TCTATAATCA AATCACCATT TTGTTTCATT 4 62 0 

GTATCAATTA GCTCTTGCAT AGGAAACAGT ACACGTTTTA CTTTAATCAA ATCCGAACGT 4 68 0 

AACTTAAAGA CACTATCCAT GACCATTTTA TTAAAGCGAT CATCTACATG GCGGTCTTCA 4 74 0 

AAAT GAT AAA CACTATCTTC AAGTGCATAT ACAAAGTTGA AATATTTATC AACCATCATA 4 8 00 

TCTAAAATTA AT AT G AC G AC ATCTGCACAA TCTAATTCTG CATCTAATGT ATTCATATAC 4 8 60 
TTATAGACTA CTTTATTTAA TGATTICAAC GTTTGATGAT GATATGTTAC TAATACATTG 4920

TCTTGTATAA AAATATTTAG TGCTATTGGT GAATAGTTTG ACCCCATAAT ACTATGGAAT 4 980 

ACTAAGTATT GATAATCTTT ATAAGATTTA TATTTAGCTC GTGGCATACC GTTAATTGCA 5040 

TCATCCACTT CTAAATCATT AAAATTAAAA TGTGCTTTAA ACCATTCATT TTCTTGTTCA 5100

TTCGGTTCAT CAAAATCATA CCAAACAATA GTCGCATCTT TTGGTATCTC TTTGATATCA 5160

TCAACTACTT TAAACGGTTC ATATGTAGTT TGATACCGTA TCTTTAAAGC CATCGATACT 5220 

CCCCCTAAAT AACGAATTCT CTATTATTTT ATCATGAATT AAATAACGTG TATGTCTTAA 5280 

TTTATTTTAG TATGATAGTC ACTAAGGAGA TGGTTATTAT CAAACAACTT TTTACACATA 5340

CTCAAACCGT AACATCTGAA TTCATTGACC ATAACAATCA TATGCATGAT GCAAATTATA 5400

ATATCATTTT TAGTGACGTC GTGAATCGTT TTAATTACAG CCACGGTCTT TCTTTAAAAG 5460

AACGCGAAAA TTTAGCATAT ACGCTATTTA CKTAGAAGA ACATACGACA TACCTCTCAG 5520

AATTGTCTCT TGGCGATGTA TTTACTGTTA CTTTATATAT TTATGATTAC GATTATAAGC 5580

GGTTGCATTT ATTTTTAACA TTAACTAAAG AAGATGGTAC ACTAGCATCA ACAAATGAAG 5640

TAATGATGAT GGGAATTAAT CAGCACACAC GTCGTTCTGA TGCTTTTCCT GAATCATirT 5700 

CAACACAAAT AGCACACTAT TATAAAAATC AATCAACTAT CACTTGGCCT GAACAATTAG 5760

GACATAAAAT AGCAATTCCA CACAAAGGAG CATTAAAATG ACAGATGCAT TACAACAAAA 5820

GATTCATATC GAATTACTAG ATTTATTAGA TGATGTTAAG TTTGAATTAA CAGAATTAAA 5880

TGCACAAAAA GGGTTATACA TTAACGGACC AGCAAATCAG CTACTTAAGC GTGGCGTGCA 5940

TATGGCTTAT GTTCAAGGAC AAAAGCAAGC CATCGATAAT ATTATGACTA TTGTGGAACA 6000

ACAGCTTGAA AGATCAACAT TTCCTAGAAC ATTATGATAA ATTTCAAAAT GAGGTTGCTC 6060

ATCGCAATTA TGATAAAACA GCTAACTTTG CAGAACTTAG TGATATTCCA CGTCAATTCG 6120

ATAATTTTTT AGATCAATTT TATCAAATTA AAGGGCAATA CTTTATCATC ACACATATCA 6180

ATACACTTAT TGGTGATTTT CACTCAGAAG CTCATTAACA ATTAGTCTAT ATAACCCTTG 6240

CTATATTTTC AAAAACAAAA CCCAATTACG TTTTCATGTC AAATATCATC TTGCATGAAA 6300

TCGTAACTGG GTCATTTATA TGTTATTAGT TATTTTGTGT TACATCCTCA TCTATCGATT 6360 

TGGCAATTTG TTTAATAGCT TTATGTGATT GTCTAATTGG ATAAATTGGA AAATCATGTA 6420 

CCATCTTAGG ATAATCATAA AACTCAATGT ATTGATGATG TTGCAACATC ATTTGTTCAA 6480

ATAGCTTCAT ATCAGGATGT GTCATTTCAC GTCCACCACC AAACATATAA ACTGGTGGCA 6540

ATCCTTCTAT TGTGCCATTA ATTGGCGATA TGCGCTTATC TGTTAATGGT AGGCCATTCG 6600 

CCCATTTTTT CATAATCTCA TTGACACCAA ACTGACTTAG aACCGCATCT TGTTCGATTA 6660

AGGCGTCCGA AATATCTTTA TTAGATAGTG TTGCATCTAA AATTGGGAG ATTAAATACA 6720

ATTTATTCGG TAATGGCTGT TGATTAkCTA AAAGAGATTG TACAAAGGAT AATGCCAGTG 6780

CACCACCTGA ACCATCACCC ATGACTACGA CATTTTGATG TCCTACTTCA GATACTAATT 6840

GaTCATAAAC ACGTTGTATC GCTTGGnAAA GTATCGTCaA TATGnAAACT CTGGTGTCTT 6900 

TGGATAGATA GGCAGTACAA CCTCATATAA TGtACTTAAA GTGATTTTAT CCCAACAATC 6960 

TCCAATGGAA CGGTGATGGT TGTAGTGCAT TGAATCCACC GTGAATATAT AAAATTTTCT 7020

TATCAATTTG ATGTCTGAAA TTAAAGCGAA AGACTTGCAT ATCATCTAAT GACAATTTTT 7080

CTAAATTTGC TTTAACATTT AATGTTGA8S GCTGCTTATG TTTTTTTCTA TTTTCAATTT 7140 

CTCTTTTATA AAAAAATCTT TCAACATCTT GATCATTTTT AAACATAATC GAGCGATTGT 7200 

GAAGCAAATA TTTATTGACA ACGCTATTCA TAACACGGTT TCTAATCAAT GTCTTAACCT 7260

ACCTTTATAT ATTTTATGTA TCCAATGATk GTCTATCCCC TACATTCTTT GCCAAAAAA 7320

GTATATAATG TAGAAGATAT TTTCTTTTTC ACTTTCAAAT TTAAGACTAC AATTGAACAG 7380 

TGATTTTTCA TCATTATAAC AGACAACTAG ACATATTGAT AAGTAAAGAA AAGAACTTTA 7440

TACGGAGGTA CCTTGCATGA CAAATCCAAA TCAACGATTA GAACCATTTG ATGAGACATT 7500

TCAACAACCG AATATTCATC GTGGTAAGCG ATATGGTAAG AAAAAACGTT CATTGGTAAG 7560

CATGATTATT CAAATCATTG TTGTwATATT AACCACCATC GCTGGAATAC AGCATGGTGG 7620
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9834 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

GTCATtACCG amTTTCtTAG AaTCATTTAA AGATGATAAA TATACAAACG TTGGTAATTT 60

AAAAGAAGTG AATTTTGATA AAATTGCTGC GASSAAACCC GAAGTAATCT TTATCTCTGG 120 

ACGTACAGCT AATCAAAAGA ATTTAGATGA ATTCAAAAAA GCTGCACCTA AAGCGAAAAT 180

TGTTTATGTT GGTGCAGATG AAAAGAACTT AATTGGTTCA ATGAAACAAA ACACTGAAAA 240

TATCGGAAAA ATTTACGATA AAGAAGATAA AGCTAAAGAA TTAAATAAAG ATTTAGAm 300 

CAAAATTGCT TCAATGAAAG ATAAAACGAA AAACTTCAAT AAAACTGTTA TGTATTTACT 360 

AGTTAACGAA GGTGAATTAT CAACATTTGG ACCTAAAGGT CGTTTTGGTG GATTAGTTTA 420

CGATACATTA GGATTCAATG CAGTTGATAA AAAAGTAAGT AATAGCAATC ATGGACAAAA 480

TGTTTCTAAC GAATATGTTA ATAAAGAAAA TCCAGATGTT ATTTTAGCGA TGGATAGAGG 540

TCAAGCGATA AGTGGTAAAT CAACTGCGAA ACAAGCATTA AATAATCCTG TATTAAAAAA 600

TGTTAAAGCA ATTAAAGAAG ACAAAGTATA TAATTTAGAT CCTAAATTAT GGTACTTTGC 660

AGCTGGATCA ACTACAACTA CAATTAAACA AATTGAGGAA CTTGATAAAG TTGTAAAATA 720

ATTTTAAAAG AGGGGAACAA TGGTTAAAGG TCTTAATCAT TGCTCCCCTC TTTTCTTTAA 780 

AAAAGGAAAT CTGGGACGTC AATCAATGTC CTAGACTCTA AAATGTTCTG TTGTCAGTCG 840

TTGGTTGAAT GAACATGTAC TTGTAACAAG TTCATTTCAA TACTAGTGGG CTCCAAACAT 900

AGAGAAATTT GATTTTCAAT TTCTACTGAC AATGCAAGTT GGCGGGGCCC AAACATAGAG 960

AATTTCAAAA AGGAATTCTA CAGAAGTGGT GCTTTATCAT GTCTGACCCA CTCCCTATAA 1020

TGTTTTGACT ATGTTGTTTA AATTTCAAAA TAAATATGAT AGTGATATTT ACAGCGATTG 1080 

TTAAACCGAG ATTGGCAATT TGSACAACGC TCTACCATCA TATATTCATT GATTGTTAAT 1140

TCGTGTTTGC ATACACCGCA TAAGATTGCT TTTTCGTTAA ATGAAGGCTC AGACCAACGC 1200 

TTAATGGCGT GCTTTTCAAA CTCATTATGG CACTTATAGC ATGGATAGTA TTTATTACAA 1260

CATTTAAATT TAATAGCAAT AATATCTTCT TCGGTAAAAT AATGGCGEA scgTGTTTCA 1320 

GTATCGATTA ATGAACCATA AACTTTAGGC ATAGACAAAG CTCCTTAACT TACGATTCCT 1380 

TTGGATGTTC ACCAATAATG CGAACTTCAC GATTTAATTC AATGCCAAAT TTTTCTTTGA 1440

CGGTCTTTTG TACATAATGA ATAAGGTTTT CATAATCTGT AGCAGTTCCA TTGTCTACAT 1500 

TTACCATAAA ACCAGCGTGT TTGGTTGAAA CTTCAACGCC GCCAATACGG TGACCTTGCA 1560 

AATTAGAATC TTGTATCAAT TTACCTGCAA AATGACCAGG CGGTCTTTGG AATACACTAC 1620

CACATGAAGG ATACTCTAAA GGTTGTTTAG ATTCTCTACG TTCTGTTAAA TCATCCATTT 1680 

TAGCTTGTAT TTCAGTCATT TTACCAGGAG CTAAAGTAAA TGCAGCTTCT AATACAACTA 1740

AGTGTTCTTT TTGAATAATG CTATTACGAT AATCTAACTC TAATTCTTTT GTTGTAAGTT 1800 

TAATTAACGA GCCTTGTTCG TTTACGCAAA GCGCATAGTC TATACAATCT TTAACTTCGC 1860

CACCATAAGC GCCAGCATTC ATATACACTG CACCACCAAT TGAACCTGGA ATACSCATG 1920 

CAAATTCAAG GCCAGTAAGT GCGTAATCAC GAGCAACACG TGAGACATCA ATAATTGCAG 1980

CGCCGCTACC GGCTATTATC GCATCATCAG ATACTTCGAT ATGATCTAGT GATAATAAAC 2040

TAATTACAAT ACCGCGAATA CCACCTTCAC GGATAATAAT ATTTGAGCCA TTTCCTAAAT 2100

ATGTAACAGG AATCTCATTT TGaTAGGCAT ATTTAACAAC TGCTTGTACT TCTTCATTTT 2160 

TAGTAGGGGT AATGTAAAAG TCGGCATTAC CACCTGTTTT AGTATAAGTG TATCGTTTTA 2220

AAGGTTCATC AACTTTAATT TTTTCATTTG GGATAAGTTG TTGTAAAGCT TGATAGATGT 2280 

CTTTATTTAT CACTTCTCAG TACATCCTTT CTCATGETT TAATATCATA TAGTATTATA 2340 

CCAATTTTAA AATTCATTTG CGAAAATTGA AAAGAAAGTA TTAGAATTAG TATAATTATA 2400

AAATACGGCA TTATTGTCGT TATAAGTATT TTTTACATAG TTTTTCAAAG TATTGTTGCT 2460

TTTGCATCTC ATATTGTCTA ATTGTTAAGC TATGTTGCAA TATTTGGTGT TTTTTTGTAT 2520 

TGAATTGCAA AGCAATATCA TCATTAGTTG ATAAGAGGTA ATCAAGTGCA AGATAAGATT 2580

CAAATGTTTG GGTATTCATT TGAATGATAT GTAGACGCAC CTGTTGTTTT AGTTCATGAA 2640

AATTGTTAAA CTTCGCCATC ATAACTTTCT TAGTATATTT ATGATGCAAA CGATAAAACC 2700

CTACATAATT TAAGCGTTTT TCATCTAAGG ATGTAATATC ATGCAAATTT TCTACACCTA 2760

CTAAAATATC TAAAATTGGC TCTGTTGAAT ATTTAAAATG aTGctACCGC CAATATGTTT 2820

TGTATATTTT ACTGGGCTGT CTAAGAGGTT GAATAATAAT GATTCAATTT CAGTGTATTG 2880

TGATTGAAAA CAATTAGTTA AATCACTATT AATGAATGGT TGAEATTTG AATACATGAT 2940

AAACTcCTTT GATATTGAAA ATTAATTTAA TCACGATAAA GTCTGGAATA CTATAACATA 3000

ATTCATTTTC ATAATAAACA TGTTTTTGTA TAATGAATCT GTTAAGGAGT GCAATCATGA 3060 

AAAAAATTGT TATTATCGCT GTTTTAGCGA TTTTATTTGT AGTAATAAGT GCTTGTGGTA 3120

ATAAAGAAAA AGAGGCACAA CATCAATTTA CTAAGCAATT TAAAGATGTT GAGCAAAAAC 3180

AAAAAGAATT ACAACATGTC ATGGATAATA TACATTTGAA AGAAATTGAT CATCTAAGTA 3240

AAACTGATAC AACTGATAAA AATAGTAAAG AATTTAAGGC ACTACAAGAA GATGTTAAAA 3300

ACCATCTCAT ACCTAAATTT GAAGCATATT ATAAGTCAGC AAAAAATTTG CCTGATGATA 3360 

CAATGAAAGT TAAGAAATTA AAAAAAGAAT ATATGACGCT TGCAAATGAG AAGAAGGATG 3420

CGATATATCA ATTAAAAAAA TTCATAGGTT TATGTAATCA ATCTATCAAG TATAACGAAG 3480

ACATTTTAGA TTATACGAAA CAATTTGAAA AAAATAGATA CAAAGTTGAA TAGAAATTA 3540 

AATTAGCTGA TAATAAAAGT GAAGCAACTA ATCTTACGAC AAAATTAGAA CATAATAATA 3600

AAGCGTTAAG AGATACTGCG AAGAAGAACC TAGATGATAG TAAAGAAAAT GAAGTAAAAG 3660

GCGCGATTAA AAATCACATT ATGCCAATGA TTGAAAAGCA AATTACCGAT ATTAACCAAA 3720

CTAATATTAG TGATAAGCAT GTTAATAATG CAAGGAAAAA CGCAATAGAA ATGTATTACA 3780

GTCTGCAGAA CTATTATAAT ACACGTATTG AAACAATAAA GGTTAGTGAG AAGTTATCAm 3840

AAGTCGATGT AGATAAGTTG CCGAAAAAGG GTATAGATAT AACTCACGGC GATAAAGCCT 3900

TTGAAAAAAA GCTTGAAAAA TTAGAAGAAA AAAACTATA ATCATTTTTC AAAGTTAAAA 3960

ATTTTGAATT TATGGTTAAC ATGTCAACTT ACTATGTGTA TAATGGTAAA CATTGATATT 4020

AACTATATGT ATAAAAATGT CACGCAGATG CTATTTAAAT GTGATAAATA TTTTTAGAGG 4080

TGAATAGAGT GGCTATAAAG CTAAGTTCAA TTGACCAATT TGAACAGGTT ATTGAGGA& 4140

ATAAATATGT TTTTGTATTA AAACATAGTG AAACTTGTCC AATATCGGCA AATGCGTACG 4200 



ATCAATTTAA TAAATTTTTA TATGAACGCG ATATGGACGG TTATTATTTG ATTGTCCAAC 4260

AAGAACGCGA TTTGTCAGAT TATATTGCTA AAAAAACGAA CGTTAAACAT GAATCACCTC 4320

AAGCATTTTA TTTTGTAAAT GGTGAAATGG TTTGGAATCG AGACCACGGT GATATCAATG 4380

TGTGGTCATT AGCACAAGCA GAAGAATAAT GAAACTATAG GGTTGGAACA TTTTGCCTTA 4440

CACTACTAGA CGTGAATAGC ACAACTTAAA TTCGTGTGAA TCAGAGTAGT TTGGCTATAA 4500

TGATGTTCTG ACCTTTTATT TTATGTCACC TTTAGAAGCA GTTAAGTTAG TACTTTTTTA 4560

CAAACATATG TATAATATAT TCGAGTATTT TTATTGAAAa tATTTTGGAA AACGACGAAT 4620

CCAATAAGAA AATTTAAACA TGATTTGTAA GTTAGTTTAA TAGGAAATAT ATGCTAAACC 4680

AAAAGAAGCA TATTGTTATT TACTGGAATA ATTAATAATC ATGTCATGTT AAATGTTAGC 4740

ATATAATCAC GAGATAAAAT CTAAAATTTA AGATTAATCT TTTATGAATA AAAAACGTAT 4800

CACAACAAAT AATAAAGTAA GGTGGTCAAG GTTATGAAAG TATTAGTAGC CATGGATGAG 4860

TTTCATGGAA TTATTTCAAG TTATCAAGCT AATAGATATG TTGAAGAGGC AGTTGCAAGC 4920

CAAATTGAAA CTGCAGATGT AGTTCAAGTA CCATTGTTTA ATGGAAGACA TGAATTATTA 4980

GATTCTGTAT TTTTATGGcm ATCTGGGcaA AAGTATCGTA TACCAGTACA TGATGCAGAT 5040

ATGAATGAAG TTGAAGGTGT TTACGGACAA ACTGATACAG GGATGACCGT TATCGAGGGG 5100 

AATTTATTTT TAAAAGGTAA AAAACCAATT GTTGAACGAA CAAGTTACBG TTTAGGAGAA 5160 

ATGATTAAAC ATGCATTAGA TAACGACGCA AAACATGTTG TAATTTCACT AGGTGGGATT 5220 

GATAGTTTTG ATGCTGGTGC AGGTATGTTA CAAGCATTAG GTGCTCAATT CTATGATGAC 5280

GAAGGGCGTG TCGTAGATAT GAGACAAGGT GCTGGTGTAA TTAAATATAT TCGTCGTATG 5340

GATATGTCGA ACTTACACCC TAAAATGGAA ACAGCAAGAA TTCAAGTAAT GTCGGATTTT 5400

TCAAGTCGAT TATATGGTAA GCAAAGTGAA ATCATGCAAA CTTATGATGC GCATCAGTTG 5460

AATCATAATC AAGCAGCAGA AATCGATAAT TTAATTTGGT ATTTTAGTGA GTTATTTAAA 5520 

AGTGAATTGA AAATTGCAAT TGGTCCAGTT GAACGTGGTG GTGCTGGTGG TGGAATTGCA 5580 

GCAGTCTTGA ATGGACTGTA TCAAGCTGAA ATATTAACCA GTCATGCATT AGTAGACCAA 5640

CTAACACATT TAGAAAATTT AGTTGAACAA GCGGATTTAA TTATTTTTGG AGAAGGATTA 5700

AATGAAAATG ATCAGTTGCT AGAAACGACA ACATTGCGTA TTGCAGAACT TTGTSTAAA 5760

CATCAAAAGG TTGCCATTGC AATTTGTGCA ACTGCTGAAA AGTTTGATTT ATTTGAATCA 5820 

CAAGGGGTTA CAGCAATGTT TAATACATTT ATCGATATGC CAGAAACTTA TACTGACTTT 5880 

AAAATGGGtT ACAAATTAGG CATTATACGG TTCAGTCTTT AAAACTGTTG AAAACACATT 5940

TTAATGTTGA GGTTTAGTAA AGAAGGACTA AATTGGTGAT GCTGTCATGA TGGTTAATAA 6000

CATTTATGAT GGTTAGCAAA ACGAATTAGA AGATCGAAAG TATACGTAAA AAATATGAAA 6060

AATCACGCTA TCATTGCACT GAATGTTAGC GTGATTTTTA TATATTAATT AAGCCTGAGT 6120

TGAACTAGTA TATAATCGTT GGTTTTTAGT GATTTT&GC GATATCTTCT ACAATTCCAA 6180

TGATTACTTG TACTGCTTTT TCCaTAACAT CAATGGATGC aTATTCATAT GGGCCGTGGA 6240

AGTTACCGCA ACCTGTAAAG ATGTTTGGAG TTGGTAACCC CATAAATGAC AATTGTGAAC 6300

CATCTGTACC ACCGCGAATA GGTTCAGTGT TTGCTGGAAT ATCTAATTTG GCAAAGACAC 6360

GTTTAGGTAT ATCAATAATA TGAGGCAATG GTAATATTTT TTCTGCCATA TTGAAATATT 6420

GATCCGATAT ATCAACTTTA ACTGGATAAT TTTCAAAATG GGCATTGATA TCGTCACGTA 6480

TTTCTAAAAT ACGTTTCTTA CGCAATTCGA ATTGTTTTTT ATCATGATCA CGAATAATGT 6540

ATTGCAAAGT TGCTTTTTCA ACAGTTCCTT CAAAGTTCAT TAAGTGATAA AAGCCTTCGT 6600 

ATCCTTCTGT TCGCTCCGGA ACTTCACTAT CAGGTAGCAA ACTATCGAAT TGTTCACCTA 6660

AACGTATTGC GTTTACCATT GCATTTTTAG CTGAACCAGG ATGAACATTT ACACCGTGGC 6720

ATGTAATAAC CGCTTCAGCA GCGTTAAAGC TTTCATATTG TAATCTCCA TATTGACTAC 6780

CATCCATAGT ATAAGCAAAA TCAGCATTGA AGCGGTCAAC ATCAAATTTA TGTGGACCAC 6840

GACCGATTTC TTCGTCTGGT GTAAATCCAA TGCGAATGGT ACCATGTTTA ATTTCTGGAT 6900 

GTTCTTGTAA ATAACAAATA GCTTCCATAA TTTCCACAAT ACCCGCTTTA TCGTCTGCAC 6960 

CTAGTAACGA TGTACCATCA GTTACCATTA ATGTATGACC AACTAAACTG TTAAGTTCTG 7020

GAAATACTTT AGGATCTAAG ACACGTTTAG TATTGCCTAG TTTGTATGGC TTACCATCAT 7080

AGTTTTCAAT AATTTGCGGT TTAACATTTG AAGCATTGAA ATCAGGTGAT GTATCAACAT 7140

GCGCCAAAAA TCCAACTGTT GGGACGTCGA CATCGATGTT ACTTTCTAAT GTAGCAAATA 7200 

AGTAGCCATT TTCATCTAAA TCAGTTGGCA ATCCTAATTG TTGTAATTCT TTTTCTAATA 7260

AATGTAACAA ATCCCATTGC TTTTCAGTTG AAGGTGTTGT TGTAGATTTT GGATCAGATT 7320

GCGTATCAAT TGTCGTATAT CTTGTTAATC TATCTATCAA TTGGTTCTTC 2LTATATTCG 7380 

ACCCCTTAAA CTCTATTATT CATGTTGTAA GATTTTTTAT ATGTCTTACC TTTGATTTTA 7440

CCATACAGTT GTTTGATACG TGTGTATAGG TAATATAGAA TTTCAGAAAC TAATATACCG 7500

AAAGCAATCG CACCTGAAAT CAGTGTAcTT CTAAAAATGT ATTTACAGCA CTTGTATAAT 7560

CATTTGATAC TAAAAAACGA GTCGCTTGAT AAGCTGCACC ACCAGGTACT AATGGTATAA 7620

TGCCTGGCAC TATGAATATA ATTACCGGTC GTTTATATCT GCGACTCATA GTATGACTCA 7680

TTAAGCCTAA AATTAAGCTT CCCAAAAATG AAGCGCCAAC TTTTCCAAAC TCTAAATCTA 7740

CCGTTAATTG GTAAATCGTC CATGCAATGG C A(C C AC AAA TCCACATGCT ACTAAGAGGC 7800

GTTTGGGTGC ATTGAAAATG ATAGAGAAAA GTACTGTTGA TATAAAGCTG ATTGTAAAAT 7860

GAAATAAATA AAATAGCATG CTTTAACAGT CCTTCCTTAA ATGATTAATA AAACGATTGC 7920

GACACCAGCA CCGATTGCGA ATGCTGTTAA TGCAGCTTCA ACACCGCGAG ACATACCTG 7980

AAGTAATTCA CCCGCTAATA AATCTCGAAT GGCATTGGTA ATTAATATAC CAGGGACAAG 8040

TGGCATGACA CTGGCTATAG TAATGATATC TTGATTGGTT GCAATGCCTA ATTTAGTAAA 8100 

TGTGGCTGCA ATGGATATGA CCACAGCGGC TGCAACAAAC TCTGAGAAAA ATTTAATTTG 8160 

TATATAGCGT tGCACAAAGC TGAATGTTAA AAATGCGGAT CCGCCAGCAA TGACTGCAAT 8220 

CCAACAATCT GATGCGACAC CACCAAACAT AAATAGGAAG AAGCCACATG CAATGGCAGC 8280

TGCAAAGAAA TTCGTTAAAA AAGAATATTG TAATGATGCA TGCTGTAAAT GAATAAATTC 8340

AGATTTAGCT TCATCAATTG TGAGTTCTTT ATTTGATATT TTACGTGAAA GACTATTCGT 8400 

TAAAGCGATT TTCTCTAAAT CTGTTGTACG CTCTTGTACA CGAATTAATC TTGTACTTGT 8460

TCGATCGTTT AATGAAAAAA TAATTGCAGT TGAACTGACA AAACTATATG TATTATGAAG 8520 

ACCATAACTA TGTGCGATAC GGTTCATTGT ATCTTCAACT CGATATGTTT CAGCACCTGA 8580

TTCaAGTAAA ATTCTACCTG CAATTAATAC AACATCAATC ACTTTGTTTT CATCTATAAT 8640

TGTGATTGAA TCTGGCATAT CAATTCACCT CCAATGATAT GTGTTATTTA TTTGAACAAT 8700

TGaAGTTTAC AACTTGTTGT TACAACTTTC AATAGTGAGA CTTTGTGTTA GTATGATGAA 8760

CTTGTATGGT TCAAATTTAA ATAAGAAAAA CTGTTAATCT TTGCTATTAT ACTATGATTT 8820 

AATAATAGCA AAGGATTAAC AGTTTTGTCG TTGTTATAAA TTGATAATAG GGTTAAACAT 8880

TACTTTGTTT CGCCCTTGAT TTTTGGCTAC ATGCACCATA TCGTCTGCAT CTTTAAACAC 8940

TTTACGCTGT GATTTTGGAT CGTCATCTGT TAAATAACCA ACACCGAffiG ACACTGACAA 9000

TTTAATAACT TCTTTGTTTG GTAAATGGAA TGATGATTTT TCAACACCCG AACGAATATT 9060 

TTCAGCTAAT TTAACACTTT GATCAAGTGA ATAATTGTGA ATGACAACTG AGAACTCTTC 9120

GCCACCATTT CTAAAAATTT TAAATTGATT CGGCACATAG TTTTTAAGTA ATTGAGACAT 9180 

TTGTTTTAAT ACAGCATCAC CTGATTTGTG TGAGTAGGTA TCATTGaCAT CTTTAAATCC 9240

ATCGATATCG ATTAATAATA ATGCGATACT TTGATGTTCT TTTTCAGCTT TTCGTGAAAT 9300

TTCATTTAAA TGTCTATCAA ATTCTTTTAC ATTACCTAAG CCTGTTAAGT AATCATATTT 9360 

ATCTTCGTTT TCATAACGAT TTACGAGTGA GAAGAAATGC CAAATATCGA CAAATGTTAT 9420

CGCTGAAGCT AAAGTGATAA TTAATGAAAT TGGTATTAAA ATGATAACTT CCGATAGTGT 9480

GTAAATAGGA CTCACTAACG CGACACCAAA TAAAATGATT ATTGTAACAA CATTAAGTAT 9540

TAATAATGAT AGCACATCAT TTTGTTTTAA AAATGGTCCA ATAGCACTTG TTAOTCAGC 9600 

AATAACAATC AACGTAACAC CGTACATAAT CGAGTTGTTA AATACTACAA TTTCAACAAT 9660

TGCTACAATT ACTGTGGCAG ATAATGTATA GACCATATTT GTAAATCTAC CTAAAAACAA 9720

TAAAGGAACG AATGTTAAGT GAATTAAATA ATCTTCACGA TAAGGGATAG GGTAGACAGA 9780

TAATAATAAT GATACGATTG TCATTAAAAC AGTGACATAA GCCTTAGAAA AAAC 9834 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23439 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

TCTCAATCAG ATGAAAAATT GCATATCGTA GGTTTTACAG AAAGTGCAAA ATATAATGCG 60 

TCATCAGTCA TTTTCACGAA TGACGCTACC ATTGCCAAGA TCAATCCTAG ATTGACTGGA 120

GATAAAATTA ATGCAGTTGT TGTACGTGAT ACMATTGGA AAGACAAAAA ATTAAACCAA 180

GAGCTTGAAG CGGTAAGTAT TAATGACTTT ATTGAAAATT TACCAGGTTA TAAACCACAG 240

AACTTAACAT TAAACTTTAT GATTTCATTC TTATTTGTCA TTTCAGCTAC AGTTATAGGC 300 

ATTTTCCTAT ATGTCATGAC ATTACAAAAG ACGAGTTTAT TTGGCATATT AAAAGCTC&. 360

GGATTTACGA ATGGCTATTT GGCGAATGTG GTAATTTCGC AGACGGTCAT ATTAGCACTA 420 

TTTGGTACGG CATTTGGCTT ACTGTTAACA GGCGTTACAG GTGCATTTTT ACCTGATGCA 480

GTACCTGTCA AATTCGATGT ACTAACATTG CTCGTATTTG CAATTGTGTT AATGATTGTC 540

TCTGTATTAG GAAGTTTATT CTCCATTTTA ACAATTAGAA AAATAGATCC GTTAAAGGCG 600 

ATTGGGTAGG AGGTGTAGCA AATGTTGAAA TTTGAAAATG TAACAAAGTC ATTTAAAGAT 660 

GGGAATCGTA ACATTGAAGC GGTTAAAGAT ACAAATTTTG AGATAAATAA AGGTGATATT 720

ATAGCATTGG TTGGACCTTC TGGCTCTGGT AAAAGTACAT TTCTAACTAT GGCAGGTGCT 780

TTACAAACAC CGACATCTGG GCACATTTTA ATCAATAACC AAGATATTAC GACAATGAAG 840

CAAAAAGCAT TGGCAAAAGT TAGAATGTCT GAAATAGGTT TTATTTTACA AGCTACAAAC 900

CTTGTACCAT TTTTAACGGT AAAGCAACAA TTTACATTAT TGAAAAAGAA AAATAAGAAT 960

GTTATGTCTA ATGAAGACTA TCAGCAACTT ATGTCACAAT TAGGTCTAAC TTCATTGCTT 1020

AATAAGTTAC CTTCAGAAAT TTCAGGTGGT CAGAAACAAC GTGTGGCGAT AgCaAAGCGT 1080

TATATACGAA TCCGTCGATT ATTTTAGCGG ATGAACCTAC CGCGGCGTTA GATACTGAAA 1140

ATGCGATTGA AGTCATTAAA ATTCTACGTG ATCAAGCCAA ACAAAGAAAG AAAGCATGTA 1200

TTATTGTTAC ACATGATGAA CGACTTAAAG CATATTGTGA TCGTTCATAT CATATGAAAG 1260

ATGGCGTCCT TAATCTTGAA AATGAAACAG TAGAATAGTT TTATTAAGCC GGTACATCAT 1320

GTGCCGGTAT TTTTATGTTT ATGTATTATT TGAATAAACT TTCACATTA ATTAATAATA 1380

ATTATTATCG AAAATCAGAA ATATTCCGTG AAATATAATA TTTTTTGTAG TAAAATGGCC 1440

TCTAAGTATT CAATATTTAA ATATGGGGAT TGAATATAAA ATTATCGTAA TGGGGGTCAA 1500

TGGTTATGGA TTTATTGATA GGTACTTTAT TTTTATTTTT GGTCTTAGTG ATTTTTACAT 1560

TATTTACATA TAAAGCGCCT AATGGTATGC GTGCCATGGG AGCATTAGCT AATGCAGCAA 1620

TCGCAACATT TTTAGTGGAA GCATTTAATA AATATGTTGG TGGCGAAGTA TTCGGTATTA 1680 

AATTTTTAGA AGAGCTAGGA GACGCTGCGG GAGGTCTAGG TGGTGTCGCT GCCGCTGGAT 1740

TAACAGCATT AGCTATCGGT GTGTCACCAG TATATGCATT AGTTATAGCA GCCGCGTGCG 1800

GTGGTATGGA TTTATTACCA GGTTTCTTTG CGGGTTATAT GATTGGATAT GTGATGAAAT 1860 

ATACAGAGAA ATATGTGCCG GATGGTGTCG ACTTAATTGG ATCGATTGTC ATCTTAGCGC 1920

CATTAGCTCG TCTTATTGCA GTATTATTAA CGCCAGTAGT GAATAGTACA TTGATCGAA 1980

TTGGTGATAT TATCCAAAGT AGTACGAATA CGAATCCAAT TATCATGGGT ATCATTTTAG 2040

GTGGTATTAT TACGGTTGTC GGCACAGCGC CATTGAGTTC AATGGCATTG ACAGCATTAT 2100 

TAGGTTTAAC GGGTGTACCT ATGGCTATTG GTGCCATGGC AGCATTTAGT TCGGCATTTA 2160 

TGAATGGGAC GCTATTCCAT CGCTTAAAAT TAGGTGATCG TAAGTCTACG ATTGCAGTAA 2220

GTATTGAACC TTTATCACAA GCAGATATTG TATCAGCCAA TCCAATTCCA ATCTATATTA 2280 

CAAATTTCTT TGGTGGTGCG ATTGCTGGTT TAATTATTGC TATGTCAGGT TTAATTAACG 2340

ATGCGACAGG TACAGCTACA CCGATTGCAG GATTTTAGT TATGTTTGGA TTTAATCATC 2400

CGACGACAAT TGTGATTTAT GGTGTAGTAA TGGCGATTGT AGGTGCGCTT GCAGGTTATC 2460

TTGGTTCAAT TGTATTTAAA AAATATCCAA TTGTTACTAA GCAAGACATG ATTAATCGAG 2520 

GTGCAGTAGA CGCATAGCAT CATCATATTG AATAGTAAAA ACAAATAAAA CATAGTAACG 2580

TGATTCAGTC GATGTAACAG TCGATAATGA GTCACGTTTT TTTATAGAAA AATACAAGAC 2640

ATAAAAATGT CATAATTTAT TGTCGACAAA TATCATACTG TATAAACATT TATCATTTTC 2700 

TCAAGTACCT TTTACACGAT GGAATGAACT TACTTTTTAC GAAATTATGC GTATTTTATA 2760

AACAAATATC ATTGATAT£A CGGTAAATGT AAGCGTTTAC AACAGAAATA ACAGCATGCT 2820

ACGATATTTT TGTAAATTCA CTGATTCAAG TATTTTAAGT CAATATGAGG AGGGATGTTA 2880 

TGAGCGATTC TGAGAAAGAA ATTTTAAAAA GAATTAAAGA TAATCCGTTT ATTTCACAAC 2940

GTGAACTTGC TGAGGCAATT GGATTATCTA GACCCAGCGT AGCAACATT ATTTCAGGAT 3000 

TAATACAAAA GGAATATGTT ATGGGAAAGG CATATGTTTT AAATGAAGAT TATCCTATTG 3060

TTTGTATTGG CGCAGCGAAT GTAGATCGTA AGTTTTATGT GCATAAAAAT TTAGTTGCAG 3120 

AAACATCAAA TCCTGTAACG TCAACACGCT CTATTGGTGG CGTAgCAAGA AATATTGCTG 3180

AGAACTTAGG TAGGCTTGGC GAAACGGTCG CTTTTTTATC TGCTAGTGGA CAAGATAGTG 3240

AATGGGAAAT GATTAAACGA TTGTCCACAC CATTTATGAA TTTGGATCAT GTTCAACAAT 3300

TTGAAAATGC GAGTACAGGT TCATATACAG CTTTAATTAG TAAAGAAGGC GACATGACAT 3360

ATGGCTTaGC AGATATGGAA GTGTT1SACT ACATTACGCC TGAATTTTTA ATTAAGCGTT 3420 

CACACTTATT GAAAAAGGCT AAGTGCATTA TTGTAGATTT GAATTTAGGC AAAGAGGCAT 3480 

TAAACTTCTT ATGTGCCTAT ACCACGAAAC ATCAAATCAA ATTAGTTATC ACCACGGTTT 3540

CTTCCCCAAA AATGAAAAAT ATGCCTGATT CATTACATGC TATTGATTGG STATCACGA 3600 

ATAAAGATGA AACAGAAACA TACTTAAATT TAAAAATAGA ATCTACTGAT GATTTAAAAA 3660

TAGCTGCTAA ACGCTGGAAT GATTTAGGTG TTAAAAATGT TATTGTGACA AATGGCGTGA 3720

AAGAACTCAT TTATCGAAGT GGTGAGGAAG AAATCATTAA GTCAGTTATG CCATCAAATA 3780 

GTGTGAAAGA TGTTACAGGT GCAGGCGATT CATTCTGTGC TGCAGTAGTG TATAGCTGGT 3840

TAAATGGGAT GTCTACTGAA GATATATTAA TTGCTGGTAT GGTTAACGCA AAGAAAACGA 3900

TAGAAACGAA ATATACAGTT AGGCAAAACC TAGATCAACA GCAACTTTAT CACGATATGG 3960

AGGATTATAA AAATGGCAAA TTTACAAAAG TA'ATTGAGT ATTCTCGAGA AGTTCAGCAA 4020

GCACGGGAGA ACAATCAACC GATTGTAGCA TTAGAATCAA CAATTATTTC GCATGGTATG 4080

CCGTACCCAC AAAATGTTGA AATGGCAACA ACAGTAGAGC AAATTATCAG GAATAATGGT 4140

GCCATTCCAG CAACCATAGC CATTATAGAT GGCAAAATTA AAATTGGTTT AGAAAGCG&. 4200

GATTTAGAAA TACTGGCAAC TAGTAAAGAC GTTGCTAAAG TATCTAGAAG GGATTTAGCA 4260

GAAGTTATTG CGATGAAGTG TGTTGGTGCT ACTACTGTAG CGACGACGAT GATATGTGCT 4320

GCAATGGCTG GTATTCAATT TTTTGTTACA GGAGGTATTG GGGGCGTCCA TAAAGGTGCA 4380

GAACATACGA TGGACATTTC AGCAGACTTA GAAGAACTGT CTAAAACAAA TGTCACTGTT 4440

ATCTGTGCAG GTGCCAAATC AATTTTAGAC TTACCTAAGA CGATGGAGTA TTTAGAAACA 4500

AAAGGCGTTC CAGTTATTGG ATATCAAACG AATGAATTGC CAGCATTCTT CACTCGCGAA 4560

AGCGGTGTTA AGTTAACAAG TTCGGTTGAA ACGCCAGAAC GACTTGCTGA CATTCATTTA 4620

ACAAAACAGC AGTTAAATCT TGAAGGTGGC ATTGTTGTTG CTAATCCAAT TCCATATGAG 4680

CATGCCTTAT CAAAAGCATA TATTGAGGCA ATCATAAATG AAGCTGTTGT TGAAGCGGAA 4740

AATCAAGGTA TTAAAGGTAA GGACGCCACA CCGTTCTTGT TAGGGAAAAT TGTAGAAAAA 4800

ACGAATGGTA AAAGTTTAGC AGCAAATATA AAACTTGTTG AAAACAATGC GGCGTTGGGT 4860

GCTAAAATTG CTGTCGCTGT TAATAAATTA TTGTAGGTGA TGATACATGA ATATTTTATT 4920

CGCTATCACA GGGATAGCAT TTGCACTATT TGTTGCGTTT TTATTCAGTT TTGATCGTAA 4980

AAAAATAGAC TTCAAAAAGA CGTTAATAAT GATATTTATT CAAGTGTTGA TCGTGTTATT 5040

TATGATGAAC ACAACGATTG GTTTGACAAT TTTAACTGCA CTAGGTTCAT TTTTTGAAGG 5100 

GCTAATAAAT ATTAGTAAAG CAGGCATAAA TTTTGTTTTT GGAGATATAC AAAATAAAAA 5160 

TGGCTTTACG TTCTTTTTAA AC G TAT TACT GCCATTAGTT TTTATTTEG TATTAATAGG 5220 

CATCTTTAAT TATATTAAGG TATTACCATT TATTATCAAA TATGTAGGTA TCGCTATTAA 5280

TAAAATAACT AGAATGGGGC GCTTAGAAAG TTATTTTGCT ATTTCAACAG CAATGTTTGG 5340 

GCAACCAGAA GTATATTTAA CAATAAAAGA TATTATTCCA AGATTATCTA GAGCGAAATT 5400

ATATACAATT GCGACGTCTG GTATGAGTGC TGTTAGTATG GCAATGCTAG GTTCATATAT 5460

GCAGATGATT GAACCCAAGT TCGTAGTTAC AGCAGTAATG TTAAATATTT TTAGTGCGCT 5520

TATCATCGCC AGTGTAATCA ATCCCTATAA ATCTGATGAT ACTGATGTTG AAATTGATAA 5580

CTTAACGAAA TCCACAGAAA CTAAAACATT GAATGGAAAA ACAGGAAAAC CTAAGAAAGT 5640 

TGCCTTTTTC CAAATGATTG GTGATAGTGC GATGGATGGG TTTAAAATCG CTGTTGTAGT 5700

AGCCGTAATG TTGTTAGCAT TTATTTCATT AATGGAAGCA ATTAATATCA TGTTTGGTAG 5760

TGTTGGTTTG AACTTTAAAC AGCTTATTGG CTATGTGTTT GCACCAATCG CATTCTAAT 5820 

GGGGATTCCA TGGAGCGAAC TGTTCCAGCT GGCTCTTTAA TGGCGACTAA ATTAATTACA 5880

AATGAGTTTG TAGCAATGCT TGATTTTAAA AATGTCCTGG GTGATGTATC AGCTCGAACA 5940

CAAGGTATCA TTTCAGTTTA CTTAGTAAGc TTCGCTAATT TTGGTACGGT TGGTATCATC 6000 

GTAGGTTCAA TTAAAGGCAT TAGTGATAAA CAAGGAGAAA AAGTTGCATC CTTTGCAATG 6060

AGGTTGCTAC TTGGTTCAAC TCTAGCTTCA ATCATTTCAG GATCAATCAT TGGCTTAGTA 6120 

TTGTAAATGA ATCGAAGTAC CTAAATTAAA TTCATGGCAA AGCTAAACCC CGTCACCAAG 6180 

TTGGCGCAAC AGCGcATgcA TAACTTAGTG ACGGGGTTT ATCATAACAA TCTACTTTTT 6240

CGTAGCCGTT TTTGAAATGT ATGTTGATGG TTTATCTTTT TCAAAAATTG TTAATCCCGT 6300 

TATATCTTTT TTATGTTTTG AAGGGACAAT GAAGCTAAGT ATATAAGCAA AGACAAAAGC 6360

AACTGTAAAT GAAATGGTAG ATACATAGAA AGGTGAGTTA CCTTTGCCAA CACCATTATA 6420

GACATAAGCA AAGATGATAC CCAATATTAA TCCACAAATA ACACCGAATG TATTCGTACG 6480

TTTAGTGAAA ATACCAACTG CAAATACACC AGCCAATGGA ACGCCGAATA ATCCAGTCAC 6540

AAACAAGAAT AAATCCCATA AGTCATTTGA ATTAGAAGCA ATTAAGTATA GTGACATTCC 6600

AAAACCGAAA ATACCTGCAA TGATAATAAT GAAACGTGCA AAGTTAACTT CGTGTCGCTC 6660

GCTACCTTTT CCGAAGAAGC GTTGCTTAAT GTCGATTGAA ATACAAGCAG ATATAGAATT 6720

TAAACTAGAT GAAATGGTAG ACTGTGCAGC GGCGAAAATG GCTGCAATAA GTAATCCTGC 6780

TACAAATGGT GGCATCTCAG TCAAAATGAA ATATGGCACT AC AST GAT G TATTGAAGCC 6840

TTTTGGTAAA ACAGCTTCAT GTGTATAAAA TGAATACAGC ATTGTACCCA TACCATAAAA 6900 

TAAGGGTGCT GAAATTAAAG CTAGGATACC ATTTGTCCAT AACGATTTAT TTGTTTCTTT 6960

TAAACTATCA GAAGCTTGAT AACGCTGCAC GACGTCTTGA CTCGCTGTGT ATTGATACAA 7020

GTTGTTGAAA ATATTTCCTA GGAAAATAAT TGGAATGGCA GCTGCCGCAG TATTTAGTTT 7080 

CCAATTGTCT GCACTAATTA ATTTTTTGTG CTCAATCGCA TCTGCAAAGA CAGTGCCGAA 7140

ACCGCCTTTA ATGTTCACAA CACCTAGAAT AATAATAACT AAAGCGCCGC CTAATAAAAT 7200

GACGCCTTGA ATGAAATCAC TCCAA2CCAC ACCTTCGAAA CCACCTAAAA ATGTATATAA 7260

AATACATAGT AAACCAACGA GTGATGCAAC GATATAAGGG TTCATGTCTG ATACAGATGT 7320 

GATTGCTAAT GTTGGTAAGT AGATAACAAT TGCAACACGC CCTAAATGGT AAACGACAAA 7380

TAATAATGAG CCAATGACAC GTATGCTAGG GCCAAATCTA GCTTCTAAAT BTCATATGC 7440

AGATGTTACC TTTAACTTTT TAAAGAAAGG GACATAGAAA TAAATAAGTA ATGGAATAAT 7500

TGCGACGATA GCAATGTTAC CAGCGATATA TGACCAATCT GTTAAAAATG CTTTCTCTGG 7560

TGTCGACATA AATGTAATCG CACTTAACGT AGTAGCATAA ATTGAAAAGC CAACTACCCA 7620

AGATGGCAAG CGACCACTTG CGGTAAAGAA ACTATTGGTA CTTTGGCTCG CGCGCTTGGT 7680

AAAATAAACG CCAATGAACA ACATAGCTAG TAGATAAATG ATAACGGCAA CCCAGTTTAG 7740

TGTGCCAAAT CCAACTTCTT TCATGGGCAA CATCCCCTTT ACAATGTATT GATTCTTTGA 7800

TGTCTATAAA TCGTATTTTG CAATGAGTTG ATCTAATGTT TGTCGATGTG CTTCGTTAAA 7860 

AGGTTTGAAA GGTCTTTTCG GTAATCCTGC ATCAATGCCA CGATGACGTA ATATTTCTTT 7920

CAATGTTGGA TAAATCCCCA TTGATAACAC TGTTTCGATA ATGTCGTTTG AATCATGTTG 7980

CAGTTGGTAA GCTTCTTGAA TTTGACCTTG TCGTGCTAAG TCGAAGATTT TTCTTGCMT 8040

GCGACCATTA ACGTTATATG TAGAACCAAT TGCACCATCT ACGCCAGAAA TCGTAGCTTG 8100

AACTAACATT TCATCAAAGC CAGATAAGAT TAATTTGTCT GGGAATGCTT TTCTAATACG 8160 

TTCGAGTAGG AAGAAGTTTG GCGCTGTATA TTTAACACCA ACAATTTTTT CATGATTAAA 8220 

TAGCTCGCTG AATTGTTCAA TAGAAATATT CACACCTGTT AAATCTGGTA TTGCATAAAT 8280

AATCATATTG TTCTGAGTTG CTTCGATAAT ATCGAAATAG TAATCTCTAA TTTCTTCAAA 8340

AGTAAATGGA TAGTAGAATG GTGTTACGGC AGAAAGTGCA TCATAACCGA GTTCTGTGGC 8400

ATATTTTCCA AGTTCAATGG CTTCATTTAA ATCTAACGAA CCTACTTGAG CAATCAATTT 8460

CACTTTATCC CCAACTGCCT CTTTGGCAAC CTTGAAAACT TGCTTCTTCT GCTCTGTATT 8520 

TAATAAAAAG TTTTCGCCTG AGCTACCATT TACATAAAGA CCGTCTAATT CTTCAGTTTC 8580 

AATGGCATTT TGAGCAATTT GTTTAAGTCC TTGTTCATTT ACTTGACCAT TTTCATCAAA 8640

AGGAACGAGT AACGCTGCAT ATAAACCTTT TAAATCTTTG TTCATTATGA AGTCCCTCCA 8700

AAAATCATTT GATAATATAG TTTACAGCTA TAATTGTAAA CGCTATCATA AAATGTAACA 8760

ATATCTTTTT GAAAATTGTA GTCATATTTA TGTATAATTA ATGAAAATGT TTTTCAAAAT 8820

CAATAGAAAT GGAGTGAGTA AGSTGTATTA CATCGCAATC GATATTGGAG GCACTCAAAT 8880 

TAAATCGGCA GTTATTGATA AGCAATTGAA TATGTTTGAC TATCAACAAA TATCAACGCC 8940

GGACAACAAA AGTGAGCTTA TTACTGACAA AGTATATGAG ATTGTAACAG GATATATGAA 9000

GCAATATCAG TTGATCCAAC CTGTCATAGG TATTTCATCA GCAGGCGTG TTGATGAACA 9060

AAAAGGCGAA ATTGTATACG CAGGGCCAAC CATTCCGAAT TATAAAGGTA CTAATTTTAA 9120

GCGATTATTA AAATCACTGT CTCCTTATGT CAAAGTAAAA AATGATGTAA ACGCTGCATT 9180

ACTAGGCGAA TTGAAATTAC ATCAATATCA AGCAGAACGG ATCTTTTGTA TGACGCTTGG 9240

TACAGGCATT GGGGGTGCGT ACAAGAATAA TCAAGGTCAT ATTGATAATG GTGAGCTTCA 9300 

TAAGGCAAAT GAAGTTGGGT ATTTATTGTA TCGTCCAACT GAAAATACAA CGTTTGAGCA 9360

ACGTGCTGCA ACGAGTGCAT TGAAAAAGCG CATGATTGCC GGAGGATTTA CGAGAAGCAC 9420

ACATGTGCCA GTATTGTTTG AAGCAGCTGA AGAAGGTGAT GATATTGCAA AACAAATATT 9480

GAATGAGTGG GCAGAAGATG TAGCAGAAGG GATTGCCCAA ATACAGGTCA TGTATGATCC 9540

AGGGCTTATA TTAATTGGGG GCGGTATATC TGAACAAGGA GATAATCTCA TTAAATATAT 9600

CGAGCCGAAA GTTGCACACT ATTTACCAAA AGACTATGTT TATGCACCAA TACA&CGAC 9660

TAAGAGTAAA AATGATGCAG CATTATATGG CTGTTTGCAA TGATAGTTGA AAGAAGGAGT 9720

CATTCTAAAA TAGAATTTGA AACCGTTACG AGAGATGAGA GCTGTTGTTA GTTCCACACA 9780

TCACACTCTA TCTAGGACCA ATCTAAACTA TATCAACCAA cAGTGTGCCA CGGGCAAATT 9840

AAATTGAAGA PGC T GAG AT A TTAAAATTTT AGAAAATGTA AAAAAATATT TGGTATTGAA 9900

ATTAAAAAAG CACCTAGCAA CTCGTTGGGA CAATCACGAT GATTGTCTAC AGTTGCAGGT 9960

GGATTTGAAT ATACTACTAG TTATTTGTTG TCTAGGATAA TAGATTTAGT ATGTTGATAA 10020 

GTTTGACTCA GATTCGTATT TTCTAATAAA TGATAACTCA CGATATCGAT TAAAAAGAGT 10080 

GTCGCAATTT GTGTGTTGAT AAATTGATGG TCGGTATTAC GCGATTGATC CGTTGTTAAA 10140

AGTACTAAAT CTGCACAATC TGTAAGTTTA CTACCTTCAA AATTTGTGAT GGCAACGACA 10200 

TATGCACCAT GAGATTTGGC GACTTCCGCT GCAGAAATTA ATTCCGAAGT ATTACCACTA 10260

TTTGACATAG CAATAAACAT ATCCGAATGA GATAGTAGGG ATGCCGATAT TTTCATTAAA 10320

TGTGAATCGG TAGTAACATT ACCTTTTAGC CCCATACGAA TCATACGATA ATAAAATTCA 10380 

GTCGCTGATA AACCAGAGCT ACCTAGTCCA GCAAAGAGTA TATGTCGACT TGATTGAAGT 10440

TTGTCGATAA AGGTTTGGAT AATGTCGTTA TCAATAAATT CACCAGTTTG TTGAATGATT 10500

TGTTGATGAT ATTTATGAAT TCTTTGAATA ATTGGGCTAT TTTCAATAAC TGTCTCTGTC 10560 

ATTTCTTGTT GAATATTAAA TTTTAAATCT TGGAAATTCT CATAATCCAG CTTATGACTA 10620

AAGCGTGTCA TCGTTGCTGG TGATGTACCA ATCGCATGGG CTASGAGTT AATCGTTGAA 10680 

AAGGCATCGC TATAACCATT TTGTCTTATA TAATTGACGA TGCGTTTATC AGTTTTTGTA 10740 

AATAAATGTT GATAACGTTG AACACGATTC TCAAATTTCA TTGTGTCACC CCTTCATCTT 10800 

AATGATTACT ATTATATATG AAAAATATTT TCAAGATAGT AAAAAGCATT GATAAAAATT 10860

ATCTTAATGA TATATTGTAA ATGACTTTAC GTGAAAAAAC GACTTATGGA GTGAGGAATA 10920

ATGTTACCAC ATGGATTAAT AGTATCTTGT CAGGCACTAC CAGATGAACC ATTGCATTCA 10980

TCTTTTATTA TGTCGAAAAT GGCATTAGCT GCGTATGAAG GTGGTGCTGT TGGTATTCGC 11040 

GCAAATACTA AGGAAGACAT TTTAGCAATT AAAGAAACGG TAGATTTACC AGTTATTGGC 11100 

ATTGTGAAAC GTGACTATGA TCACTCAGAT GTTTTCATTA CTGCAACGTC AAAAGAAGTT 11160 

GATGAACTGA TAGAAAGCCA ATGTGAAGTC ATTGCATTGG ATGCAACGTT ACAGCAACGT 11220

CCGAAAGAAA CGTTAGACGA ATTAGTATCA TATATTAGAA CACATGCACC fiACGTTGAA 11280

ATCATGGCTG ATATCGCGAC CGTTGAAGAA GCTAAAAATG CCGCACGACT TGGCTTTGAT 11340 

TATATTGGCA CGACGTTACA TGGCTATACT AGTTATACGC AAGGACAATT ACTTTATCAA 11400

AATGACTTCC AATTTTTAAA AGATGTACTA CAAAGTGTTG ATGCAAAAGT TATTGCGGAA 11460

GGTAATGTCA TTACACCGGA TATGTATAAA CGTGTGATGG ACTTAGGCGT TCATTGTTCA 11520 

GTCGTTGGTG GTGCGATAAC ACGACCAAAA GAAATTACGA AACGTTTTGT TCAAATTATG 11580

GAAGATTAAA TGATAACGAT AAAAAAACGA GATGACCATC ATTAATTAAA GGCACCTAAT 11640

TATCTTAGGT GGCTGAATGA ATGTAATGGG TTG\TCTCGT TTTGTTTGTT TATGATAGTG 11700 

ATTTTATTTT CAACTTTATC CAAAAATAAG TAAAGCGACG GGGATGGTGA TTAATAGCGA 11760 

CAACGCCACG CGTAAAAACC AAATGATGAT GAGTTTCCAG ACAGGTATTT TAATTTCAGT 11820 

TGCTAGTATA CATGGCACTA ATGCTGAGAA AAAGATAATG GCTGATACGC TTACTACAtC 11880

GACGACAAAT TTAGTACTCA TTGCAGCTTT AGTTACTAAC AAAGATGGTA GAAACATCTC 11940

TACAATAGAA AckCTGACGC TTTTGCTAGT AAAGCCTGAT CAGCAATTGG GAAAATATAA 12000 

ATAAATGGAT AGAAGATATA GCCAAGCCAA TCAATGAATG GTGTATAGTT CGCTACAATC 12060 

AGTCCTAAAA AACCAATCGA TAATATAGAA GGTAAAATAC CAACAGTCAT TTCTAAACCG 12120

TCTTTCAAAT TGTCCCAAAC GTTCTTCACG AGAGATGGTG TTAATGCATT TTGTTTCATC 12180 

GCCTCTGCAT ATGCAGTTTT CAGTCTGCTT CCTTCAATAG CAACTTCTTG TTCTCCTTCT 12240 

TGTCCGTTAT AATATTCTGT TGATTCATTG CTGATTGGCG GTAGCCATGC AGTAATTGCA 12300

GTCACGACAA ATGTGATGAC TAAAGTTATC CAAAAGTATA AATTCCAATG CGGCATTAAT 12360

CCTAAAGTTT TAGCAACGAT AATCATAAAA GTTGCTGAAA CTGTTGAAAA GCCAGTCGCA 124 20 

ATAATCGTGG CTTCTCGTTT GTTGTACATC CCTTGCTTAT AGACACGATT AGTAATCAAT 12830 

AATCCTAAGG AATAACTGCC GACAAACGAA GCCACTGCAT CGACAGCGGA TTTTCCTGGT 1254 0 

GTTTTAAAAA TAGGTCTCAT AATAGGCTCC ATATAAACAC CGACAAATTC TAATAAGCCA 12 600 

TAGCCCACTA ATAAAGAAAG cGcAATTGCA CCTACTGGAA TTAAGATACT TAATGGCATC 12660 

ATTAATTTTT CAAACAAAAA CG5ACCATAG TTAGCTTTAA ATAGTATTGA TGGACCGATT 12720 

TTAAATACAT ACATTATACC GATCATTGCA CCTGCAACTT TAAATAATGT AATGACCAAG 12780

TTTGTGATTG AAGTCATAAA AGTACGTCTC ACTATTGGTA ACGCTGTACC AATTAAAATC 12840

ATAATCAGTG CAACATAGGG CATAAGTGGA CCTATGATTG AGCGAATGC TAGATGAACA 12900

TGATCGACGA AAATAGTGTT GTTACCATTA ATCGTAAAAG GAATAAAGAA ACATAGTATG 12960

CCCACTAAAC TATAGACAAA AAAACGCCAT GCACTTGGTT GTTGTGCATT AGAATGATAT 13020

TGATTCATTA AAGCAACCCC TTTGTTTAAA TGAATACACA AAACTGTATG ATGCATCTTC 13080 

CCCTTAATGA GATGAATCAT TATTTTAATT TAGAAAAATC TGAAAACTTA CTATAATTGT 13140

ATAGTTTGAA TTATTTTCAT ACCAATACAA ATTAACTAAT TATATATAGA TTGAAACTAT 13200 

ATTACTTAAT AAAATATTTA TCTTAAATGT TGTTGTGTTG ATTCAACACC ACAACTAAAA 13260

GTGTTTATAA ATTATTTGGA AATACACATA TTTGTAAATG ATTAGTATCG ATTTAATATC 13320 

GTATTATTAA ATTTTTATTA ATTTTGTAGT CTTAATCmAA AAATAATATA TGTCATGTTA 13380 

TATTGAAGGT GCAGTTGTTT TTCATTCTCA AGAGGGGGTC AAAAAAATAC TTTTGAGGTG 13440 

ATTATATGTT AAGAGGACAA GAAGAAAGAA AGTATAGTAT TAGAAAGTAT TCAAffGGCG 13500

TGGTGTCAGT GTTAGCGGCT ACAATGTTTG TTGTGTCATC ACATGAAGCA CAAGCCTCGG 13560

AAAAAACATC AACTAATGCA GCGGCACAAA AAGAAACACT AAATCAACCG GGAGAACAAG 13620 

GGAATGCGAT AACGTCACAT CAAATGCAGT CAGGAAAGCA ATTAGACGAT ATGCATAAAG 13680 

AGAATGGTAA AAGTGGAACA GTGACAGAAG GTAAAGATAC GCTTCAATCA TCGAAGCATC 13740

AATCAACACA AAATAGTAAA ACAATCAGAA CGCAAAATGA TAATCAAGTA AAGCAAGATT 13800 

CTGAACGACA AGGTTCTAAA CAGTCACACC AAAATAATGC GACTAATAAT ACTGAACGTC 13860

AAAATGATCA GGTTCAAAAT ACCCATCATG CTGAACffAA TGGATCACAA TCGACAACGT 13920

CACAATCGAA TGATGTTGAT AAATCACAAC CATCCATTCC GGCACAAAAG GTAATACCCA 13980

ATCATGATAA AGCAGCACCA ACTTCAACTA CACCCCCGTC TAATGATAAA ACTGCACCTA 14040

AATCAACAAA AGCACAAGAT GCAACCACGG ACAAACATCC AAATCAACAA GATACACATC 14100

AACCTGCGCA TCAAATCATA GATGCAAAGC AAGATGATAC TGTTCGCCAA AGTGAACAGA 14160

AACCACAAGT TGGCGATTTA AGTAAACATA TCGATGGTCA AAATTCCCCA GAGAAACCGA 14220

CAGATAAAAA TACTGATaAT AAACAACTAA TCAAAGATGC GCTTCAAGCG CCTAAAACAC 14280

GTTCGACTAC AAATGCAGCA GCAGATGCTA AAAAGGTTCG ACCACTTAAA GCGAATCAAG 14340

TACAACCACT TAACAAATAT CCAGTTGTTT TTGTACATGG ATTTTTAGGA TTAGTAGGCG 14400 

ATAATGCACC TGCTTTATAT CCAAATTATT GGGGTGGAAA TAAATTTAAA GTTATCGAAG 14460

AATTGAGAAA GCAAGGCTAT AATGTACATC AAGCAAGTGT AAGECATTT GGTAGTAACT 14520

ATGATCGCGC TGTAGAACTT TATTATTACA TTAAAGGTGG TCGCGTAGAT TATGGCGCAG 14580 

CACATGCAGC TAAATACGGA CATGAGCGCT ATGGTAAGAC TTATAAAGGA ATCATGCCTA 14640

ATTGGGAACC TGGTAAAAAG GTACATCTTG TAGGGCATAG TATGGGTGGT CAAACAATTC 14700

GTTTAATGGA AGAGTTTTTA AGAAATGGTA ACAAAGAAGA AATTGCCTAT CATAAAGCGC 14760

ATGGTGGAGA AATATCACCA TTATTCACTG GTGGTCATAA CAATATGGTT GCATCAATCA 14820

CAACATTAGC AACACCACAT AATGGTTCAC AAGCAGCTGA TAAGTTTGGA AATACAGAAG 14880

CTGTTAGAAA AATCATGTTC GCTTT7AATC GATTTATGGG TAACAAGTAT TCGAATATCG 14940

ATTTAGGATT AACGCAATGG GGCTTTAAAC AATTACCAAA TGAGAGTTAC ATTGACTATA 15000

TAAAACGCGT TAGTAAAAGC AAAATTTGGA CATCAGACGA CAATGCTGCC TATGATTTAA 15060

CGTTAGATGG CTCTGCAAAA TTGAACAACA TGACAAGTAT GAATCCTAAT STAC G TATA 15120

CGACTTATAC AGGTGTATCA TCTCATACTG GTCCATTAGG TTATGAAAAT CCTGATTTAG 15180 

GTACATTTTT CTTAATGGCT ACAACGAGTA GAATTATTGG TCATGATGCA AGAGAAGAAT 15240 

GGCGTAAAAA TGATGGTGTC GTACCAGTGA TTTCGTCATT ACATCCGTCC AATCAACCAT 15300 

TTGTTAATGT TACGAATGAT GAACCTGCCA CACGCAGAGG TATCTGGCAA GTTAAACCAA 15360 

TCATACAAGG ATGGGATCAT GTCGATTTTA TCGGTGTGGA CTTCCTGGAT TTCAAACGTA 15420 

AAGGTGCAGA ACTTGCCAAC TTCTATACAG GTATTATAAA TGACTTGTTG CGTGTTGAAG 15480

CGACTGAAAG TAAAGGAACA CAATTGAAAG CAKTTAAAT TCATCTTCTG AATTTAATAT 15540

GCTATGTAAA TCGTGCTGTT ATCATGGCAC ATCAGATATA AGTAGCATCA CAGTGTTGAA 15600

TTTAAAAATA GTAAAGTGAA ATAAAGCGCC TGTCTCATTA GCGAAAACTA AAGGGACAGG 15660 

CGTATCTGTT TATGAGCTTA ATAAATTGTA TGAATAATAT GGTTGATCGA ATAACTGTT 15720

ATCATGATGA TAAATTGAGT TTTTTAAAAT AATGATATAT TACATCATTG TTATAGCGTT 15780 

TAAGAAATCA ACAACTTTAC GATAAATAGT GATTGCTTCG TCATTAGGTC TACGATCAAA 15840

ATCATGCTCG TTTTTATTCA CGCGTTCAAA TGTTGAATGT GGAACATGAT TCATGATATG 15900 

TTCGCTTTCC TCAAGGGGAA CATCATAATC GCCATTACAA TGCGCAATGA AAACAGGTGG 15960



AAGTGTTTTA AGTTCATCTG GTGCAATATT ATATTTTGAA TTAGTATAAT CAGCAATGTT 16020

AATCATATTT ATCCATTTAC CTGTGCCACG TGCATAAACG TAGATTAAAA AACGTTGTGC 16080

GATTTGATCT TGAACAACCG GTGTTGGTGA AGTGAGTTGT GCAATCATTG TTTCGTTTAC 16140

GCTTTGAGCT ATTTTTGCGT AATAACTATT AGTTGTTTTA AAAGGTTCAG TGTTGATGCG 16200

ACTATAACCA TAAAAATCAA TAACACCATC AATATCTCTG TCTCGTGCAA TTAATAGACT 16260

TAAATATGCA CCTGATGATC TGCCAAAGGT AAAAATAGGG CAATTAGAAT ATTGTGATTG 16320

AATCGCATCG AATGAtGCgn AGnACATCCT CAATAATGCA ATCGAGACTT ACTTCTGGTA 16380 

ATAAACGATA ACTTAGTTGA ATTAAATCGT AATGTTCCGT AAgATATCGA TATACTGTGG 16440 

GGATAAATCG TTAGCTTTAC CGAACATTAA TCCACCACCG TGGATGTAGA CAATAGCGCC 16500 

TTTTGTTGGT TGATTTTTTG CTTTAATAAT TGTGTAAGGT AATGCAAATG CATCTTTAGT 16560

AATTACTTTA TCTTTAATTT CAGTCACGAT TTAATAGGCT CCTTATTTTT GATATTGATG 16620 

TCATTATAAC ACTGTCTTAA ATTTCCATGA AAAATAGTCT TAAGACGATG AGTCATGATA 16680

ATTCTGTTCC AATTGACGTA AAGCGTCACG GGTATGCTTC TTTAGACTT CCCCATAATC 16740 

CATCATTTTA ACAATATCTT TAAAAGCAGC ATGTGGAATG GCTAAATCTT CTAAATCTGC 16800

CATAGAAAAT TCAAGATTGA TATCATGTGG TCGCTGTTCA GCAAGTTTAT GCACAAAGTC 16860 

AGGTTCTGTG ACAAAAGGCG AAGACATGCC GACCATATCT GCATGTTGTA AAGCATCTAA 16920

AGCAGACTCT GGAGAATTAA TCCCGCCACT TGCAATTAAA GGGATACGAC CTGCTAAATG 16980

TTCATAGACA ATTTGGTTAA CTGGTCGACC GAAATGATCA CCTGGTGTAC GAGACGTATT 17040

TTGATAAATA TGTCGACCCC AGCTAGCGAT TGCTAAGTAT TGGATGTTTG AAACGTCCAT 17100 

GACCCAATTG ATTAATTGGT TGAACTCGTC AATGGTATAT CCTAAATCAC TGCCTCTGGT 17160 

TTCTTCTGGC GTTGCTCGAA ATCCTAAAAT AAAATTGTCA GGTGCTTCTT TATCAATCAC 17220 

TTCTTGTACC GCACGCATAA CTTCTAAACA TAATCTTGCA CGATTTTTTA ATGAGTCGGC 17280 

ACCGTAATGG TCTGTACGTT TATTCGAAAA AGTTGAGAAA AATGTTTGAA TCAGKAACG 17340

TTGTGCAATC GAAATTTCCA CACCATCAAA ACCTGCTTTA ATCGCGCGTA ATGTAGCATC 17400

GCGATACTGC TGAATGATGC TATTGATTTT CTCATGAGAC ATGGCGATAA CATCGTGTTC 17460

AATCGGTGAA TGCAATGTCA TAGGGCTTGG TCCATACACC TTTCCAAAAT TTAAAATGGC 17520 

TTGATTTGAA AAACGACCAG CATGCGCTAg CTGGATAATA GCGAGGCTAC CATGTTGTTT 17580

CATCGTAGAT GCCATGTTAG TTAATCCAGG GATACAAGCA TCATGATCAA TATTAAAGCC 17640

ATATTCAAAC AATTGACCAT AAGGTTCAAT GTAAGCAGCG CCGGTGACTT GCATTCCAGC 17700

TGAATTAGAG CGACGTGCAG CATAAGCCAA GTCTTCTTT GTAATATAGC CTTCTTTTGT 17760

TGATGTGTTT ACGGTCATTG GTGATAATAC AAAGCGATTC GAAATTTTGA TGCCATTAGG 17820

TAAGTGGATT GATTGTAAAA GTGGTTTGTA TCGGTACATA CTATGATTCC TTTTCTATTC 17880

AATATTGTTT TCAAAGTACC ATGGAAAGAA TGAATAATCA ATGATGAACA GTCTTGATAG 17940

AATAGAATTG GTACATGGAA AGTATTTTTA AAATTAAACT AATGAATGGC ATTTGTAGGT 18000

CTGAAAATAT GAATATGAAA AAGAAAAATA AAGGCGAAAA GATATAAAAG TTAATTGAAA 18060

AACGTTATCA TATACGTGGG TATATGAAGA GGGAATGGTA TTAAGAACGC TAAAATGTTA 18120

TGTCGGTTTG ACATGACAG3 ATAAGTTTGG AGATGACGGA TTGGTTAAAT TAAGCGTATT 18180 

AGACTATGCC TTAATAGATG AAGGTAAGGA TGCACAAAAG GCATTGCAAG ATTCAGTGAC 18240

ACTTGCAAAA TTAGCAGATC GACTTGGCTT TAAGCGAATT TGGTTTACGG AACATCATAA 18300 

TGTACCAGCG TTTGCGTGTA GTAGTCCAGA ACTTTTGATG ATGfiTACAT TGGCGCAGAC 18360

AAATCACATA CGAGTTGGCT CTGGTGGTGT GATGCTGCCG CACTATCGAC CTTATAAAAT 18420

TGCTGAGCAT TTTAGAATGA TGGCAGCGTT ATATCCAAAT CGTATTGATT TAGGTATTGG 18480

TAATAATCCA GGTACTACTA TGGTAAAGCA AGCTTTAGAT GGAATAAATC CTACATATGA 18540

TAGTTACGAT GAATCGATTT CGTTATTACG TGATTATCTT ACAATAAAGG ATAAACCAAG 18600

TGCGCATACG TTAGGTGTCC AACCACACAT TGATCATTTT CCAGAAATGT GGTTATTAAG 18660

TAGTAGCGCA ACATCTGCCA AAATAGCTGC CGAACTAGGT ATAGGGCTTT CTGTTGGAAC 18720 

ATTTTTGCTA CCAGATATAA ATGCGETACA TACAGCGAAG GATAACATTG ATATTTACAA 18780

AAAACATTTC CAAGCATCAA CGATTAAAAT GGACGCAAAG GTGATGGCAT CTGTATTTGT 18840

CATTGTAGCT GATAACGAAG CGGAAGTAGC AGCATTACAA CATGCCTTAG ATGTTTGGTT 18900

ATTAGGTAAA TTACAATTTG CAGAATTTGA AGATTTTCCT TCAGTAGACA fiGCACAAAA 18960 

GTATAAGCTT AATGATCGAG ACAAAGAGAT GATTCAAGCA CATCAAGCAC GCATCATTGC 19020 

AGGTACACAA GAAAAGGTTA AAGCACAATT AGATGATTTC ATTGCTACGT TTGAAGTTGA 19080 

TGAGGTGTTA GTAGCACCGC TTATTCCAGG TATTGAACAG CGTTGTAAAA CATTAAAATT 19140 

ACTCGCGGAA ATTTATTTGT AGCATTTTAA ATAGAAGAGA AAGGATGAAG ATAAGATGAA 19200 

AAAGTTAGCC AATTATTTAT GGGTAGAAAA AGTAGGAGAT TTGTATGTGT TTAGTATGAC 19260

ACCTGAATTG CAAGATGATA TTGGGACAGT AGGTTATGTT GAATTCGTAA GTCCAGATGA 19320 

AGTTAAAGTG GATGATGAAA TTGTGAGTAT CG&GCATCG AAAACGGTCA TTGATGTGCA 19380

AACGCCATTG TCAGGAACGA TTATTGAGCG AAATACAAAA GCGGAAGAAG AACCGACAAT 19440

TTTAAACTCT GAAAAACCAG AAGAAAATTG GTTGTTCAAA TTGGATGATG TCGATAAAGA 19500

AGCATTCCTA GCATTACCGG AGGCTTAAAT GGAAACGTTA AAATCAAATA AAGCGAGAE 19560

TGAATATTTA ATCAATGATA TGCATCGAGA GAGAAATGAC AATGACGTAT TGGTAATGCC 19620

ATCTTCATTT GAAGATTTGT GGGAATTATA TCGAGGCTTA GCAAATGTCA GACCGGCATT 19680

ACCTGTAAGT GATGAATATT TAGCTGTACA AGATGCTATG TTAAGTGATT TGAATCGTCA 19740 

ACATGTTACG GATTTGAAGG ATTTGAAGCC GATAAAAGGT GACAATATCT TTGTTTGGCA 19800 

AGGTGATATC ACGACGTTAA AAATCGATGC TATTGTTAAT GCTGCAAATA GTCGTTTTCT 19860

AGGATGTATG CAAGCTAATC ATGACTGCAT TGATAATATT ATTCATACAA AAGCGGGTGT 19920

TCAAGTTCGA CTTGATTGTG CAGAGATCAT TCGACAACAA GGGCGCAATG AAGGTGTAGG 19980

TAAAGCCAAA ATAACACGTG GATATAATTT GCCAGCAAAG TATATAATTC ATACGGTTGG 20040

TCCGCAAATA CGTCGATTGC CTGTTTCAAA GATGAATCAG GACTTGTTAG CTAAATGTTA 20100 

TCTTAGCTGT CTTAAATTGG CTGATCAACA TAGTTTAAAT CATGTCGCTT TTTGCTGTAT 20160

ATCTACAGGT GTATTTGCTT TTCCTCAAGA TGAAGCAGCA GAAATTGCTG TTCGAACAGT 20220 

AGAAAGCTAT CTCAAAGAAA CAAATTCAAC ATTGAAAGTC GTGTTCAATG TATTTACAGA 20280

TAAGGATTTA CAACTGTATA AGGAGGCATT TAACCGTGAT GCAGAGTAGT AAGTGGAATG 20340

CAATGTCTCT GTTAATGGAT GACAAGACAA AGCAGGCTGA AGTATTGCGT ACTGCGATTG 20400

ATGAAGCAGA TGCGATAGTG ATTGGAATTG GTGCAGGCAT GTCTGCATCT GACGGATTTA 20460

CATATGTAGG AGAGCGTTTT ACGGAAAATT TCCCAGATTT TATTGAAAAA TATCGCTTCT 20520 

TTGATATGTT GCAAGCGAGT TTACATCCTT ATGGCAGTTG GCAAGAGfflT TGGGCATTTG 20580 

AGAGTCGTTT TATTACATTA AACTATTTAG ATCAACCTGT AGGTCAGTCT TACCTCGCTT 20640 

TAAAATCCTT GGTGGAAGGT AAACAGTACC ACATTATAAC TACGAATGCA GATAATGCTT 20700

TCGATGTAGC TGATTATGAT ATGACTCATG TATTTCATAT ACAAGGGGAG TATATACTGC 20760 

AACAGTGTAG cTCAGCATTG TCATGCTCAA ACGTATCGCA ATGATGATTT AATTCGTAAA 20820

ATGGTTGTTG CGCAACAAGA TATGCTTATA CCTTGGGAGA TGATTCCAAG ATGTCCAAAA 20880 

TGTGATGCCC CAATGGAAGT GAATAAACGT AAAGCGGAAG TTGGGATGGT TGAAGATGCT 20940

GAATTTCATG CGCAACTACA TCGTTATAAT GCTTTTCTAG AGCAACATCA AGATGATAAA 21000

GTGTTGTATT TGGAAATTGG AATTGGTTAT ACTACACCAC AATTTGTGAA GCATCCTTTT 21060 

CAGCGTATGA CACGTAAAAA TGAAAATGCC CTTTATATGA CGATGAATAA AAAGGCATAT 21120 

CGCATTCCGA ATTCAATTCA AGAACGTACC ATACATTTAA CTGAGGATAT CTCASATTG 21180 

ATTACAGCAG CACTCCGGAA CGACAGCACA ACGAAAAATA ACAACATTGG AGAGACAGAA 21240

GATGTACTTA ATAGAACCGA TTAGAAATGG AGAATATATT ACTGATGGTG CGATTGCACT 21300

CGCTATGCAA GTTTATGTTA ACCAGCATAT CTTTTTAGAT GAAGATATTT TATTCCCTTA 21360 

TTATTGTGAT CCAAAAGTGG AAATTGGACG TTTTCAAAAT ACTGCTATAG AAGTGAATCA 21420

AGATTATATA GATAAACACA GTATTCAAGT AGTTCGCCGA GATACTGGTG GTGGCGCTGT 21480

GTATGTTGAT AAAGGTGCCG TTAATATGTG TTGTATTTTA GAACAAGACA CTTCAATTTA 21540

TGGTGATTTT CAACGATTTT ATCAACCAGC TATAAAGCG TTGCATACAT TAGGTGCAAC 21600 

AGATGTGGTA CAAAGCGGTA GAAATGATTT AACATTGAAT GGTAAAAAAG TGTCAGGCGC 21660 

CGCAATGACA TTAATGAATA ATCGTATTTA TGGCGGTTAT TCGCTATTAC TTGATGTTAA 21720

TTATGAAGCA ATGGATAAAG TGTTAAAGCC TAATCGCAAA AAGATTGCAT CGAAAGGGAT 21780

TAAATCTGTG CGCGCACGTG TTGGTCATCT TAGAGAAGCA CTGGATGAAA AGTATCGTGA 21840

TATAACCATT GAAGAATTTA AAAATTTAAT GGTGACGCAG ATTTTGGGAA TCGATGACAT 21900 

TAAAGAGGCG AAACGATATG AATTAACGGA TGCAGATTGG GAAGCGATTG ATGAATTAGC 21960

TGATAAAAAG TATAAAAATT GGGATTGGAA TTATGGCAAG TCACCCAAAT ATGAATACAA 22020

TCGAAGTGAA AGATTATCTT CAGGTACGGT AGACATAACA ATTTCTGTTG AACAAAATCG 22080 

TATCGCAGAT TGTCGTATTT ATGGGGATTT CTTTGGACAA GGTGATATAA AAGATGTGGA 22140

AGAAGCATTA CAAGGAACAA AAATGACAAG AGAAGATTTA ACGfiTCAGT TAAAGCAATT 22200 

AGACATCGTT TATTATTTTG GCAATGTTAC GGTAGAAGCA TTAGTGGATA TGATTTTAAG 22260

TTAATATTGT TATTTTATGT ATGCTGAATC ATTGGAAGTG TTTGCTTGCT CTTGAAAAGG 22320 

TGACAATAGT GTTTGGTGAA GGTTGAACAT ATGAGTGGAA ATTATTGCCT TTAACTATTC 22380 

AAAGTATGAT ATATATATGG TTTTTGTTTC TAAATGATTG GGTATTTGAA AATAGATGAG 22440

TTTAATATTT TAAGGAATAT AATGATGTTT ACTTTTATAA TTCATATAGA ATATTAAGCA 22500

ATATAAGTCT GTTGATATAT ACAAAATATA ATGACTGCTA TAATGAGTAA TCAATAGACA 22560

CAAAGAGGAG ATTATGTGAT GAATAATAAA GTATTAGTAA CCGGTGGTAC AGGGTTTGTT 22620

GGCATGCGAA TTATTTCACG ATTATTAGAA CAAGGTTATG ACGTACAAAC GACGATACGT 22680

GATTTAAGTA AAGCTGATAA AGTAATTAAA ACAATGCAAG ACAATGGCAT TTCCACAGAG 22740

CGATTAATGT TTGTCGAAGC GGATTTATCA CAAGATGAAC ATTGGGATGA fiCAATGAAA 22800 

GATTGCAAGT ATGTCTTGAG TGTAGCATCT CCGGTGTTTT TCGGTAAAAC AGACGATGCA 22860

GAAGTGATGG CGAaCTGcAA TTGAAGGTAT ACAACGTATT TTAAGAGCTG CAGAACATGC 22920

GGGTGTTAAA CGTGTGGTAA TGACTGCAAA CTTTGGTGCA GTTGGTTTTA GTAATAAAGA 22980

TAAAAATTCA ATCACAAATG AAAGTCATTG GACAAATGAA GATGAACCAG GCTTATCAGT 23040

ATATGAAAAA TCAAAATTGT TAGCTGAAAA GGCAGCGTGG GATTTTGTTG AGAATGAAAA 23100

TACAACAGTA GAATTTGCCA CAATCAATCC AGTTGCAATT TTTGGGCCAT CATTAGATGC 23160

ACACGTTTCA GGAAGCTTTC ATTTATTAGA AAffTTATTG AATGGTTCAA TGAAACGTGT 23220 

ACCGCAAATT CCGTTAAATG TTGTTGATGT GAGAGACGTA GCTGAACTGC ACATTTTGGC 23280

AATGACAAAT GAACAAGCTA ATGGCAAGCG ATTTATTGCG ACGGCTGATG GACmAATTwA 23340

tTTGTTGGGA ATTGcCAAAt TAATTAAAGA AAAGGGCCTG GAAATAGCTC CAAAAGTTtC 23400

TACTAAAAAA TTACCCAGCT TTATTTTGAG CnAnGnGCC 23439
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4522 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

CCCTTTGAGA GTATATCATC TAGTCAAATT ATGCCTGTCA TTAGAGCGAC TAGCTTTGAT 60 

AAAAAAACTA AACAAGGTGT TCCTTTTATC TATGATGAGC ATACTGCAGA AACAGCAGTT 120 

TATTATGCAG TCGATTTAGG GAAATCATAT CGTCTAATTG ACGAAAGCAT GTTAGAGGAT 180 

TTGAAGTTAA CTGAACAACA AATAAGAGAA ATGTCTCTGT TTAATGTTAG AAAATTGTCA 240 

AATTCATATA CGACTGATGA AGTAAAAGGT AATATTTTTT ATTTTATTAA CTCAAATGAC 300

GGGTATGATG CAAGTAGGAT ACTAAATACT GCATTTTAA ATGAAATTGA GGCACAATGT 360

CAAGGCGAAA TGCTCGTAGC AGTGCCACAC CAAGATGTGT TAATTATTGC AGATATACGC 420

AATAAAACAG GATATGATGT GATGGCACAT TTAACAATGG AATTTTTCAC TAAAGGTCTA 480

GTTCCAATTA CATCATTATC CTTTGGATAT AAACAGGGTC ATCTTGAACC GATATTTATT 540

TTAGGTAAAA ATAATAAACA AAAAAGAGAT CCAAACGTGA TTCAGCGTTT AGAAGCAAAT 600 

CGTCGTAAAT TTAATAAAGA TAAATAGAAA TAATTGGATA AGGAGTTTTG TCATAATGAA 660

TTTATTTTAC AATCCTAAAT ATGTAGGAGA TGTCGCATTT TTACAAATTG AACCAGTTGA 720

AGGTGAATTA AACTACAATA AAAAAGGTAA TGTTGTTGAA ATTACtAATG AAGGTAATGT 780

TGTAGGTTAT AATATTTTTG AAATTTCAAA AGATATAACA ATTGAAGAAA AAGGTCATAT 840 

TAAATTAACT GATGAACTTG TAAATGTATT CCAAAAGCGT ATTTCAGAAG CTGGTTTTGA 900 

TTATAAATTA AATGCTGATC TATCACCGAA ATTTGTAGTT GGCTEGTTG AAACTAAAGA 960

CAAACATCCT GATGCAGATA AATTAAGTGT ACTAAATGTA AACGTTGGAA ATGACACATT 1020

ACAAATTGTA TGTGGCGCGC CTAACGTTGA AGCTGGACAG AAAGTTGTTG TTGCTAAAGT 1080 

AGGTGCAGTG ATGCCTAGCG GTATGGTAAT TAAAGATGCT GAATTACGTG GTGTTGCCTC 1140 

AAGCGGTATG ATTTGTTCAA TGAAAGAATT GAATTTACCT AATGCACCTG AAGAAAAAGG 1200

TATTATGGTA TTAAATGACA GCTATGAAAT TGGACAAGCA TTtTTTGAAT AATTAAGGAA 1260

GGTAGTGAAA ATATGAGCTG GTTTGATAAA TTATTCGGCG AAGATAATGA TTCAAATGAT 1320

GACTTGATTC ATAGAAAGAA AAAAAGA^GT CAAGAATCAC AAAATATAGA TrACGATCAT 1380 

GACTCATTAC TGCCTCAAAA TAATGATATT TATAGTCGTC CGAGGGGAAA ATTCCGTTTT 1440

CCTATGAGCG TAGCTTATGA AAATGAAAAT GTTGAACAAT CTGCAGATAC TATTTCAGAT 1500

GAAAAAGAAC AATACCATCG AGACTATCGC AAACAAAGCC ACGATTCTCG TCTACAAAAA 1560

CGACATCGCC GTAGAAGAAA TCAAACAACT GAAGAACAAA ATTATAGTGA ACAACGTGGG 1620

AATTCTAAAA TATCACAGCA AAGTATAAAA TATAAAGATC ATTCACATTA CCATACGAAT 1680

AAGCCAGGTA CATATGTTTC TGCAATTAAT GGTATTGAGA AGGAAACGCA CAAGCCAAAA 1740

ACACATAATA TGTATTCTAA TAATACAAAT CATCGTGCTA AAGATTCAAC TCCAGATTAT 1800 

CACAAAGAAA GTTTCAAGAC TTCAGAGGTA CCGTCAGCTA TTTTTGGCAC AATGAAACCT 1860

AAAAAGTTAG AAAATGGTCG TATCCCTGTA AGTAAACCTT CAGAAAAAGT TGAGTCAGAT 1920

AAACAAAAAT ATGATAAATA TGTAGCTAAG ACG(AAACGT CTCAAAATAA ACAATTAGAA 1980 

CAAGAAAAAC AAAATGATAG TGTTGTCAAA CAAGGAACTG CATCTAAATC ATCTGATGAA 2040

AATGTATCAT CAACAACAAA ATCAATGCCT AATTATTCAA AAGTTGATAA TACTATCAAA 2100

ATTGAAAATA TTTATGCTTC ACAAATTGTT GAAGAAATTA GACGTGAACG AGAACGTAAA 2160 

GTGCTTCAAA AGCGTCGATT TAAAAAAGCG TTGCAACAAA AGCGTGAAGA ACATAAAAAC 2220 

GAAGAGCAAG ATGCAATACA ACGTGCAATT GATGAAATGT ATGCTAAACA AGcGGAACgC 2280

TATGTTGGTG ATAGTTCATT AAATGATGAT AGTGACTTAA CAGATAATAG TACAGATGCT 2340

AGTCAGCTTC ATACAAATGG CATAGAGAAT GAAACTGTAT CAAATGATGA AAATAAACAA 2400

GCGTCAATAC AAAATGAAGA CACTAATGAC ACTCATGTAG ATGAAAGTCC ATACAATTAT 2460

GAGGAAGTTA GTTTGAaTCA AGTATCGACA ACAAAACAAT TGTCAGATGA TGAAGTTACG 2520

GTTTCGAATG TAACGTCTCA ACATCAATCA GCACTACAAC fflAACGTTGA AGTAAATGAT 2580

AAAGATGAAC TAAAAAATCA ATCCAGATTA ATTGCTGATT CAGAAGAAGA TGGAGCAACG 2640

aATAAAGAAG AATATTCAGk AAGTCAAATC GATGATGCAG AATTTTATGA ATTAAATGAT 2700

ACAGAAGTAG ATGAGGATAC TACTTCAAAT ATCGAAGATA ATACCAATAG AAACGCGTCT 2760

GAAATGCATG TAGACGCTCC TAAAACGCAA GAGTACGCAG TAACTGAATC TCAAGTAAAT 2820

AATATCGATA AAACGGTTGA TAATGAAATT GAATTAGCAC CGCGTCATAA AAAAGATGAC 2880

CAAACAAACT TAAGTGTCAA CTCATTGAAA ACGAATGATG TGAATGATAA TCATGTTGTG 2940

GAAGATTCAA GCATGAATGA AATAGAAAAG AATAACGCAG AAATTACAGA AAATGTGCAA 3000

AACGAAGCAG CTGAAAGTGA ACAAAATGTC GAAGAGAAAA CTATTGAAAA CGTAAATCCA 3060

AAGAAACAGA CTGAAAAGGT TTCAACTTTA AGTAAAAGAC CATTTAATGT TGTCATGACG 3120

CCATCTGATA AAAAGCGTAT GATGGATCGT AAAAAGCATT CAAAAGTCA TGTGCCTGAA 3180

TTAAAGCCTG TACAAAGTAA GCAAGCTGTG AGTGAAAGAA TGCCTGCGAG TCAAGCCACA 3240

CCATCATCAA GATCTGATTC ACAAGAGTCA AATACAAATG CATATAAAAC AAATAATATG 3300 

ACATCAAACA ATGTTGaGAA CAATCAACTT ATTGGTCATG CAGAAACAGA AAATGATTAT 3360

CAAAATGCAC AACAATATTC AGAGCAGAAA CCTTCTGTTG aTTCAACTCA AACGGAAATA 3420

TTTGAAGAAA GTCAAGATGA TAATCAATTG GAAAATGAGC AAGTTGATCA ATCAACTTCG 3480

TCTTCAGTTT CAGAAGTAAG CGACATAACT GAAGAAAGCG AAGAAACAAC ACATCCAAAC 3540

AATACTAGTG GACAACAAGA TAATGATGAT CAACAAAAAG ATTTACAGTC ATCATTTTCA 3600 

AATAAAAATG AAGATACAGC TAATGAAAAT AGACCTCGGA CGAACCAACA AGATGTTGCA 3660

ACAAATCAAG CTGTACAAAC ATCTAAGCCG ATGATTCGTA AAGGCCCAAA TATTAAATTG 3720

CCAAGTGTTT CATTACTAGA AGAACCACAA GTTATTGAGT CGGACGAGGA CTGGATACA 3780

GATAAAAAGA AAGAACTGAA TGACGCATTA TTTTACTTTA ATGTACCTGC AGAAGTACAA 3840

GATGTAACTG AAGGTCCAAG TGTTACAAGA TTTGAATTAT CAGTTGAAAA AGGTGTTAAA 3900 

GTTTCAAGAA TTACGGCATT ACAAGATGAC ATTAAAATGG CATTGGCAGC GAAAGATATT 3960

CGTATAGAAG CGCCTATTCC AGGAACTAGT CGTGTTGGTA TTGAAGTTCC GAACCAAAAT 4020

CCAACGACAG TCAACTTACG TTCTATTATT GAATCTCCaA GTTTTAAAAA TGCTGAATCT 4080

AAATTAACAG TTGCGATGGG GTATAGAATT AATAATGAAC CATTACTTAT GGATATTGCT 4140

AAAACGCCAC ACGCACTAAT TGCAGGTGCA ACTGGAT&G GGAAATCAGT TTGTATCAAT 4200 

AGTATTTTGA TGTCTTTACT ATATAAAAAT CATCCTGAGG AATTAAGATT ATTACTTATC 4260

GATCCAAAAA TGGTTGAATT AGCTCCTTAT AATGGTTTGC CACATTTAGT TGCACCGGTA 4320

ATTACAGATG TCAAAGCAGC TACACAGAGT TTAAAATGGG CCGTAGAAGA AATGGAACGA 4380

CGTTATAAGT TATTTGCACA TTACCCATGT ACGTAnTATA ACAGCATTTA ACnAAAAAGC 4440

CCCATATGAT GAAAGAATGn CAAAAATTGT CATTGTAaTT GATGAGTTGG CTGATTTAAT 4500

GATGATGGTC CGCAAGAAGT TG 4522
(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

TCAACTTTAC GGATACGTAT ATATTTTGCA TGACATTTAG TGCAATAATA TTCATAATTT 60

GCCCGTTGTT GATAGCTTTC AATGCTGTTA CAAAATCTAG GCGCTCCAAC CTGTTGGCTC 120

AATCGTTTAA AATCTTGATC TTTATGTTGA TAACCTTTAC CAGCAATATG CAAGTGATAA 180

TGACACAATT CGTGCAGTAT AATTTTTACA ACAGCATCTT CTCATAATG CTCATATTGT 240

TTTGGATTAA TTTCAATATC ATGGGACTTT AAAAGATAAC GTCCGCCTGT TGTACGTAAC 300 

CTTTTATTAA AATATGCACA ATGTCGAAAC GTACGTCCAA ATTTTTCTTC CGAAAGATTC 360 

TCAACCATTC GCTGAAGTTT GTCATTATTC ATGTGGATCA ATCATCGTTA ATGATACTTT 420 

GTCTTTATTT TTGTCAATAC TGTAAATCCA AACGTCAACG ATATCACCAA CACTGACAAT 480

ATCCATTGGA TTTTTTACGA ACTTCTTAGA AAGTTTCGAA ACATGGACAA GTCCATCTTG 540

TTTAACACCT ATATCTACAA ATGCTCCAAA ATCTACTACA TTTCTAACTG TACCACTCAA 600

TTTCATTCCT TCTTGTAAAT CTTC?ATTGA TAGCACATCG GATTTAAGGA TTGGTGTTTC 660

AAACTCGTCC CTTGGATCTC GATTAGGTGC GTTCAAGGAT TTAATAATAT CCTCTAATGT 720

AGGTACACCG ACTTGTAATT CAATCGCCAG T 751 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1076 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

TCTCCAGCTT TAACTTGATC TGGCACTTTA ACAATTGTCT GATCCATfiA TACGCGACCA 60 

ATAACTTCGC ATTGATGACC ATTTACATTT ACAAAGCTAC CTTGCATTAT GCGTAAATGG 120

CCATCTGCAT ATCCAATAgG TAACAATGCT ATTGTAGTTG GGTCAGTAGC TGTATAAGTT 180

GCACCATAAC TTACAGACTC ACCCGCTTGT AGCGTCTTTG TTTGAACTAC ATTAGCAATT 240

AATTGCACAC TTGGTTTAAG GTGTACTTTA ACTTTTTGCT GTACATACTC TGATGGATAA 300 

TATCCATAAA GGGAAATTCC TGGTCTTATT GCATTACAGA ATTGGCAATC CATTAATAGA 360

GAGCCTGCTG AGTTCTGACA ATGTATATAT TCAGGTTTAA TTGCTTCATT GACCATATCT 420

TTAAAACGTT GATATTGTTC AGTTGTCATA TCTCCTGGTT CGTCAGCACA GGCAAAGTGT 480

GTAAACACGC CTTCAAATAC AAGTTGCTCA TATTGTTGAA TGATTTCAAT CACTTCTTGA 540

TACGTTTTAG TATCTTTAAT ACCTAAACGT CCCATTCCTG TATCTAATTT AATGTGCAAC 600

CATAACTTTT TCTCTTGCTC ACCAGAAATG TTTTTAATTG CTTCTTTCAA CCACOTTTA 660

GACGGAACCG TTAAGGCAAC TCGGTGTTGT ATCGCTTTAT CAATATCTTT AGCTGGTAAC 720

ACACCTAAGA CTAAAATTTT AGCAGTAATC CCATGCATTC TAAGTTCTAT CGCTTCATCT 780

AACGTTGCTA CAGCAAAAAA TGTGGCGCCA TTTTCCATTA AATGACGTGC TACTTTAACA 840

CTACCTAGTC CATAGGCATT GGCTTTAACG ACAGCCATCA CTGTTTTATT TGGATGCAAT 900 

GTACTGAATA CTTTGAAATT TGATGCAACA GCGTTTAAAT CTACATTCAT ATACGCAGAT 960

CTATAATATT TATCCGACAT ATTACTTCCT CCTGTAATTC CCACACGTTT TAAAACTAGA 1020

TCTTAATTAT CATTGTATAA CAAATTTAAA ATGCTGAITT TTCTAAAACA ACTTGG 1076
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2930 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

TGACCACAAT GCCCAATACA ACCATCCCAT GGTAAAGCCA AGAGATGAGT CAATAAAGCG 60

TGTTGAATAA GAGCTGAATG AACCTGATAC TGGATAAAAT GTTGCCAACT CTCCAATTGA 120

TGACATTAAG AAATATAGCA TGACACCAAT AACAAGATAA GCGAGTATAG CGCCTCCAGG 180 

ACCAGCTTGA GAAATGATAT TACCAGTAGC TACAAATAGA CCAGTCCCAA TTGCACCACC 240

TATAGCAATC ATGGAAATGT GTCTTGAGTT AAGACTACGG TTCATTTTAT TATCTTCCAT 300 

ATTTAGTCTC CCATCTATTT AAATATACCC ATTATTGTAA GCTTTTTAAG TGTACTATTC 360

AATAACTATT TAGTACTGTA AAGCGAAAAA ATTAAAATTT TCTGATTTTT TAATCATCTT 420 

GAGCATGTTT AATTGTAATT TTGATGGGGT TAAATTATAA TATGTATTAA ATTATAATTA 480

TnATAAATTG TGGAGGGaTG ACTATGTCAC AACAAGACAA AAAGTTAACT GGTGTTTTTG 540

GGCATCCAGT ATCAGACCGA GAAAATAGTA TGACAGCAGG GCTAGGGGA CCTCTTTTAA 600

TGCAAGATAT TTACTTTTTA GAGCAAATGT CTCAATTTGA TAGAGAAGTA ATACCAGAAC 660 

GTCGAATGCA TGCCAAAGGT TCTGGTGCAT TTGGGACATT TACTGTAACT AAAGATATAA 720 

CAAAATATAC GAATGCTAAA AtATTCTCTG AAATAGGTAA GCAAACCGAA ATGTTTGCCC 780

GTTTCTCTAC TGTAGCAGGA GAACGTGGTG CTGCTGATGC GGAcGTGACA TTCGAGGATT 840 

TGCGTTAAAG TTCTACACTG AAGAAGGGAA CTGGGaTTTA GTAGGGAATA ACACACCaGT 900 

ATTCTTCTTT AGAGATCCAA AGTTATTTGT TAGTTTAAAT CGTGCGGTGA AACGAGATCC 960

TAGAACAAAT ATGAGAGATG CACAAAATAA CTGGGATTTC TGGaCGGGTt TCCAGAAGCA 1020

TTGCACCAAG TAACGATCTT AATGTCAGAT AGAGGGATTC CTAAAGATTT ACGTCATATG 1080

CATGGGTTCG GTTCTCACAC ATACTCTATG TATAATGATT CTGGTGAACG TGTTTGGGTT 1140

AAATTCCATT TTAGAACGCA ACAAGGTATT GAAAACTTAA CTGATGAAE AGCTGCTGAA 1200

ATTATAGCTA CAGATCGTGA TTCATCTCAA CGCGATTTAT TCGAAGCCAT TGAAAAAGGT 1260

GATTATCCAA AATGGACAAT GTATATTCAA GTAATGACTG AGGAACAAGC TAAAAACCAT 1320

AAAGATAATC CATTTGATTT AACAAAAGTA TGGTATCACG ATGAGTATCC TCTAATTGAA 1380

GTTGGAGAGT TTGAATTAAA TAGAAATCCA GATAATTACT TTATGGATGT TGAACAAGCT 1440

GCGTTTGCAC CAACTAATAT TATTCCAGGA TTAGATTTTT CTCCAGACAA AATGCTGCAA 1500 

GGGCGTTTAT TCTCATATGG CGATGCGCAA AGATATCGAT TAGGAGTTAA TCATTGGCAG 1560

ATTCCTGTAA ACCAACCTAA AGGTGTTGGT ATTGAAAATA TTTGTCCTTT TAGTAGAGAT 1620

GGTCAAATGC GCGTAGTTGA CAATAACCAA GGTGGAGGAA CACATTATTA TCCAAATAAC 1680

CATGGTAAAT TTGATTCTCA ACCTGAATAT AAAAAGCCAC CATTCCCAAC TGATGGATAC 1740

GGCTATGAAT ATAATCAACG TCAAGATGAT GATAATTATT TTGAACAACC AGGTA&TTG 1800

TTTAGATTAC AATCAGAGGA CGCTAAAGAA AGAATTTTTA CAAATACAGC AAATGCAATG 1860

GAAGGCGTAA CGGATGATGT TAAACGACGT CATATTCGTC ATTGTTACAA AGCTGACCCA 1920

GAATATGGTA AAGGTGTTGC AAAAGCATTA GGTATTGATA TAAATTCTAT TGATCTTGAA 1980

ACTGAAAATG ATGAAACATA CGAAAACTTT GAAAAATAAA TTTGATATGT AGTTTCTATA 2040

TTGCGTAGTT GAGCAGTTTA TGATATCATA ATAAATCGTA AAGATTCCTA ACAAGAGAGG 2100

GTGTTTAACG TGCGCGTAAA CGTAACATTA GCATGCACAG AATGTGGCGA TCGTAACTAT 2160 

ATCACTACTA AAAATAAACG TAATAATCCT GAGCGTATG AAATGAAAAA ATATTGCCCA 2220 

AGATTAAACA AATATACGTT ACATCGTGAA ACTAAGTAAT TCTTATCATT CAAATACGAC 2280

GATTTGAAAA TAAAGCGGGC TTACCTATTA TATTGGGGAG CTCGCTTTTT TATGAAATTT 2340

TTGTGAAGAG TGATTAATGG ATTGAGTTTC ATCGGTAGAA CAATATATGA TTATATTAGT 2400

TGTTACTTTA TTAAAaTTTG AGAATATTTA TAGAAGGAAA TAGATTACTG ATTTTATAAA 2460

GTCACTTTGT TAGCGAATGC TTGAAAGAGT ATTTAATATA GTAGAATTTA AAATTTCAAA 2520

GCGGAATTTA ATAAGTACGA AGTAGTTCTG GGTATGTTTT ATAAATGTTC GATAATACAC 2580 

TTTAATCTTA AATATGATGG TTTAGAAAAT GATTTAACAA AGAAATGAaA CTTTACTGTT 2640

GAATTATGTG AGGATTGTGT TATTATATAA ATCGTAATAA TTACGATTTG ATAAAAAGTG 2700

AGGTAACTAT ATATGGCTAA GAAATCTAAA ATAGCAAAAG AGAGAAAAAG AGAAGAGTTA 2760

GTAAATAAAT ATTACGAATT ACGTAAAGAG TTAAAAGCAA AAGGEATTA CGAAGCGTTA 2820

AGAAAATTAC CAAGAGATTC ATCACCTACA CGTTTAACTA GAAGATGTAA AGTAACTGGA 2880

AGACCTAGAG GTGTATTACG TAAATTTGAA ATGTCTCGTA TTGCGTTTAG 2930
(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3606 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

CTTCTTGCCA TGGCTCTCTT TATTTAAAAA TGCTTCCAAC TTGTCCATTT GATTGTTTCT 60 

CACTCATAGT CATCACTTCC ATTTTGTTTT TCAATACACA TTGTATTATA TATCCTAATT 120

TTATAAAAAA CTAATTTTAC AAATGCTTTT GCGTTCTTAC AAAAAATGCA TTTGACTATT 180

ATTATAATAA GCGTATAATT GTCGCATATT ATTTTTTGTA TTTTTGGCAA TAACGAAGGA 240

GTATTTATGA ATAAAGACAA GCAATTGCAC AACGACAAAA TCAATCTATC CCAATTAGTC 300 

TTATTAGGGT TAGGCTCTTT AATAGGATCT GGTTGGCTAT TTGGTGCGTG GGAAGCATCA 360 

TCAATAGCTG GACCAGCAGC AATCATATCA TGGGTTCTTG GATTCCTAGT CATTGGAACC 420

ATTGCCTATA ACTACATTGA AATCGGCACA ATGTTTCCTC AATCAGGTGG CATGAGTAAC 480

TATGCCCAGT ATACACATGG CTCATTATTA GGCTTTATTG CTGCTTGGGC GAATTGGGTG 540

TCTTTGGTGA CAATAATACC TATCGAAGCT GTGTCAGCTG TTCAATATAT GAGTTCTTGG 600 

CCGTGGCATT GGGCGAAACC AATGAGATAT TTAATGGAAA ATGGCTCTAT TAGCACATAC 660

GGATTGCTAG CTGTATATCT CATCATTGTT ATTTTTTCAT TATTAAACTA TTGGTCCGTA 720 

AAACTTTTAA CATCATTTAC GAGTTTAATT TCTGTATTTA AATTAGGCGT ACCCATGTTA 780

ACCATCATCA TGTTGATGCT ATCAGGATTC GACACTTCAA ATTACGGCCA TTCGGCAAGC 840

ACATTTATGC CTTACGGAAG TGCACCGATT TlTGCTGCAA CAACAGCATC AGGGATTATT 900 

TTTTCATTCA ATTCATTCCA GACAATTATT AATATGGGTT CAGAAATTAA AAATCCTGAA 960 

AAAAATATCG CAAGAGGCAT CGCTATCTCA CTGTCAATCA GTGCAGTGTT GTACATCATT 1020 

TTACAAAGTA CGTTTATCAC TTCTATGCCT CAATCAATGT TACAACATAG TGGATGSAT 1080

GGCATCAACT TCAATTCACC ATTTGCTGAT TTAGCTATCT TATTAGGAAT TAATTGGCTC 1140 

GCAATTTTAC TATACATTGA AGCTTTTGTA TCACCATTCG GTACTGGCGT GTCATTTGTC 1200 

GCCGTTACAG GTCGAGTTTT ACGAGCAATG GAGAAAAATG GACATATCCC TAAATTTCTT 1260

GGGAAGATGA ATGAAAAATA TCATATCCCA CGTGTAGCAA TCATCTTTAA TGCCATCATT 1320 

AGTATGATTA TGGTTACATT ATTTAGAGAT TGGGGTACGC TAGCAGCAGT TATTTCTACT 1380 

GCAACTTTAG TAGCCTATTT AACTGGCCCA ACGACAGTGA TTGCATTAAG AAAAATGGGA 1440

CCAACAATGA CTCGTCCATT TAGAGCAAAA ATTTTAAAS TAATGGCACC ATTATCATTT 1500

GTATTAGCTT CATTAGCTAT ATATTGGGCA ATGTGGCCAA CAACGGCTGA AGTTATTTTA 1560

ATCATTATAC TTGGATTACC AATCTACTTC TTCTATGAAT ATCGTATGAA TTGGCGTAAT 1620

ACAAAGAAAC AAATTGGTGG TAGCTTATGG ATTATTGTAT ATTTAATCGT GCTATCAATA 1680 

CTGTCATTTA TAGGAAGCAA AGAATTTAAA GGCTTAAATA TGATTCACTA TCCATTTGAC 1740

TTTATCGTTA TTATTATTGT GGCACTTATC TTCTATTACA TCGGTACAAC GAGTTCATTT 1800

GAAAGCGTCT ATTTCCGTCG CGCAACACGA ATCAATACGA AGATGCGTGA GTCACTAAAT 1860

AATGAATCAA AGTCATCTCA TTAAATATTT TAAAATACAG TGCCATATAT GATTTCAACA 1920

CACACACATT AACCAACCAT TGATTTCAAC ATCTTGGTTG GTTTTTTATT TTGAAAATCG 1980

GTTATAAATA ACTAACATAA CAAGATGATG ATCAGGCTGG GACATAAATC AATGTTCTAT 2040

GCTCTACGAA gTTATATTGG CAGTAGTTGA CTGAACGAAA ATGCGCETGT AACAAGCTTT 2100

TTTCGATTCT AGTCAGGGGC CCCAACACAG AGAATTTCGA AAAGAAATTC TACAGGCAAT 2160 

GCAAGTTGGG GTGGGACGAC GATAAAGAAA TACTTTTTCT ATAGAAATTA GTATytCTTA 2220

TGCATGAGTT TTACTCATGT ATTCATATTT TTAAGTACAC ATTAGCTGTG GCTAATGTAT 2280

AAGAACCACT ACATAATAAA TCATTTGTGG CTCTTTATCA TTTCTGTCCC ACTCCCGTAG 2340

AAGTACATCA TATAATGCTG AAAATGGTTT GAGTTAAAAC AGATATCAAG CTCGTCTGAT 2400

TCAGTCACAA AATTGTCTTG TTATACTTGT CACCTATCAT CTATAGACCG TGGTATGATT 2460

AAATTGGGGA TGATAAAGGA GGTTAATffiA TATGAAGATT AATACTACAG GTGGTCAAAT 2520

TCATGGTATT ACACAAGATG GTTTAGATAT CTTCTTAGGC ATTCCTTATG CAGAACCACC 2580

AGTTCATGAC AATCGCTTTA AACATTCTAC GTTAAAAACA CAATGGTCAG AGCCAATTGA 2640

TGCAACTGAA ATACAACCCA TCCCACCGCA ACCAGACAAC AAATTAGAAG ATTTTTCTC 2700

CTCACAATCT ACAACTTTTA CTGAACATGA AGACTGTTTA TATCTAAATA TTTGGAAACA 2760

ACATAATGAT CAGACGAAGA AACCTGTCAT CATTTATTTT TATGGTGGTA GTTTTGAAAA 2820

TGGTCATGGT ACAGCCGAAC TCTATCAACC GGCACATTTA GTACAAAATA ACGACATTAT 2880

CGTTATTACA TGCAATTATC GTTTAGGCGC ATTAGGATAT TTAGACTGGT CATATTTTAA 2940

TAAAGATTTT CATTCCAATA ATGGCCTTTC AGATCAAATC AATGTCATAA AATGGGTGCA 3000

TCAATTTATT GAATCCTTCG GTGGCGACGC TAATAACATT ACTTTAATGG GTCAGTCTGC 3060

AGGCAGTATG AGCATTTTGA CTTTACTTAA AATACTGAC ATTGAGCCAT ACTTCCATAA 3120 

AGTCGTTCTA CTAAGTGGCG CACTACGATT AGACACCCTT GAGAGTGCAC GCAATAAAGC 3180

ACAACATTTC CAAAAAATGA TGCTCGATTA TTTAGATACA GATGATGTTA CATCATTATC 3240

GACAAATGAT ATTCTTATGC TGATGGCGAA gcTAAAACAA TCTCGAGGAC CTTCTAAAGG 3300

GCTTGATTTA ATATATGCGC CTATTAAAAC AGATTATATA CAAAATAATT ATCCAACAAC 3360

GAAACCAATT TTTGCATGTT ATACAAAAGA TGAAGGCGAT ATTTATATTA CTAGTGAACA 3420

GAAAAAATTA TCGCCGCAAC GCTTTATCGA CATTATGGAA TTAAATGATA TTCCTTTAAA 3480

ATACGAAGAT GTTCAGACGG CGAAGcAACA ATCTTTAGCG ATTACACATT GTTATTTCaA 3540

ACAGCCGATG aAGCAATTTT TACmACmACT CAATATACmA GATTCCAACC GCACCAACTA 3600 

TGGCTT 3606 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15109 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

GAAATTAAAA AAGCAATTGG nACAAGATGC AACAGTGTCA TTGTTTGATG AATTTGATAA 60 

AAAATTATAC ACTTACGGCG ATAACTGGGG TCGTGGTGGA GAAGTATTAT ATCAAGCATT 120

TGGTTTGAAA ATGCAACsAG AACAACAAAA GTTAACTGCA AAAGCAGGTT GGGCTGAAGT 180

GAAACAAGAA GAAATTGAAA AATATGCTGG TGATTACATT GTGAGTACAA GTGAAGGTAA 240

ACCTACACCA GGATACGAAT CAACAAACAT GTGGaAGAAT TTGAAAGCTA CTAAAGAAGG 300

ACATATTGTT AAAGTTGATG CTGGTACATA CTGGTACAAC GATCCTTATA CATTAGATTT 360

CATGCGTAAA GATTTAAAAG AmAAATTAAT TAAAGCTGCA AAATAATTCA GCTATATAAG 420

TTAGTGAAAT GAGAGTCTGA AACATATCAA TCTTTTGATA TTGTATTAGG CTCTTATTTT 480

TATAGCTAGA AAGTTAGATA TTTGTATTTT TTTAAATAAT AAGTGCCGTT GTTATCGTTC 540

AATTTAATTA ATGATAGATT AGTATTATTA TAGCTAAAGT AGTATACCTG AGAAAATAGC 600

TCAATGTATC TCTTTATTAA TAAGTTATAT CATAATTATT TTAGTGATA CTTTATGGAA 660

GGGATATCAG GGAATGGCTT TCAATTAAAG AAGAGGTTTA AAAGGATTAC AACAGAATGT 720

TATGATTTTG TAGAAAGATA TATAACAACG TTTTATAAAA ACATAATATT GTTAATGGAA 780

AATGAAATGT AAGGGGGATT TCGAGTGACT AAGAAAGTTT ATTTTAACCA CGATGGTGGT 840

GTAGATGATT TAGTATCTCT ATTTTTATTA TTACAAATGG AAAACGTTCA ATTGATAGGG 900

GTCAGTACAA TTGGTGCTGA TTGTTATTTA GAGCCATCTT TGAGCGCATC AGTAAAAATT 960

ATTAATCGTT TTTCAAATGA AGATATTCAA GTTGCGCCAT CATATGAACG AGGAAAAAAT 1020 

CCATTTCCTA AAGAATGGCG TATGCATCEC TTTTTTATGG ACGCATTGCC AATTTTAAAT 1080

GAGCCAGTCA AACATGTTGC TTCAAATGTG AGCGACAAAG AAGCCTTTGA AGACATTATT 1140

CAAACTTTAA AGAGACAATC AGAAAAAGTA ACATTATTAT TTACAGGCCC GCTTACAGAT 1200

TTAGCAAAAG CACTACAAAA AGATTCATCT ATCGTTCAGT ATATAGAAAA ATffGTTTGG 1260

ATGGGTGGCA CCTTTTTACC AAAAGGAAAT GTTGAAGAAC CTGAGCATGA TGGTTCTGCA 1320

GAATGGAATG CATATTGGGA TCCAGAAGCG GTTAAAATTG TTTTTGATAG CGATATAGAG 1380

ATTGATATGG TTGCTTTAGA AAGTACGAAT CAAGTACCGC TAACGTTAGA TGTTAGACAA 1440

AGATGGGCGA ATGAACGCCA ATATACGGGC ATTGATTTTT TAGGAGTAAG TTATGCAGCG 1500

GTACCACCAT TAACACACTT TATAACAAAT TCTACTTACT TTTTATGGGA TGTTTTAACG 1560

ACTGCTTATA TTGGTAACAA GGACTTGGTT CATTCAATTG AGAAAAAAGT CGATGTAATA 1620

AGTTATGGAC CAAGTCAAGG TAAGACATTT GAGTffAAAG ATGGGCGCAA AATTAATGTC 1680

ATAAATCATG TAGATAACAA CGCATTTTTT GATTATATAA CTGCACTTGC TAAAAAAGTA 1740

AATTAACAGC TGTGTAGAAT AATTAAGGTT TTAATTTATA TAGAACAACT TATTGTAAAC 1800 

TTTTCATTTC TTAAAGTTTA CAATGGTGCT ATAATAATGG TCATGAAATA CGAAAGGAAG 18 60 

TAAAAAATGA CAACAAAACA GTTAGTATAT ACAGCTTTAA TGACAGCGAT TATCGCTATT 192 0 

TTAGGATTGG TACCGGTAAT TCCACTACCA TTTTCTTCAG TACCAATTGT ACTTCAAAAC 198 0 

ATTGGTATTT TCTTAGCAGG TGCGATTTTA GGACGTAAAT ATGGCACATT AAGTGTTATC 204 0 

GTCTTTTTAT TATTAGTAGT TGCTGGCTTG CCATTGTTAT CAGGTGGTCG CGGTGGCATC 2100 

GGTGTATTCG CAGGTCCTTC AGCAGGGTTT TTACTATTAT ATCCAGTTGT AGCATTCATG 2160 

ATTGGGGCGA TTCGAGATAG ATTCATCAAT GAAATTAATT TCTGGATTTT ATTCGTTGGT 2220 

ATTTTAGTTT TTGGTGTTAT AGCATTAGAT GTTATTGGTA CffiTGATTAT GGGCATGATT 2280

ATTAACATAC CATTTACGAA AGCTATTTCA ATTTCATTAG CTTATTTGCC TGGTGATATA 2340

TTAAAAGCAA TTGTAGCAAG TTTGATTGGT ACAGCTTTAC TTAATCACTC GCAGTTTCGT 2400

CAAATTATGG GAATAAAATA ATCATATTTA AGATAGTAAA GTAATTGAAT AAGTTGCTTT 2460

GAAATTTATA AAAGTGAAAG GAGTAGGTGT CAATGGCTAG TATAAGTATG TCAGATATAT 2520

ATTGTAACGG CACTATATTT GAAAATGACG ACGAGCAGTT GATTTATTTA ACGCCTTCTT 2580

TTCCACAACG ATACACAAGT AACACATGGA TATATAAAAA GACGCCTACC CAAGAGCGAT 2640

GGCTGAAAGA CTTAGAACGT CAAOYTCAAT TACATACAAA TCAAGGTTCA AATCATTATG 2700

CGTTTAGTTT CCCGGAAAAT GAACAACTTG ATAATCATTG GATGGCTATG TTTAAAGATA 2760

TGAATTTTGA ACTAGGTATT ATGGAATTGT ATGCCATAGA AAGTGATGCG CTTGCCAATT 2820

TGCCGCGTAA CTCTGACGTT GAAATTGCCA TCGTTGACGA GTCGCATATA GATGCCTATT 2880

TAAAAGTTGC ATATCAGTTT AGTTTGCCAT TTGGAAAAGA CTATGCAGAT GCACATGAAG 2940

AAATGGTAAG GGAACATTAT CAAAAAGATG TGATTAAACG CTTAGTAGCT TATTTAAATA 3000

ATGAACCTAT TGGCGTTGTA GATGTCATTG AAAGTGAAAA TTACATTGAA TTAGATGGAT 30 60 

TTGGTGTATT AGAACAATTT CGGCACCAAG GAATTGGATC TACAATTCAA TCGTTGATAG 312 0 

GTGAATACGC CATATCAAAA AATCACAAAC CAATCATATT AG T T G C AG AT GGTGAAGATA 3180 

CAGCAAAAGA TATGTATGCA AAGCAAGGTT ATGTCTATCA ATCGTTTTGT TATCAAATAT 32 4 0 

TAAAAGAAGA TATTGGAAAT TAACGATTGA T5ATAGGTTT GTATTAGAAT AAAAAAGCAG 3300 

TAAGCTGGTT TCGAGTAGAA ATCAACTTAC TGCTTTTTAA ATTGTTTTGA GCTACTTATA 3360 

CTTATAAAAA TAGTGCGTTT AAATTGTTGA TTCATGTAGA ATATCGTTCA TTATGACACA 34 20 

CTATAATGAA TATGTTATTG TTCAGAATCA ATGATACGTT CTGGATGACT GTATATATA 34 8 0 

AAGCCACCAT TTCGAATAAA TCCAACTGCC GTAATATTTA GGTCATTAGC TAAGGTTACA 354 0 

GCAAGCGTTG TCGGAGCTGA TTTAGATAAA ATGACGCCAA CACCAATTTT TGCGGCTTTA 3600 

ATTAAAATTT CTGATGAAAT ACGTCCACTA AAAATTAATA CTTTATCTCG GACAGTAATA 3660 

TGTCGCTGAA TACAAAATCC ATATAATTTA TCTAGAGCGT TATGTCTACC AATGTCTTGT 3720 

CGATGTACAA AAAATGTCAA ACCATCGCTT ATAGCAGCAT TATGTAAGCC ACCTGTTTCT 37 8 0 

TGGTAAATAT GACTTGCACT TTGTAATCGA GTCATCATGT TAATAATTTG CATTGGAGTT 38 4 0 

AAAGTGATTT TAGACATAGA TGTTTTAGCG ATAGCAGCff CATTTTGAAA ATAAAACTCA 3 900 

CGACTCTTTC CGCAACAAGA TGCAATCATT CGTTTTGTGG AAT AT T G AAA GCGATCGCCT 3 9 60 

AAATCTTTAT TAAGTTCAAC ATGGGCAAAA CCTTTACTAT CATCAATCAG TACAGATTTT 4 020 

AATTCATCTC GCTTTAAAAT GGCACCTTCC GAAGCCAGAA ATCCAATGAC TAACTCCTCA 4 08 0 

AGGTTTGTTG GACTGCATAT AACAGTCGCA AATTCTTCAC CATTCACCAT AATTGTAAGT 414 0 

GGAAATTCTG TCACATATTG ATCTGTTGTA TTGAATAATT TTCCATCTTC ATATCTAACA 4 2 00 

ATTGGTTGAC C T AAAGAT AC ATCTTTGTTC ATTATCTAAC CCCTTTAATT AGCTTAAACT 4 2 60 

TTATTTTAAA GCAATTTGCT TAAAATTTTA ACATATTTGC TTAAGTTTGA AATTTGATTG 4 32 0 

ATAAAAATTA ATAGCGAGCA ATCTGTTTGA TTTAAATTGA ATTCGAGAAT ATACATACTA 4 380 

GGGCATCAAT TAATAAATAT CAATCTTATG CAAATTTGAC AATTGTTTGA ATCAATATAT 4 4 40 

AAACAGGCAA CGGTTCTTTT CAAATATAAT AGTAAGTGTA TAATGAAAT GTAAATATTA 4 500 

TTAAAAATGG GGGTTCACTC AATGAAATTG AAACGTTTAT TTGCTGTTGT GATTGCAATG 4 5 60 

CTTTTAGTAT TAGCTGGTTG CTCTAATTCT AACGATAATA ATGAAAGTAA AAAAGATGAC 4 620 

GCAGACAATG GTAAGAAACA AGAGATTCAA GTTGCAGCGG CAGCAAGTTT AACAGATGTA 4680

ACCAAGAAAT TAGCTTCAGA ATTTAAAAAA GAGCATAAAA ATGCTGATAT TAAATTTAAC 4740

TATGGTGGAT CAGGGGCATT AAGAAAACAA ATTGAATCAG GCGCACCTGT TGACGTATTT 4800

ATGTCTGCAA ATACTAAAGA TGTAGATGCA TTAAAAGACA AGAATAAAGC GCATGATACA 4860

TATAAATATG CGAAAAATAG TCTAGTATf A ATTGGTGATA AAGATTCAAA TTACACTTCA 4920

GTAAAAGACT TAAAAGACAA TGATAAATTA GCATTAGGTG AAGTGAAAAC TGTACCAGCA 4980

GGAAAATATG CGAAACAGTA TTTAGATAAC AATAACTTAT TTAAAGAAGT CGAAAGTAAA 5040

ATCGTTTATG CTAAAGATGT AAAACAAGTA TTAAATTATG TTGAAAAGGG TAAGCGAAA 5100 

CAAGGTTTTG TGTATAAAAC TGACTTATAT AAACAAAATA AAAAAATTGA TACTGTAAAA 5160 

GTAATTAAAG AAGTAGAACT TAAGAAGCCA ATCACATACG AAGCTGGTGC TACATCAGAT 5220 

AGTAAATTAG CAAAAGAGTG GATGGAATTC TTAAAATCAG ATAAAGCTAA AGAAATACTA 528 0 

AAAGAAT AC C ACTTTGCAGC ATAAGGAGTT GTAATCCATG CCTGACTTAA CACCTTTTTG 534 0 

GATATCAATA CGAGTTGCTG TAATCAGTAC GATTATTGTA ACGGTTTTAG GTATTTTTAT 54 00 

ATCTAAATGG TTGTATCGTC GTAAGGGTTC GTGGGTTAAA GTATTGGAAA GTTTATTGAT 54 60 

ATTACCTATT GTTTTGCCGC CAACGGTATT AGGTTTATT CTATTAATCA TCTTCTCGCC 5520 

AAGAGGACCA ATCGGTCAAT TCTTTGCGAA TGTACTACAT TTACCTGTAG TGTTCACTTT 5580

GACAGGTGCT GTGATAGCAT CTGTCATTGT TAGTTTTCCA CTAATGTATC AAC AT AC T G T 5 64 0 

GCAAGGCTTC AGAGGTATAG AC AC G AAAAT GATTAATACA G C TAG AAC GA TGGGAGCAAG 57 00 

T G AAAC G AAA ATTTTCCTCA AATTAATTTT ACCATTAGCT AAACGCTCTA TTTTAGCAGG 57 60 

TATAATGATG AGTTTTGCTC GTGCATTAGG TGAGTTTGGT GCTACATTAA TGGTTGCAGG 5820 

ATATATTCCA AATAAAACGA ATACACTACC TTTAGAAATA TACTTCTTAG T G G AAC AAG G 588 0 

TAGAGAAAAT GAAGCGTGGT TATGGGTATT AGTGCTAGTC GCATTCTCTA TTGTGGTTAT 5 94 0 

ATCTACAATT AATTTATTGA ATAAAGATAA ATATAAGGAG GTCGACTAGA TGCTTAAAAT 6000 

CAATGTGAAA TATCAATTAA AG AAC AC T T T AATTCGCATC AATATAGATG ATACTGAACC 6060 

AAAAATTTAT GCAGTTCGTG GTCCATCTGG CATTGGTAAA AdACTGTTT TAAATATGAT 6120 

TGCCGGATTA CGTAAAGCAG ATGAAGCTAT TATCGAAGTG AATGGGCAAT TACTTACTGA 6180 

TACGGCAAAA AACGTGAATG TTAAAATTCA ACAACGACGT ATTGGATATC TGTTTCAAGA 624 0 

CTACCAATTG TTTCCTAATA TGACGGTCTA T AAAAAT ATT ACTTTTATGG CTGAACCATC 6300 

T G AAC AC AT C GATCAATTAA TTCAAACTTT AAAC AT T GAT CATTTGATGA AACAATATCC 6360 

TAT G AC AT T G TCAGGTGGAG AGGCACAACG TGTAGCACTT GCACGTGCAC TTAGCACrAA 64 2 0 

ACCAGATTTA ATTTTATTAG ATGAACCTTT TTCTAGTTTG GATGATACTA C AAAAG AT G A 64 8 0 

GAGTATTACA TTAGTTAAAC GTATTTTCAA CGAATGGCAA ATACCAATCA TATTTGTGAC 6540

ACATTCAAAC TATGAAGCAG AACAAATGGC TCATGAAATT ATTACAATTG GGTAATCATT 6600

TATTTGCCAT TAAAGAGTTT AGAACGTATT TAAAATTGTA GAAGTGAATG CTTCTATCAG 6660

CATTTTAATG ATGTTTTAAA CTCTTTTTTA GGGGCAGTTT TTTTGAGAGA CATTGACGCG 6720

CGTCATATAA TGAAAGTAAT GATAAAAAGA AAGGATAACT TAATGTGAGT CAAGAACGTT 6780

ATTCAAGGCA AATTTTATTT AAACAAATAG GTGAAATAGG TCAAAGCAAA ATAAATCAAA 6840

AATGTGCGTT GATTATTGGT ATGGGCGCAT TAGGTACACA TGTGGCCGAA GGACTTGTTA 6900 

GAGCAGGCAT TGCCAAACTA ATCATTGTTG AT AG AG AT T A TATTGAATTT AGTAATTTAC 6960 

AAAGACAAAC ATTGTTTACT GAAGAAGATG CTTTGAAAAT GATGCCTAAG GTGGTTGCAG 7 02 0 

C T AAAAAGC A TTTGCTAGCG TTACGTAGTG ATGTTGATAT T GAT GAT TAT ATTGCCCATG 7080 

TGGATTATTA TTTTTTGGAA ACACATGGAC iGGACGTTGA CGTTATTATT GATGCAACCG 714 0 

ATAACTTTGA AACACGACAA CTGATTAATG ATTTTGCATA TAAATATCGT ATACCTTGGA 7200 

TTTATGGTGG TGTTGTACAG AGTACATATA CAGAAGCTGC ATTTATACCT GGTAAAACAC 7 2 60 

CTTGCTTTAA CTGTTTGGTA CCACAATTGC CAGCATTAAA TTTAACATGT GATACAEAG 7 32 0 

GGGTCATTCA ACCTGCCGTG ACGATGGCAA CAAGTTTACA ATTAAGAGAT GCGATGAAAG 7 380 

TATTAACGGA ACAACCAATT GACACAAAAA TAACTTATGG CGATATTTGG GAAGGTAGTC 74 4 0 

ATTATTCATT TGGTTTCAGT AAAAT GCAAC GTTCAGACTG TACAACTTGT GGAGATGTAC 7500 

CAAGTTATCC GTATTTAAAC AAGAATGAAC AACGTTATGC AACATTGTGT G G TAG AG AC A 7 5 60 

CTGTACAGTA TGAAAATGCA TCAATTACAC ACGACATTCT TGTTCAATTT TTAAAACAAC 7 62 0 

ATCAGTTAAA TTATCGCAGT AATTCGTATA TGGTTATGTT TGAATTTAAA GGACACCGCA 7680 

TTGTTGCTTT TAAAGGTGGA AGGTTTTTAA TACATGGCff GACACGCACA TCAGATGCCA 7740 

CACATCTAAT GAATTTATTG TTTGGATAAA AAAAGATAAG ACAAAAGGAG TGTAATATTA 7 8 00 

TGGGCGAACA TCAAAACGTT AAATTGAATC GTACAGTTAA AGCAGCCGTA CTAACGGTAT 7 8 60 

CAGATACTAG AGACTTTGAT ACAGATAAAG GTGGTCAATG CGTGCGCCAA CTATTACAAG 7 92 0 

C AG AT G AC G T TGAAGTGAGT GACGCACATT ATACAATTGT GAAAGATGAA AAAGTAGCCA 7 98 0 

TCACGACGCA GGTGAAGAAG TGGTTAGAAG AAGATATTGA TGTCATCATT ACGACTGGTG 804 0 

GAACAGGTAT TGCACAACGT GATGTGACGA TTGAAGCAGT AAAACCACTT TTAACTAAAG 8100 

AGATAGAAGG CTTTGGGGAA TTGTTTAGAT ATTTGAGTTA TGTTGAAGAT GTTGGCACGC 8160 

GTGCATTATT GTCTCGTGCT GTAGCAGGTA CAGTTAATAA TAAATTGATA TTTTCGATTC 8220 

CAGGATCAAC AGGCGCAGTT AAATTAGCAT TAGAAAAGCT CATTAAACCA GAATTAAATC 8280 

ATCTGATTCA TGAGCTTACA AAATAATTTA TTGATTTGAT TGGCGTGAA AATCTCCAGA 834 0 

TTTACCGCCA GACTTGCTTT CAAGGTAGGT TTCGCCAATA ATCATACCTT TATCAACTGC 8400

TTTCGTCATG TCGTAAATGG TTAAAGCCGT TGCTGATGCA GCGGTTAAAG CTTCCATTTC 8460

AACACCGGTT TTGCCAGTTG TAGAGACAGT TGTTTGAATG TTTAAAGTAT AAAGGGGTGC 8520

ATTTGTTTCA TCCCAGCTGA AGTGAACATC TATGCCAGTC AATGGTAATG GATGGCACAT 8580

CGGAATAAGT GTTGATGTAT TTTTGGCAGC CATAATACCA GCGATTTGAG CAGTGTTCAA 8640

TACATTGCCT TTGGTATTGG TGTTATTAGA AATCTGCTTA AAAATAGTTT CATTTACAGT 8700 

AATGCTTGAA TGAGCGACAG CAGTTCT"ITT TGTAATTTGT TTGTCTGATA CATCGACCAT 87 60 

TTTGGCGTGG CCTTGTTGAT TAATATGAGT AAACTCAGTC ATTTTACCCC TCCTAGTGCA 8820 

TCTAGTATAT CATGAAAAAA TAAAAGTTTT GGAGATGATT TTTAATGGTA GTAGAAAAAA 8880 

GAAACCCAAT CCCAGTTAAA GAAGCAATTC AACGTATCGT TAATCAGCAG AGTCAATGC 8 94 0 

CGGCAATTAC GGTAGCACTT GAAAAAAGTC TAAATCATAT CTTAGCAGAA GATATTGTAG 9000 

C T AC T T AT GA TATACCAAGG TTTGATAAAT CACCTTATGA TGGTTTTGCA ATTCGCAGTG 90 60 

TTGATTCACA AGGGGCAAGT GGTCAGAATC GCATTGAGTT TAAAGTGATT GATCATATTG 9120 

GTGCAGGTTC AGTTTCTGAT AAATTAGTTG GGGATCACGA AGCGGTGCGT ATTATGACTG 9180 

GAGCACAAAT ACCTAATGGC GCAGATGCTG TTGTTATGTT TGAACAAACG ATTGAACTAG 92 4 0 

AAG AT AC AT T TACAATTCGT AAACCATTTT CAAAAAATGA AAATATATCT TTAAAAGGTG 9300 

AAGAAACAAA GACAGGCGAT GTTGTTCTAA AAAASGACA AGTAATTAAT CCAGGGGCTA 9360 

TCGCGGTCCT TGCAACATAT GGCTATGCAG AGGTTAAAGT TAT T AAGCAA CCGAGTGTCG 9420 

CTGTTATTGC AACAGGAAGC GAAT TAT TAG ATGTTAATGA TGTATTAGAA GATGGGAAAA 94 80 

TTCGTAACTC TAATGGCCCA ATGATTCGTG CCTTAGCAGA AAAATTAGGT CTTGAAGTTG 95 4 0 

GTATTTACAA AACACAAAAA GATGATTTAG ATAGTGGCAT CCAAGTCGTT AAAGAAGCTA 9600 

TGGAAAAACA TGATATCGTT ATTACAACGG GCGGAGTTTC TGTTGGAGAT TTTGACTATT 9660 

TACCTGAGAT TTATAAGGCT GTAAAGGCGG AAGTGTTATT TAATAAAGTA GCAATGCGTC 9720 

CTGGTAGCGT AACAACG3TT GCATTTGTAG ATGGaAAGTA TTTGTTTGGa TTATCTGGAA 97 8 0 

ATCCATCAGC TTGTTTTACA GGATTTGAAC TATTTGTGAA nCCAGCTGTT AAACATATGT 98 4 0 

GTGGCGCACT AGAAGTCTTC CCGCAAATAA TTAAAGCAAC ATTAATGGAA GATTTTACCA 9900 

AGGCAAACCC ATTCACACGA TTTATACGTG CTAAAGCAAC GTAACAAGT GCTGGAGCTA 9 9 60 

CTGTAGTACC TTCAGGATTC AATAAATCAG GTGCGGTTGT AGCGATTGCA CATGCTAACT 10020 

GTATGGTCAT GTTACCAGGA GGGTCACGTG GTTTTAAAGC GGGGCATACA GTAGATATTA 1008 0 

TATTGACTGA ATCTGACGCT GCTGAAGAGG AACTTCTTTT ATGATTTTAC AAATTGTAGG 10140

TTACAAAAAG TCTGGTAAGA CAACATTGAT GAGGCATATT GTCTCTTTCT TAAAGTCACA 10200

TGGTTATACA GTTGCTACTA TTAAACATCA TGGGCATGGT AAGGAAGATA TTCAATTACA 10260

GGATTCAGAC GTCGATCACA TGAAGCATTT TGAAGCGGGG GCAGATCAAA GTATTGTACA 10320

AGGTTTTCAA TATCAGCAAA CTGTAACACG TGTAGATAAT CAAAATCTTA CTCAAATTAT 10380

TGAAAAATCT GTTACAATTG ACACCAATAT CGTATTAGTT GAAGGCTTTA AAAATGCTGA 10440

TTTTGAAAAA GTCGTAGTCT ATCGAAATGA AGAAGAGTTG CAAGTATTAC AACAATTGTC 10500 

GAATGTTTGT T AT AG CAT T A ATGTAAGGGA GCATGAAGAT TTTACAGCATTTGAGCAATG 10560 

GTTATTAAAT AAAATTAAAA ATGATTGTGA TACACAATTA ACATAGAGGA TTGAAATGAA 10620 

TGAAACAATT TGAAATCGTG AC AGAAC C G A TACAAACAGA ACAATATCGT GAATTCACTA 10 680 

TAAATGAATA TCAAGGTGCA GTAGTTGTTT TTACCGGTCA TGTTCGCGAA TGGACTAAAG 10740 

GCGTCAAAAC GGAATATTTA GAATATGAAG CGTATATTCC AATGGCTGAA AAGAAATTGG 108 00 

CACAAATTGG AGATGAAATA AATGAAAAAT GGCCTGGAAC GATAACGAGT ATTGTTCATA 108 60 

GAATAGGGCC ATTACAAATT TCAGATATCG CTGTATTAAT TGCGGTTTCT TCACCGCATC 10920 

GTAAAGATGC CTATCGAGCA AATGAATATG (AATTGAGCG TATAAAAGAA ATTGTTCCGA 10 980 

TTTGGAAAAA AGAAATTTGG GAAGATGGTT CAAAATGGCA AGGGCATCAA AAAGGGAATT 1104 0 

ATGAAGAAGC AAAGAGGGAG GAATAAGAGA GATGAAGGTA CTTTACTTCG CAGAAATTAA 11100 

AGATATATTA CAAAAAGCAC AGGAAGATAT TGTGCTTGAA CAAGCATTGA CTGTACffiCA 11160 

ATTTGAAGAT TTATTGTTTG AACGTTATCC GCAAATCAAT AATAAAAAGT TTCAAGTTGC 11220 

TGTAAATGAG GAATTTGTAC AAAAATCGGA TTTCATTCAA CCTAATGATA CTGTTGCATT 11280 

AATTCCACCG GTTAGTGGAG GTTAAGGGAG CAT GAAAGC A ATAATTCTTG CAGGTGGTCA 11340 

TTCAGTGCGA TTTGGTAAGC CCAAAGCTTT TGCGGAAGTG AACGGTGAGA CCTTTTATAG 11400 

TAGAGTAATT AAG AC AT TAG AAT C AAC AAA TATGTTCAAT GAAATTATTA T TAG T AC AAA 114 60 

TGCGCAATTG GCAACGCAAT TTAAATATCC AAATGTTGTT ATAGATGATG AGAATCATAA 11520 

TGATAAAGGT CCATTAGCAG GAATTTATAC AAT CAT G AH CAACATCCTG AAG AAG AAT T 115 80 

GTTTTTTGTC GTTTCTGTTG ATACACCAAT GATTACTGGT AAAGCTGTAA GCACGTTGTA 11640 

TCAGTTTTTA GTTTCTCATC TTATTGAAAA TCATTTAGAT GTCGCAGCTT T T AAAGAAG A 11700 

TGGACGTTTT AT T C C AAC AA TTGCATTTTA TAGTCCGAAT GCATTAGGCG C TAT AACTAA 117 60 

AGCACTACAT T C T GAT AAT T ACAGTTTTAA AAATGTATAT CATGAATTAT CAACGGATTA 11820 

TTTGGATGTA AGGGATGTAG ATGCGCCCTC ATATTGGTAC AAAAATATAA ATTATCAGCA 118 80 

TGATTTGGAC GCTTTAATTC AAAAATTGTA AGCTGTTAGG AG G T C C AC AA ATGGT AGAAC 11940 

AAATAAAAGA TAAACTAGGA CGTCCCATCC GTGACTTACG GTTATCTGTG ACAGATCGGT 12000 

GTAACTTTAG GTGTGATTAT TGCATGCCTA AAGAGGTATT TGGAGATGAT TTCGTATTTT 12060

TACCTAAAAA TGAACTTTTA ACGTTTGATG AAATGGCTAG AATCGCTAAG GTATATGCAG 12120

AATTAGGTGT AAAAAAAATA CGCATTACAG GTGGAGAACC ATTGAGCGA CGGGATTTAG 12180

ATGTACTTAT AGCTAAATTA AATCAAATCG ATGGTATTGA AGATATTGGT TTGACTACAA 12240

ATGGTTTGTT ATTAAAAAAG CATGGACAAA AGTTATATGA TGCTGGGCTG CGCAGAATTA 12300 

ATGTCAGTTT GGATGCTATT GATGATACGC TATTTCAATC AATCAATAAT CGTAATATTA 12360 

AAGCGACTAC GATTTTAGAA CAAATTGATT ACGCGACGTC TATTGGTTTG AATGTAAAAG 124 20 

TAAATGTTGT TATACAAAAA GGTATTAACG ATGATCAAAT CATACCAATG CTTGAATATT 124 8 0 

TTAAAGATAA ACATATAGAG ATTCGATTTA TAGAATTTAT GGATGTTGGT AATGATAATG 1254 0 

GATGGGATTT CAGTAAAGTT GTAACTAmG ATGAAATGCT TACAATGATA GAGCAGCACT 12600 

TTGAAATCGA TCCTGTAGAA CCAAAATATT TTGGGGAAGT AGCAAAATAT TATCGCCATA 12 660 

AGGATAATGG TGTTCAATTT GGTTTGATTA CAAGTGTTTC ACAATCATTT TGTTCTACAT 12720 

GTACACGCGC AAGGCTGTCA TCAGATGGGA AGTTTTACGG ATGTTTATTT GCACTGTCG 12780 

ATGGATTTAA CGTTAAAGCG TTTATTCGTT CTGGCGTGAC CGACGAAGAA TTAAAAGAAC 128 4 0 

AATTTAAAGC TTTATGGCAA ATAAGAGATG ATCGATATTC AGATGAGAGA ACTGCTCAAA 12 900 

CAGTTGCCAA TCGTCAACGT AAAAAGATAA ACATGAATTA TATTGGTGGT TAATGTGTAG 12960 

GGACCACTAC ATATTAAATC AT T AGAG AT G TTTTAATATT TCTGTCTTAC TCCCTAAAAT 13020 

ACAATATTAT TTATTAAAGT AAAAACGGTC ATATCTATGC CAGATTTAAT AGAAATGATC 1308 0 

GTTTTTAAAG TTTTTACAAG TTGGCGGGGC CCCAACACAG AAGCTGACAG AAAGTCAGCT 1314 0 

TACAATAATG TGCAAGTTGG CGGGGCCCCA ACATiSAGAA TTTCAAAAAG AAATTCTACA 13200 

GACAATGCAA GTTGGGGAAC GGGGCCCCAA CACAGAAGGT GACGAAAAGT CAGCATACAA 132 60 

TAATGTGCAA GTTGGCGGGG CCCCAACATA GAGAATTTCA AAAGAAATTC TACAGACAAT 13320 

GCAAGTTGGG GATCAACGAA ATAAATTTTA TGAGAATATC ATTTCTATCC CACTCTTAAG 13380 

AATCACTACA TAATAAATCT TTAGTGGTTC TTTAACATTG ATGTCACACT CCATGCCATT 13440 

GAGTTGTAAT ATATCTTTTT TAGGTATAAA TGTTGTCGAA TAAACAACAA GTTGTCCAAA 13500 

AGATATAAAT CTAAACAAGA TATAGCCAGC AATTTAATAT TTGTAATAGA TAAAATGCTA 135 60 

AGTTTGATAT ATAATAAA.TT TAAGTAATTG TATAATAATA TGAATTACAA ACATCTAAGA 13 620 

AGAAACATAG GAGGCATCAT ATTATGAGTA ATAAAGTTCA ACGTTTTATA GAAGCAGAAA 13 68 0 

GGGAGTTAAG TCAGTTAAAG CACTGGTTAA AAACAACACA TAAGATTTCA ATTGAAGAAT 137 4 0 

TTGTAGTCCT TTTTAAAGTG TATGAAGCTG AAAAGAT TAG CGTAAAGAA TTGAGGGATm 13800 

CATTACATTT TGAAATGCTA TGGGATACAA GTAAAATCGA TGTGATTATC CGTAAAaTCT 13860 

ATAAAAAAGA GCTTATTTCT AAATTGCGTT CTGAAACGGA TGAAAGACAA GTATTCTATT 13920

TCTATAGTAC TTCTCAAAAG AAATTGTTAG ATAAAATTAC TAAAGAAATA GAAGTGTTAA 13980

GCGTTACAAA CTAAAAACTT aAAAAgcaTG CCAATCTCTA TTCATCATAA TTGCGTCTTG 14040

GGGACAGACA AATGATGAAT AGAGATTGGC ATGCTTTTTA TTTTTGAATA TAAATATTTA 14100 

GTTCATGGCA TTTCTAGTTA CATGACGTCC ATGAATTAAG AAGTAAACAA GCATAGTAAT 14160 

GATTGCTAAA GCGGCCATAA AGCCGAAGAT TTCACTATAT GAAAACATAT GAGTAAATAA 14220 

CCCAAGGAAT GATGGACCGA AGCCGACACC TGCATCTAGA CCAACGTAAA AAGTAGATGT 14 28 0 

CGCGATACCA TATTTAATCG GGGGTGAGAC TTTTATCGCA ATAGATTGCA TTGCAGATGA 14 34 0 

TAAATTTCCA TACCCTAAAC CTAGGCAAGC ACCAGCAAGT AATATTAACCAGCTTTGATA 14 4 00 

GCTTGAAATT AAGCATACAA ATGAAAGGAA AAGCATGATA AATGCTGGGT AGACAATAAT 14 4 60 

ATTTTCATTT TTATCATCCA TCAATCTACC AGCAATAGGT CTAGTAATTA ACGATGCTAT 14 520 

AGCATAGCAA ATAAAGAAAT AGCTTGCTGC AGTGACTAGG TGTCGCTCTA AAGCAAATGC 14 580 

TTGTAAATAA GTTAGGATGG ACGCATAGGT AACGCCAATT AAAAGCATAA TTACAGCAAC 14 640 

AGGAATGGCC TCTTTTGCAA TAAATTGATG AATACTAAAT CTTGGTTTAT CAATGACATT 14 700 

AGTTTCAGTT TTGTTATTTG TTACTTCGAA ATCAACTTTT ATAAATAATG AGATAATGAG 14 7 60 

TCCGAGTATG CCTAATATGA CACAAATAAT ffiACAGTAAG TCAATTGCGT ATTTTGTAAT 14820 

AAGTAACATG CCTAGAAATG GGCCAATCGC TGTACCTAAT ACTAAACTTA AGGAAAATAA 14 88 0 

ACTGATGCCT TCACTTTTTC TATTAACAGG GGTAACGTAT GCCGCAATAG TACCTGTTGC 14 94 0 

AGTTGTCACA ACTGCAGTTG CGATACCGTT TATGAGACGT ACAAAGATTA AAAAAGEAA 15000 

AGATCCATCA ATAAAATAAA GTAATTGCGT GATAAT T AAA GCAATTAAAC CAATAAATAA 15 0 60 

TAATCGTTTA GGTCCrATTT sATTTACAAA TTTACCTGTA GCAAATCGA 15109
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9072 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

GAGAGTCAAT GGCAAGAAGA ATATAAATAT TTGAGAGCGT TAATCTTTAA TGAAACAGAA 60 

TTAGAGGAAG CGTATAAATG GATGCATCCT TGTTACACGT TGAATAATAA AAATGTAGTA 120

CTTATCCATG GCTTCAAAAA TTATGTTGCA CTATTATTTC ATAAAGGTGC CATTTTGGAG 180

GATAAATATC ATACACTCAT TCAACAGACT GAAAAGGTGC AAGCAGCTCG TCAGTTACGA 240

TTTGAAAATT TAACAGAGAT TCAAGCACGT AC C GAS AAA TTAAATATTA TCTAGCCGAA 300

GCAATTAAAG CTGAAAAAGC TGGTAAAAAA GTTGAAATGA AGAAAACAGA GGAATATGTT 3 60 

ATTCCTAAAG AGTTAGAAGC TAAGTTCGAA GAAATGCCAC AG T TAG AG T C TTCATTTTAT 4 20 

AAATTAACGC CAGGCAGACA ACATCAATAT ATATATCATA TTGGACAAGC TAAACGCAgT 4 80 

GgAACAAGAC AAAAGCGTGT TGAAAAGTAT ATTAACCAAA TACTAGAAGG TAAAGGGATG 54 0 

CAT G AT AAG T AATTAATGAG TAAAGCATAC C GG T TAT AC A ACAACATACA AGATGACACG 600 

AAACAACCAA TGGCTCATGC TGTTGGTTGT TTTTTTAGGT GTGTCTGTCA TGGGCAACAC 6 60 

TTTGACGTTG GAATTCCGTT ACAGGCTTGG GAGTAGAAAA TGTTAGCAAA AGGCAAGGGT 7 20 

GTCTACAATG AATGATGAAG AT AT T AAAAT ATAAGGATGA CTTTGTGAGT GGCGGATGGG 780 

CGGTTGTCCG TCTGTAACAA TGGATGCGTG TGCATTATTA CAAAAATTCG ACTTTTGTAA 840 

TAATATTTCA CATTTTCGAC ACTTTTTTGC TATAAAACAA CCATTGAGC GATAATAAAT 900

TCGCTTTTAA AAAATATGAG TTATCTATTT AGTTGCCAAA GATAAAATAA TAATGTTTAA 960

TAACATCATA TAGAGTATGT TAGTTTTAAA TGTCGAATAT ACGAATGTGc AAACAAAGTA 1020

ATCGGTAGAA ATTCAACATA CATAGCGCCG TTTACTGTTA AGTATTCACA T T AC AG AT GA 10 80 

AAAAT AT AAA ATTCTACATA AT C AAGAC C A TGATGTGTAC TTGTTTAACT TATGACTCTA 114 0 

TTTGTTTAAC AATTGCGATA ATGGTCTTTT TATTTTATGC GTATCATTCG TCATATTTTT 12 00 

TATGAGGAAG G AG AAAT GAT TAT G T T AAG T ATTAAGCATT TAACGAAAAT TTATTCTGGT 12 60 

AATAAAAAGG CAGTAGATGA CATCKTTTA GATATTCAAT CTGGGGAATT TATCGCATTT 1320 

ATTGGAACCA GTGGAAGTGG CAAAACGACT GCTTTAAGAA TGATAAACCG TATGATTGAA 1380 

GC G AC AG AAG G AC AAAT T GA AATTGATGGT AAAGATGTTC GGAGTATGAA TCCTGTCGAA 14 4 0 

TTGCGTAGAA ATATTGGCTA TGTTATTCAA CAAATTGGCT TAATGCCTCATATGACGATT 150 0 

AAAGAGAATA TTGTGTTGGT ACCCAAATTG TTGAAATGGA CTAAAGAGGA AAAGGATAAA 15 60 

CGTGCAAAGG AATTAATTAA ACTTGTGGAT TTACCGGAGT CATTTTTAGA GCGTTATCCA 162 0 

GCAGAACTAT CAGGTGGGCA ACAACAACGT ATCGGTGTTG TAAGAGCACT TGCGGCCGAA 1680 

CAAGATATTA TTTTAATGGA TGAACCTTTT GGTGCATTGG ATCCTATTAC GAGAGATACG 1740 

TT AC AAG ATT TAGTTAAAAC GTTACAACGA AAATTAGGCA AGACGTTTAT CTTTGTAACA 18 00 

CATGATATGG AT GAAGCGAT TAAATTAGCA G AC AAAAT T T GTATTATGTC AGAAGGTAAG 18 60 

GTGGTGCAAT TTGATACGCC AGACAATATT T'AAGACATC CCGCAAATGA TTTTGTACGT 192 0 

GATTTTATAG GACAAAATAG AC T GAT T C AA GACCGTCCCA ATGAC AAGAC TGTAGAAGGT 198 0 

GTAATGATTA AACCAATCAC GAT AC AAG C A GAAGCAACAC TGAATGACGC CGTTCATATT 204 0 

ATGAGACAAA AACGTGTTGA TACTATTTTT GTAGTAGATA GTAATAACCA TTTACTACT 2100

TTCTTAGACA TTGAAGATAT AAATCAGGGT ATACGTGGAC ACAAAAGTTT ACGAGACACC 2160

ATGCAACAAC ATATTTATAC CGTTCAAATT GATTCTAAAT TACAAGATTC TGTACGTACG 2220 

ATTTTAAAAA GAAACGTTAG GAATGTACCT GTCGTAGATG ATCAACAGCG TTTAGTAGGA 22 8 0 

CTGATTACGC GTGGCAATGT TGTTGATATT GTATATGACA CGATTTGGGG CGATAGTGAG 234 0 

GATACAGTGC AAACAGAACA TGTGGGGGAA GACAcTGCGT CCTCAAAAGT GCATGAGCAA 2 4 00 

CACACTACTA ATGTCAAAGT ACGTGACATA GGAGATGATA AATCATGATT GAGTTCCTAC 24 60 

ATGAACATGG TGGACAGTTG ATGTCGAAAA C AC T GG AAC A T T T C T AT AT T TCTATAGTGG 2520 

CATTATTACT TGCCATCATT GTTGCAGTAC CTATAGGCAT TTTATTATCA AAAACAAAGC 2580 

GAACTGCCAA TATTGTATTA ACTGTGGCAG GTGTCTTACA AACTATTCCA ACACTAGCTG 2 64 0 

TACTTGCTAT TATGATACCG ATTTTTGGTG TTGGTAAAAC GCCTGCAATT GTAGCGCTAT 2700

TTATTTATGT ATTATTACCT ATTTTAAATA ACACGGTACT CGGTGTTCAA AATATTGATA 27 60 

GCAACATTAA AGAAGCTGGA AAAAGTATGG GAAT GACACA ATTTCAATTG ATGAAGGATG 2820 

TTGAATTGCC GTTAGCATTG CCGCTTATCA TTGGTGGCAT TCGTTTGTCA TCTGTGTATG 2880 

TAATTAGTTG GGCTACACTT GTAAGTTATG TAGGTGCGGG TGGATTAGGT GATTTCATTT 2 94 0 

TCAATGGTTT AAATTTATAT GATCCACTGA TGATTGTAAC TGCAACGGTA CTCGTTACTG 3000

CACTAGCATT AGGTGTTGAT GCCTTATTAG CTTTAGTTGA AAAATGGGTA GTTCCCAAAG 3060 

GCTTAAAAGT ATCTGGATAA TTAGGAGGCT AAGATAATGA AGAAAATAA AT AT AT ACT T 3120 

GTCGTGTTTG TCTTATCGCT TACCGTATTA TCTGGATGTA GTTTGCCCGG ACTAGGTAGT 3180 

AAGAGCACGA AAAATGATGT C AAAAT T AC A GCATTATCAA CAAGCGAATC GCAAATTATT 324 0 

TCACATATGT TACGGTTGTT AATAGAGCAT GAT AC AC AC G GTAAGATAAA GCCAACATTA 3300 

GTAAATAATT TAGGGTCAAG TACGATTCAA CATAATGCCT TAATTAATGG GGATGCTAAT 33 60 

ATATCAGGTG TTAGATATAA TGGCACAGAT TTAACGGGAG CTTTGAAGGA AGCACCAATT 34 20 

AAAAATCCTA AGAAAGCAAT GATAGCAACA CAACAAGGAT TTAAAAAGAA ATTTGATCAA 34 80 

ACGTTTTTTG ATTCGTATGG TTTTGCGAJT ACGTATGCAT TCATGGTAAC GAAGGAAACC 3540 

GCTAAAAAAT AT CAT T TAG A GACAGTTTCA GATTTAGCAA AGCATAGTAA AGATTTACGT 3 600 

TTAGGTATGG ATAGTTCATG GATGAATCGT AAAGGCGATG GCTATGAAGG ATTTAAAAAA 3 6 60 

GAGTATGGTT TTGACTTTGG TACAGTGAGA CCAATGCAAA TAGGTCTAGT CTAGACGCA 3720 

TTAAACTCAG AGAAGTTAGA CGTTGCATTA GGTTATTCTA CAGATGGTCG AATTGCGGCG 37 8 0 

TATGATTTGA AAGTACTTAA AG AT GAT AAA CAATTTTTCC CACCTTATGC TGCGAGTGCT 384 0 

GTTGCAACAA ATGAATTATT ACGGCAACAC CCAGAACTTA AAACGACGAT TAATAAGTTG 3900 






ACAGGAAAGA TTTCGACTTC AGAGATGCAA CGCTTGAATT ATGAAGCGGA TGGTAAAGGT 3960 

AAAGAACCTG CTGTCGTCGC AGAAGAATTT TTGAAAAAAC ACCACTATTT T G AT AAAC AG 4 02 0 

AAAGGTGGTC ATAAGTAATG GAAGGTAATT TATTACAGCA ATTATTCAAT TATTATGTTA 4 080 

CGAACTTTGG TTATCTATGG GATTTATTTT TCAAAC&CTT ATTAATGTCT GTCTATGGTG 414 0 

TGCTGTTTGC AgCTTTAATT GGTATTCCAT TGGGAATCTT GCTTGCaAGA TACACAAAAC 4200 

TTTCTGGATT TGTAATTACA ATTGCAAATA TAATTCAAAC AGTTCCAGTC ATTGCAATGT 4 2 60 

TAGCTATTTT AATGTTAGTC ATGGGCTTAG GTTCAGAAAC AGTAGTTTTA ACAGTGTTTT 4320 

TATATGCGTT ACTTCCAATT ATAAAAAACA CTTATACTGG TATAGCTAGT GTTGATGCGA 4 38 0 

ATATTAAGGA TGCTGGCAAA GGTATGGGAA TGACACGCAA TCAAGTGCTA CGAATGATTG 4440 

AATTACCGTT ATCTGTTTCG GTTATTATCG GTGGCATTCG TATTGCCTTG GTTGTTGCGA 4 500 

TAGGTGTTGT TGCCGTTG3A T CAT T TAT AG GAGCACCTAC GCTTGGTGAC ATTGTGATTC 4 5 60 

GTGGTACAAA TGCGACGGAT GGCACAACGT TTATTTTAGC AGGTGCGATT CCGATTGCTA 4 62 0 

TCATTGCAAT CGTCATTGAT GTACTATTAA GATTTTTAGA AAAAC GAT T A GACCCAACAA 4 680 

CACGACATCG TAAAAATCAA TCTAATCATC GGCCGCAAAG TATAATATG TAATAGTAGA 474 0 

AGATGTTTAT AATTTAGCGA TTTCGTTTCA TGATTTATAA AAAATGAGGC TACTCAAGGA 4 800 

GCTCAAATAA TCTTTGAGTA GCCTTTTTAT AGGTTGTGTT TGTATGCGTT T AC AC T AAAA 4 8 60 

TAGCAATTAT TATCATGAAA GTTTTTGGAT AAAAAGCGTT AATTATTGTA AAAATACTAA 4 92 0 

AAAATGAGAT GTTTTATTTA TAATTTTCTG CAAATT TATG ATATTGTTTC TTAATATATC 4 98 0 

AT AT T AAAAA TTTGTTTTTC T T AAACAT AG GAGGCTTATC TAATTCATGG AC AC AT C AAA 5040 

ACAATTTAGA GGTGACAACC GATTGCTTTT GGGTATCGTT TTAGGGGTTA TTACCTTTTG 5100 

GCTATTCGCG CAGTCACTTG TTAATTTTGT TGTCCCATTA CAATCAACAT AT AG TAG T G A 5160 

CGTTGGAACG ATAAATATCG CTGTTAGCTT ATCTGCCTTA TTTGCTGGTT TGTTTATCGT 5220 

AGGTGCTGGT GATGTTGCTG ATAAATTTGG TCGCGTCAAA ATTACTTATG TAGGATTGAT 5280 

ATTAAATGTT GTAGGTTCAT TACTCATCAT CATTACACCT TTGCCAGCATTTTTAATTAT 534 0 

AGGTAGAATA ATTCAAGGTT TGTCTGCAGC AT GT AT TATG CCATCAACAC TTGCTATTAT 54 00 

TAACGAATAT TATATTGGTA CAAGAAGACA ACGTGCCTTA AGCTATTGGT CTATTGGTTC 5 4 60 

TTGGGGTGGT AGTGGTATTT GTACGTTGTT TGGTGGCTTA ATGGCTACAT ATATAGGTTG 5520 

GCGTTCAATA TTTGTTGTTT CAATTCTATT AACATTATTA GCAATGTACT TAATCAAACA 5 58 0 

TGCACCTGAG ACTAAAGCAG AACCAATCAA AGGTATGAAA GCAGAAGCTA AAAAGTTTGA 564 0 

CGTTATTGGT TTAGTCATTT TAG TAG T G AC GATGTTAAGT TTAAATGTAA TCATCACACA 57 00 

GACGTCTCAT TTTGGTTTAG TTTCACCGTT AffTCTAGGT TTAATTGTTG TGTTTATCTG 5760

TTCATTAGTT GGTTTTGTAT ATTACGAAAA TAAAATAAAG CATCCACTTG TTGATTTTTC 5820

AATTTTTAAA AATAGAGGAT ACAGTGGTGC AACTATTTCA AACTTCTTAT TAAATGGTGT 5880

AGCAGGTGGT GCACTTATCG TTATTAACAC GTATTATCAA CAACAATTAG GATTTAATC 5940

TTCGCAAACG GGTTATATTT CATTAACGTA TTTAATAACA GTGTTGTCAA TGATTCGTGT 6000

AGGTGAAAAG ATTTTATCTC AACATGGTCC GAAGCGCCCA CTATTACTAG GAAGTGGCTT 6060

TACAGTGATT GGGTTAATCT TATTGTCGTT AACATTTTTA CCAGAAGTGT GGTATATCAT 6120 

ATCTAGTATA GTTGGATATT TATTGTTTGG TACTGGTTTA GGATTATATG CTACACCATC 618 0 
AACTGATACA GCAGTTGCTA GTGCGCCAGA TGATAAGTCG GGTGTTGCTT CAGGTGTGTA 62 4 0 

TAAAATGGCG TCATCATTAG GAAATGCATT TGGAGTAGCA GTATCTGGTA CGGTTTATAC 6300 

TGTGTTAGCA GCTAATTTAA ATTTGAACTT AGGTGGTTTC ACAGGTATGA TGTTTAATGC 6360 
CTTGCTAGCA ATTGTTGCAT TTTTAGTCAT TTTACTATTA GTTCCTAAAA ATCAAACGAA 64 20 

TTTGTAAAAC TGAAATGAAA GCAAGTTATT ATGTAGGGAT TTTAAAGGAA ATTTTGTGAA 6480

AGTAAGTTTA TCATACACAC TTAATGTTGC GTATTGACGT TTAATGTTAG GTGTGTTCTT 6540

TTATAGACGA TAAAAGCTGT GTGCATATTA AGCGAATGAT TTTCAAATTG ACGCTAATAT 6600

GCGAAAGTAG TATTTTTAAA ATGAACAACA ACGATGAAGA GGGGTTTATA GGATGAAAAT 6660

TGCAATTGCT GGATCGGGTG CATTAGGTAG TGGCTTTGGT GCCAAACTAT TTCAAGCAGG 6720

ATATGATGTC ACACTTATTG ACGGATATAC ATCTCATGTT GAAGCGGTTA AGCAACATGG 67 8 0 

ATTAAATATA ACGATTAATG GAGAGGCATT CGAGTTAAAC ATTCCGATGT ATCATTTTAA 68 4 0 

TGATCAACCG GACGAAAGCA TTTACGATGT TGTCTTTCTA TTTCCAAAGT CTATGCAATT 6900 

AAAAGAAGTG ATGGAAGATA TGAAGCCACA TATTGATAAT GAAACGSCG TCGTATGTAC 6 9 60 

GATGAATGGT CTGAAGCATG AAGAAG T CAT TGCGCAGTAT GTTGCTCAAT CACAAATTGT 7 02 0 

CAGAGGTGTT ACGACTTGGA CGGCAGGTCT TGAAAGCCCT GGACACAGTC ATTTACTTGG 7 08 0 

TAGTGGACCA G T T G AAAT AG GTGAACTAGT GGATGAAGGT AAAGAAAATG TTATAAAAGT 714 0 

TGCTGATTTA CTTAACGAAG CGGAATTGAA TGGTGTCATT AGTAAAGATT TATACCAATC 72 00 

GATTTGGAAA AAGATTTGTG TTAATGGTAC GGCAAATGCA TTAAGCACAG TGTTGGAGTG 7 2 60 

TAATATGGCA TCGCTGAATG AAAGTAGTTA TGCGAAGTGT TTGATTTATA AATTAACGCA 7 32 0 

AGAAATAGTG CATGTAGCGA CGATTGATm TGTTCATTTA AATGTTGATG AAGTATTTGA 7 380 

ATATTTAGTT GAT T T AAAT G AAaAAGTTGG TGCGCATTAT CCATCCATGT ATCAAGATTT 74 40 

AATTGTTAAT AATAGAAAAA CTGAAATTGA TTATATTAAT GGCGCAGTTG CAACATTAGG 7500 

TAAACAACGT CaTATTGAAG CGCCAGTCAA TCGCTTTATT ACTGATTTAA TTCAACTAA 7560

AGAAAGTCaG CGACACGCAC AGGATTAATT GGTGTTTTAG GTATGATTAC AAGGAAATGA 7620

CAATCACGTG ATATTACGGT CATTATTAAG ATTGAAATGT AATAAATAAA GAACAGCAGT 7 680 

AAGGTACTTT C AAAT T G AAA TGATCTTGGT GCTGTTTTTC TTGATTGATC TTCGTCATAA 77 4 0 

TTCAGATTTG TCATAGGcTA CGACATACTA TTAGTATTTA CTAGACAGTT TTTACGACGA 7800 

CACTTTGAAA AATTTTGAGG CAAATCATTT GGAAGTCTCA CGTGAATTTT GTAAACTCAT 7 8 60 

CAAGCAAGTA AT TAT AT T AA AAAGACAAAT AGAGAAAAGG TGTTTATAAT GAGTAAAATT 7 920 

TTTGTAACTG GTGCAACGGG CCTTATTGGC ATTAAiTTAG TTCAAAGACT AAAAGAAGAG 7980 

GGGCATGAGG TTGCTGGTTT TACTACATCT GAGAATGGTC AACAAAAGCT AGCTGCTGTT 804 0 

AATGTAAAAG CATATATTGG TGATATATTA AAAGCTGATA CTATTGATCA AGCGTTAGCA 8100 

GATTTTAAAC CAGAAATCAT TATCAATCAA ATTACGGATT TAAAAAATGT TGATATGGCA 8160 

GCAAATACGA AAGTACGTAT TGAAGGTTCT AAAAACCTAA TTGATGCGGC GAAAAAGCAT 82 2 0 

GACGTTAAGA AAGTAATTGC CCAAAGTATT GCCTTTATGT ATGAACCTGG CGAAGGAT T A 8280 

GCAAATGAGG AAACTTCACT TGATTTTAAC TCAACTGGCG ATAGAAAAGT AACGGTTGAT 834 0 

GGTGTGGTTG GTTTAGAAGA AGAAACGGCT CGTATGGATG AATACGTTGT TTTACGTTTT 8 4 00 

GGCTGGTTAT ATGGCCCAGG TACTTGGTAC GGAAAAGATG GCATGATTTA TAATCAATTT 8 4 60 

ATGGATGGTC AAGTGACACT TTCAGATGGC GTAACATCAT TTGTGCATCT TGATGATGCA 8 52 0 

GTTGAAACAT CTATTCAAGC TATTCATTTT GAAAATGGTA TC2ITAATGT AGC AG AT GAT 8 58 0 

GCACCTGTTA AAGGTTCTGA ATTTGCAGAA TGGTATAAAG AACAACTTGG TGTTGAACCA 8 64 0 

AATATTGATA TTCAACCTGC GCAACCATTT GAACGTGGCG TAAGCAATGA GAAGTTTAAA 87 00 

GCGCAAGGTG GTACTCTGAT TTATCAAACT TGGAAAGATG GCATGAATCC AATTAAATAA 87 60 

TAATTTATCC GTTTAATATA CAAAGAATAA AGACTTGGTC GAATCGTGGA TGATATATTA 8820 

TCAAACGCAC GGCTCGAACA AGTCTTTTTT ATTATGTCTT CGTTATCTTT G T AT GAAGG A 88 80 

ATAACAGAAT TACAATTAAT GTACTGAATA ATGCAATTAA TGTTGTGATT AGTGCTAATT 8 94 0 

TAATTTCTAT TGGTAGCCAA GTCAGTACAA AAGACCAATT ATTGCTACCG AGAATGAGAT 9000 

ATGGTAATGC ATATAATATG AGCGCTAAAG CGATACATAT AC AT AAT GAT AACCAACTCA 90 60 

ATACAGCAAT CC 9072 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16826 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:

GTGGAACAGC TGTAACTATA TCATTTCTTT CAACATTTAT TGGGAAATG TTAGCTACAT 60 

TTCTATATCC GATTAATAAT GTAGTACTTT CATATATnTC TGTAAATGAA AGTGACAATA 120 

TAAAGAAGCA ATATTTGaAA ACTAATCTAA TTGCTATAGC TGCCCTATGT TTAGTCATGA 180 

TTATATGTTA TCCAATTACA ATAATTATTG TCTCTTTACT GTATAACATT GATTCAAGTT 24 0 

TATATTCGAA GTTTATTATT TTAGGTAATA TAGGTGTTTT ATTCAATGCA GTGAGTATTA 300 

TGATCCAAAC TTTAAATACA AAACACGCAT CAATAACATT ACAAGCGAAT TATATGACGC 360

TTCACACGAT TACATTTATA TTCATAACTA TTTTAATGAC AATTGCGTTT GGTCTAAATG 420

GATTCTTTTG GACAACGCTG TTCAGCAACA TTATTAAGTA TGTGATTTTA AATATTATAG 480

GTTTAAAGTC TAAATTCATT AATAAAAAGG ACGTCGATTA GATGAGTGAA AAAAAGATTT 540

TGATTTTATG TCAGTATTTT TATCCGGAAT ATGTATCTTC TGCGACGTTA CCAACTCAAT 600 

TGGCGGAAGA TTTAATTGCG AATCACATTA ATGTCGATGT CATGTGTGGA TGGCATATG 660 

AATATAGTAA TCATAAACAG GTTTCTAAAA CCGAGATGCA TCGTGGTATT CGCATTCGAC 720

GTCTCAAGTA TTCGAGGTTT AATAACAAAA GTAAGGTTGG AAGGATCATC AATTTCTTTA 780

GTTTATTTTC AAAATTCGTG ATTAATATAC CTAAAATGTT GAAATATGAT CAGATTCTTG 840

TTTACTCTAA TCCACCAATC TTGCCATTAA TACCAGACGT TTTACACAGA CTGCTTAAGA 900

AAAAATATTC TTTTGTGGTG TATGATATAG CACCTGATAA TGCGATTAAG ACAGGTGCAA 960

CTCGTCCAGG TAGCATGATT GATAAGCTGA TGCGTTACAT TAATAGACAT GTCTACAAGA 1020

ATGCTGAAAA TGTCATTGTC CTTGGTACGG AAATGAAAA CTACTTACTA AATCATCAAA 1080

TTTCTAAAAA TGCTGACAAT ATCCATGTGA TTCCTAACTG GTATGACATG CGTCAATTAC 1140

AAGACAATCG TATCTATAAT GACACATTTA AAGCTTACCG TGAGCAATAC GACAAAATTT 1200

TATTGTATAG CGGTAATATG GGGCAGTTAC AGGATATGGA GACACTTATC TCATTTTTAA 1260

AATTAAATAA GGATCAGTCT CAAACGTTAA CAATACTTTG TGGTCATGGT AAGAAATTTG 1320

CAGATGTCAA AACGGCAATA GaAGACCATC GTATTGAAAA TGTTAAAATG TTTGAGTTTT 1380

TAACAGGTAC AGACTATGCT GACGTATTAA AAATTGCGGA TGTATGTATT GCATCGCTGA 1440

TTAAAGAAGG CGTCGGTTTA GGCGTGCCGA GCAAGAATTA TGGCTATCTT GCAGCTAAGA 1500 

AAGCGTTGGT ACTCATCATG GATAAGCAAT CTGATATCGT TCAACATGTT G7AAC AAT AT G 1560

ATGCGGGTAT CCAAATTGAT AATGGCGATG CACATGCCAT TTATAACTTC ATCAACACTC 1620 

ACTCGAGTAA GGAATTGCAC GAGATGGGTG AGCGCGCACA TCACTGTTT AAAGATAAAT 1680 

ATACGAGAGA AATTAATACT ATGAAGTATT ACAATCTGTT GAAGTGAGGA GATAATTATG 1740
AAGCGATTAT TCGATGTAGT GAGTTCAATA TATGGTTTAG TAGTTTTAAG TCCGATTCTG 1800

TTAATTACAG CATTACTAAT TAAAATGGAa TCACCTGGAC CAGCCATTTT CAAACAAAAA 1860

AGACCGACGA TTAATAATGA ATTGTTTAAT ATTTATAAGT TTAGATCAAT GAAAATAGAC 1920

ACACCTAATG TTGCAACTGA TTTAATGGAT TCAACATCGT ATATAACAAA GACAGGGAAG 1980 

GTCATTCGTA AGACCTCTAT TGATGAATTG CCACAATTAT TGAATGTTTT AAAAGGAGAA 2040

ATGTCAATTG TAGGTCCTAG ACCAGCGCTT TATAATCAAT ACGAATTAAT CGAAAAACGT 2100 

ACAAAAGCGA ACGTGCATAC GATTAGACCA GGTGTGACAG GACTAGCTCA AGTGATGGGG 2160 

AGAGATGATA TCACTGATGA TCAAAAAGTA GCGTATGATC ATTATTACTT AACACATCAA 2220

TCTATGATGC TTGATATGTA TATCATATAT AAAACAATTA AAAATATCGT TACTTCAGAA 2280

GGTGTGCATC ACTAATGAGA AAAAATATTT TAATTACAGG CGTACATGGA TATATCGGTA 2340

ATGCTTTAAA AGATAAGCTT ATTGAACAAG GACATCAAGT AGATCAAATT AATGTTAGGA 2400

ATCAATTATG GAAGTCGACC TCGTTCAAAG ATTATGATGT TTTAATTCAT ACAGCAGCTT 2460

TGGTTCACAA CAATTCACCT CAAGCAAGGC TATCTGATTA TATGCAAGTG AATATGTTGC 2520

TGACGAAACA ATTGGCACAA AAGGCTAAAG CTGAAGACGT TAAACAATTT ATTTTTATGA 2580

GTACTATGGC AGTTTATGGA AAAGAAGGTC ATGTTGGTAA ATCAGATCAA GTTGATACAC 2640

AAACACCAAT GAACCCTACG ACCAACTATG GTVTTTCCAA AAAGTTCGCT GAACAAGCAT 2700

TACAAGAATT GATTAGTGAT TCGTTTAAAG TAGCAATTGT GAGACCACCA ATGATTTATG 2760

GTGCACATTG CCCAGGAAAT TTCCAACGGT TAATGCAATT GTCAAAGCGA TTGCCAATCA 2820 

TTCCCAATAT TAACAATCAG CGCAGTGCAT TATATATTAA ACATCTGACA GCATTTAUG 2880 

ATCAATTAAT ATCATTAGAA GTGACAGGTG TGTACCATCC TCAAGATAGT TTTTACTTTG 2940

ATACATCGTC AGTAATGTAT GAAATACGTC GCCAATCACA TCGTAAAACG GTATTGATCA 3000

ACATGCCTTC AATGCTAAAT AAGTATTTTA ATAAGTTGTC GGTCTTTAGA AAATTATTCG 3060

GCAATTTAAT ATACAGCAAT ACGTTATATG AAAATAATAA TGCACTTGAA ATTATTCCTG 3120

GAAAAATGTC ACTTGTTATT GCGGACATCA TGGATGAAAC GACAACCAAA GATAAGGCAT 3180

AAGTCATCTA TTAAATAAAA TCAACATACA AATCGTTTTA TTTGGAGGTT ATAGTATGAA 3240

GTTAACAGTA GTTGGCTTAG GTTATATTGG TTTACCAACA TCAATTATGT TTGCAAAACA 3300

TGGcGTCGAT GTGCTTGGTG TTGATATTAA TCAGCAAACG ATTGATAAGT TACAAAGTGG 3360

TCAAATTAGT ATTGAAGAAC CTGGATTACA AGAGGTTTAT GAAGAGGTAC TGTCATCGGG 3420

AAAATTGAAG GTATCTACAA CGCCAGATGC ATCTGATGTT TTTATCATTG CCGTTCCGAC 3480

GCCGAATAAT GATGATCAGT ACCGGTCATG TGACATTTCG CTAGTTATGC GTGCATTAGA 3540

TAGTATTTTA TCATTTTTAG AAAAAGGAAA TACCATTATT GTAGAGTCGA CAATTGCGCC 3600
TAAAACGATG GATGATTTTG TAAAACCAGT CATTGAAAAT TTAGGGTTTA CAATAGGTGA 3660

AGATATTTAT TTAGTGCATT GTCCAGAACG TGTACTGCCA GGAAAAATTT TAGAAGAATT 3720

AGTTCATAAC AATCGTATCA TTGGCGGTGT GACTGAAGCT TGTATTGAAG CGGGTAAACG 3780 

TGTCTATCGC ACATTCGTTC AGGGAGAAAT GATTGAAACA GATGCACGTA CTGCTGAAAT 3840

GAGTAAGCTA ATGGAAAACA CATATAGAGA CGTGAACATT GCTTTAGTA ATGAATTAAC 3900

AAAAATTTGC AATAACTTAA ATATTAATGT ATTAGATGTG ATTGAAATGG CAAACAAACA 3960

TCCGCGTGTT AACATCCATC AGCCTGGTCC AGGTGTAGGC GGTCATTGTT TAGCTGTTGA 4020

TCCGTACTTT ATTATTGCTA AAGACCCTGA AAATGCAAAG TTAATTCAAA CTGGACGTGA 4080

AATTAATAAT TCAATGCCGG CCTATGTTGT TGATACAACG AAGCAAATCA TCAAAGTGTT 4140

GAGCGGGAAT AAAGTCACAG TATTTGGTTT AACTTATAAA GGTGATGTTG ATGATATAAG 4200

AGAATCACCA GCATTTGATA TTTATGAGCT ATTAAATCAA GAACCAGACA TAGAAGTATG 4260

TGCTTATGAT CCACATGTTG AATTAGATTf TGTGGAACAT GATATGTCAC ATGCTGTCAA 4320

AGACGCATCG CTAGTATTGA TTTTAAGTGA CCACTCAGAA TTTAAAAATT TATCGGACAG 4380

TCATTTTGAT AAAATGAAGC ATAAAGTGAT TTTTGATACA AAAAATGTTG TGAAATCATC 4440

ATTTGAAGAT GTATCGTATT ATAATTATGG CAATATATTT AATTTTATCG ACAATAAAA 4500

TGTGTCAAAC TAGGGCATAC ATGATTAAGG AAAGATAAGC TGTCATGTGT TTGAACTTCA 4560

GAGAGGATAA TGTTATGAAA AAAATTATGG TTATTTTCGG TACGAGACCC GAAGCAATAA 4620

AAATGGCACC ATTAGTAAAA GAAATTGATC ATAATGGGAA CTTTGAAGCG AACATTGTGA 4680

TTACAGCACA ACATAGAGAT ATGTTAGATA GTGTGTTAAG TATATTTGAT ATTCAAGCTG 4740

ATCATGATTT AAATATTATG CAAGATCAAC AAACATTAGC AGGCCTTACG GCGAATGCAC 4800

TTGCTAAACT TGATAGCATC ATTAATGAGG AACAACCGGA TATGATTTTA GTACATGGTG 4860

ATACTACAAC GACTTTTGTA GGAAGTTTGG CAGCATTTA T CAT C AAAT T CCGGTCGGAC 4920

ATGTAGAAGC TGGACTTCGA ACACATCAGA AATACTCACC ATTTCCTGAA GAGTTAAATC 4980

GAGTCATGGT AAGTAATATT GCTGAATTGA ATTTTGCGCC AACAGTAATT GCAGCTAAAA 5040

ATTTACTTTT TGAAAACAAA GACAAAGAGC GTATCTTTAT TACTGGAAAT ACAGTTATTG 5100

ACGCATTGTC AACAACAGTT CAAAATGATT TTGTTTCAAC GATTATTAAT AAACATAAAG 5160

GCAAGAAAGT TGTTTTACTA ACAGCGCATC GTCGTGAAAA TATTGGGGAA CCGATGCATC 5220 

AGATTTTTAA AGCAGTAAGA GATTTGGCAG ATGAATATAA AGATGTTGTC TTCATTTATC 5280 

CAATGCATCG TAATCTAAAG GTAAGAGCGA TTGCCGAAAA ATATTTATCT GGGAGAAATC 5340

GGATTGAATT AATTGAGCCA TTAGATGCGA TTGAGTTCCA TAATTTTACA AATCAATCGT 5400
ACCTCGTGCT GACAGATTCT GGTGGTATTC AAGAGGAGGC TCCTACATTT GGAAAACCTG 5460

TGTTGGTATT AAGGAATCAT ACAGAGCGTC CCGAAGGCGT TGSGCGGGA ACATCGAGAG 5520

TAATTGGCAC AGATTATGAC AATATTGTTC GAAATGTGAA ACAATTGATT GAGGATGATG 5580

AAGCGTATCA ACGTATGAGT CAAGCGAATA ATCCATATGG TGATGGACAA GCATCACGAC 5640

GTATTTGTGA AGCAATAGAA TATTATTTTG GATTGCGCAC AGACAAGCCG GATGAATTCG 5700

TACCTTTACG TCACAAATAA TAAAAAACCC CTAATCATGA AGTTGGTTTA GACAACCAGC 5760

GGTGACTAGG GGTTTTTAAT ATATTTATTT TTGATAGTGG TAGCCAATAT CATATTTGAA 5820 

TACTTTATTT GATAATATTG GACTTTGCTG TCCATCGTCA TCACTTTTTA AACGTACATT 5880 

TTTATGAGCT TCTTTAAATA CATCGSAATT CAACCAATTA TTAAAGCTAT CTTCAGATTC 5940

CCAAATAGTT AAGATTTTAA CTTCGTCTGT ATCCTCGGTA TTTAATGTTT TAGTGACAAA 6000 

CATTTGTTGG AAGCCTTCAA TAGTTTCAAT ACCTTGTCTA TTGTAAAAAC GTTCAATCGT 6060 

TTCTTCCGCA CTGCCTTTTT GTAATTGTAA TCTATTTTCT GCCATAAACA TGGGCAATCA 6120

CTCCTCTATT TTATGATTTG ATTTGGGTAA TGTTTTTACA AATGTAAAGA GTACAGCGGT 6180

TTGTATGATA ACCATTATGA TTAATCCTAC ACGGACTGCA AGAACATCCA CCATATAAAT 6240

TGAAAAACCT ATTACAATGT ATAAGCTAAT TAAAATTTTA ATTTTCTGTT GTAGCGTGTA 6300 

GCCTCGATGT AAATAAAAGT TTTCTACATA TTCTTTATAA ATTTTTTGAT TAATAAGCCA 6360 

ATTGTAAAAG CGATCTGAAC TTCGAGCAAA GCAAAAAACT GCTACGAGTA AAAAAGGGGT 6420

CGTTGGCAGT AAAGGTAATA CGGCACCTGC AATACCAAGC GCTGTAAATA TTAAGCCAAT 6480

GACGATTAAA ATAAGTCGCA TTGAAAAAAC TCATTCTAG TACTAATGCG CATGTAATAT 6540

TGTTTTAGTA ATATAACTCA TGCTAAATAT AATGTGTATG ATAAGTGCAA TGACTCAGTA 6600

AAATGAAACG ATGTTGAATT ATCCTTGTCA CATTAACGCA TTTTAAGCGC GACTTTCATA 6660

ACAACCAAAC TATTTAATGA GAATTATTCT CAAGTATTAT AGTTATATTA TGTGTTTAT 6720

TTTTGAAAAG TGCAATATGT TTTCGAAAAT AAGATTATTT TTATGTGCAA AAACGACGCA 6780

AAAGTTTTAA AAATGAGACT TCTGTGAGCT GATTATTTTA TAAAATGTAA ACGCTTACTA 6840

TATAATGTGA ATCATATCGT TTAAAAGCAT TATTAAATAT GATGCTAAGA GATTTATATT 6900 

ATAGCCAATA AACAAAGGAG AGATAATATG GCAGTAAACG TTCGAGATTA TATTGCAGAG 6960 

AATTATGGTT TATTTATCAA TGGGGAATTT GTTAAAGGTA GCAGTGACGA AACAATCGAA 7020

GTGACTAATC CAGCAACTGG AGAAACACTA TCACATATTA CAAGAGCAAA AGATAAAGAT 7080

GTCGATCATG CAGTCAAAGT GGCGCAAGAG GCATTTGAAT CATGGTCATT AACTTCTAAA 7140

TCAGAACGTG CACAAATGTT GCGTGATATT GGTGATAAAT TAATGGCACA AAAAGATAAA 7200

ATTGCAATGA TTGAAACATT AAATAATGGT AAACCGATTC GTGAGACAAC AGCAATTGAT 7260
ATTCCATTTG CTGCAAGACA TTTCCATTAT TTCGCAAGTG TTATTGAAAC AGAAGAAGGT 7320

ACAGTGAATG ATATCGATAA AGACACAATG AGTATCGTAC GACATGAGCC GATTGGCGTC 7380

GTAGGTGCTG TTGTTGCTTG GAACTTCCCA ATGCTATTAG CTGCATGGAA GATTGCGCCA 7440

gCCATTGCTG CAGGTAATAC AATTGTGATT CAACCTTCGT CTTCAACACC ATTAAGTTTA 7500

TTGGAAGTTG CTAAAATTTT CCAAGAGGTA TTACCTAAAG GTGTTGTCAA TATACTAACG 7560

GGTAAAGGTT CAGAATCAGG TAATGCAATT TTCAATCATG ATGGTGTAGA TAAATTATCA 7620

TTTACGGGCT CAACTGATGT AGGTTATCAA GTTGCCGAAG CTGCAGCAAA ACATCTAGTA 7680 

CCCGCTACAT TAGAGCTTGG TGGTAAAAGC GCCAATATCA TATTAGXGA TGCTAATTTA 7740

GACCTTGCAG TTGAAGGTAT TCAGTTAGGT ATTTTATTCA ACCAAGGTGA AGTATGTAGT 7800

GCAGGTTCTC GATTATTAGT TCATGAAAAA ATTTATGATC AATTGGTGCC ACGTTTACAA 7860

GAGGCATTTT CAAATATTAA AGTTGGAAAT CCACAAGATG AAGCTACACA AATGGGTAGT 7920

CAAACTGGTA AGGATCAATT AGATAAAATT CAATCATATA TTGATGCAGC AAAAGAATCA 7980

GATGCACAAA TTTTAGCAGG CGGTCATCGC TTAACTGAAA ATGGATTAGA TAAAGGGTTC 8040

TTCTTTGAGC CGACATTAAT TGctGTGCCA GACAATCATC ACAAATTAGC ACAAGAAGAA 8100 

ATATTTGGAC CAGTGTTAAC AGTGATTASk GTGAAGGACG ATCAAGAAGC AATTGATATA 8160

GCTAATGATT CTGAGTATGG TTTAGCAGGC GGTGTATTTT CTCAAAATAT CACACGTGCA 8220 

TTAAATATTG CTAAAGCTGT ACGTACAGGA CGTATTTGGA TTAACACTTA CAACCAAGTA 8280

CCAGAAGGCG CACCATTTGG TGGTTATAAA AAATCAGGTA TCGGTCGAGA AACTATAAA 8340

GGTGCGTTAA GTAACTATCA ACAAGTTAAA AATATTTATA TTGATACAAG CAATGCTTTA 8400

AAAGGTTTGT ACTAGAATAA ATATCGTTTC TGAAGCGTGT TTGTAGGTCA GTCTAGCGGT 8460

AAGTCTTAAC ATTTAACGGC GTTGTTTAGA TTTTAAGCAA AACAAAATAT ATAGGAACAC 8520

GTATCATGAT ATTAGGATAT AATGACTAAA ATAATAGCAG TAGGATGGTT TTTAATTGCA 8580

AATCATCTTA CTGCTGTTTT TAATTATGCT AATTTGCGAT GCGGCTATTA TAAGGACAGA 8640

GTTGTTTATT AATTATGGTG ATTTAGAAAT ATGAAGTTCA ATATGCAAAG TCATCGTTTG 8700 

TTTTAATATG CGGAACAATC ATTAAAGTTA TTGCGS'TTT TTGAACTTAA TGAAACTAAA 8760

CAATAAATTT GAGATACTTT TTTGTCATTT TTATGTAACT AACACAATAA TCTCGTACAT 8820 

TATTAAAATT TTCTATATGA TAGGAATAAA GCAAAGCGCG AGTGTGCTGT AAAAGTTTTC 8880

CAAGGTGATA TTACATAAAG CTATAAAGGG TAAAGATTAA TGAGTTGTCA TGTAAATGAC 8940

GATGATGTAT AAATCATGGT TAATTACGGA AGCATTAATA TTAACCTGAG AAGCTATAAA 9000 

GAATTATTTT TAAAAGCGAC AATATTAAAT ACGACGCATT TATTTAGGAG TGGCAAACGT 9060 



ATGAATGGGA AAAAGGCGAA TACGATAAAC AGATACAAAT ATTTTCATCA TGTCAATCAT 9120

CAAAAAATTC AACAAAGTTC TAAAAAGACG CTGTGGGCAT CACTAATCAT CACATTGTTA 9180

TTTACAGTGA TTGAATTTGT CGGAGGTTTA GTATCTAATt CATTGGCATT ACTGTCAGAT 9240

TCATTTCATA TGCTTAGTGA TGTATTAGCA CTTGGTTTAT CTATGTTGGC CATTTATTTT 9300

GCAAGTAAAA AGCCGACTGC ACGATACACA TTTGGATATT TAEATTTGA GATATTAGCT 9360

GCATTTTTAA ATGGTTTAGC ATTAATTGTA ATTTCAATCT GGATTTTATA TGAAGCTATT 9420

GTACGTATTA TTTATCCGCA ACCAATTGAA AGTGGCATTA TGTTTATGAT TGCTAGTATT 9480

GGTTTACTCG TCAATATTAT TTTGACTGTT ATCCTTGTAA GGTCTTTAAA ACAAGAAGAC 9540

AATATCAATA TTCAAAGTGC ATTATGGCAT TTCATGGGAG ACTTATTGAA CTCTATTGGT 9600

GTCATCGTTG CAGTTGTATT GATTTACTTT ACAGGATGGC GCATCATCGA CCCAATCATT 9660

AGTATTGTAA TTTCACTCAT CATTTTACGT GGTGGTTATA AAATTACGCG TAATGCgTGG 9720

tTAATTTTAA TGGAAAGTGT GCCTCAACAT TTGGATACTG ATCAAATTAT GGCAGATATT 9780 

AAAAACATAG ATGGCATATT AGATGTACAT GAATTTCATT TGTGGAGTAT TACAACAGAG 9840

CATTATTCAT TAAGTGCCCA TGTTGTGTTA GATAAAAAAT ATGAGGGTGA TGATTATCAA 9900

GCGATTGATC AAGTATCATC ATTGTTGAAA GAAAAATATG GCATTGCACA TTCAACGTTG 9960

CAAATTGAAA ACTTGCAATT GAATCCATTA GATGAGCCAT ACTTCGACAA ATTAACATAA 10020

ATAAAACATT GTAGCGCCTA AAACATTAAT CTATGTCATA GGCGCACGTT TCGTTTTATA 10080

CTTATGTTGC ATCATTTAAA TGATTTTCGT CAATTTCTTT GATGCTATCT ACATCTAACA 10140

CGACATCTTT AGGTTTCAAA ATATGAATAT GTTTTTCATC ATTTGTATGT AAAATGCGTT 10200

CTATGATGTA CCTTTGACCG GCCATTGTTT CTACAGCAAT CTTTTTGTTT CTAGCTAAAC 10260

TTGCTACGAC AGATTCTTTA TCCATAATGA TAGCCCCCTA TATATATGTT TATTTACTTA 10320

TACCCTAACA TGATTTTTAT ACTCTTTGAA AffATATTTT ACAGAATTTT ATCTAAATAT 10380 

TTAAAAAAAT ATCTTAATAT CCTTGTAATC CGATAAGAAT TATAGTAATA TTTTTTCAAC 10440

CATtGTTATA GGAGGTCTTA TTAATGACAT TATTTTTATT AGAAGCTAAC AATCTTGATT 10500 

TTGCATCAAC GAAAGAAGAA CTAGAAGCAA AGGCAGCATC ACTATCTACG AAGACAATIC 10560

CAACATTAAT TGAAGTACAA GCTACTGAAA ATTTAACTCA TGGTTATTTT ATTGTGGAAG 10620

CAAATGACGA aGCAGAAGCT AAACAATTTT TAACAGAAGC AGATATTAGT ATTCAATTAG 10680

TCAAAGAAGT ACGCTTAGTT GGTAAAGATT TAGATGAAGT TAAAAATGGT GATGCACATG 10740

TTGATTACCT TGTAACTTGG AACATTCCGG AAGGCATTAC GATGGATCAA TATTTAGCAC 10800

GTAAAAAGAA AAATTCTGTT CATTATGAAG AAGTGCCAGA AGTTGAATTT AAACGCACAT 10860

ATGTATGTGA AGATATGTCT AAATGTATTT GTTTATACAA CGCACCTGAT GAAGAAGCGG 10920
TACGTCGCGC GCGCAAAGCA GTTGATACAC CGATTGATGG CATCGAAAAA CTTTAATAAG 10980

ACAACAAGTT GATGAGATAT ATGTATATAG GTTTGGCATG GATTTCGATT GCAGTTAATT 11040

AGAATAGCTC AATGCTATAA ATGTAAGTAG TTGATATGAA GAAACTAATG AACTAAATGC 11100

AAGTATTGTC TAAAACAATC ATTTTATTGA AATTTAGTAG AGCTGAAATT AATATAACGT 11160

CGTTAATTGA ATAACGCTTA TGTTATAAGA GCACTCATAC CAAACCATAA TCATCTATAG 11220 

ATATAACAAT TCACGATATA AGGGCTGTGT TTGGCATAGC CCTTTAGATA TACACTTAAT 11280 

TCCTATTAAA ATAGTAGGGA TTAAAAGGGG GCTTGTCATG ATTAAAATTC AACAATTACA 11340

ACATCACTTT GGATCACATA ffiGTAATTCA TAACTTTAAT TTGGACATTA GCAAGGGAGA 11400

AATAGTCACT TTCATAGGGA AAAGTGGTTG CGGAAAGTCT ACTTTACTCA ATATTATCGG 11460

TGGATTTATT CATCCATCGT CTGGTCGTGT CATTATTGAT AACGAAATTA AACAACAGCC 11520

ATCTCCAGAT TGTTTAATGC TATTTCAACA TCATAATTTG CTGCCAffiGA AAACGATTAA 11580

TGACAACATT AGGATTGGAT TACAACAGAA AATTAGTGAT GAAGAGATTA ACGCACAGCT 11640 

TAAATTAGTT GATTTAGAAG ACAGGGGAAA GCATTTTCCC GAGCAACTGT CCGGGGGTAT 11700 

GAAACAACGT GTGGCACTAT GTCGAGCGCA TGTGCATAAG CCTAACGTTA TATTGATGGA 11760

TGAGCCATTA GGTGCATTAG ATGCATTTAC ACGTTATAAA CTTCAGGATC AACTAGTGCA 11820 

aCTAAAACAT AAAACGCAAT CAACTATTAT TTTAGTGACG CATGACATTG ATGAAGCTAT 11880 

TTATCTTTCC GACCGCATTG TTCTGTTAGG TGAAGGGTGC AATATTATTT CTCAATATGA 11940

AATTACAGCA TCACATCCAC GCAGTCGTSi TGATAGCCAC CTACTTAAGA TTCGTAATGA 12000

AATTATGGAA ACATTTGCAT TGAATCATCA TCAAGTTGAA CCTGAATATT ATTTATAAGG 12060

AGTGAGTGAC GATGAAAAGG TTAAGCATAA TCGTCATCAT TGGAATCTTT ATAATTACAG 12120

GATGTGATTG GCAAAGGACG TCTAAAGAAC GGTCTAAAAA TGCCCAAAAT CAGKAGTGA 12180 

TTAAAATTGG ATATTTGCCG ATTACACATT CAGCTAATTT GATGATGACT AAAAAATTAT 12240

TATCACAATA CAATCATCCG AAATATAAAC TAGAATTAGT TAAATTCAAT AATTGGCCAG 12300 

ATTTAATGGA CGCATTAAAC AGTGGTCGTA TTGATGGTGC ATCAACTTTA ATAGAGCTAG 12360 

CGATGAAATC AAAACAGAAG GGCTCAAATA TAAAGGCTGT GGCATTGGGC CATCATGAAG 12420

GCAATGTCAT TATGGGACAA AAAGGTATGC ACTTAAATGA ATTTAATAAT AATGGCGATG 12480

ATTACCATTT TGGTATACCA CATCGTTATT CAACACATTA TCTTTTACTT GAGGAATTAC 12540

GTAAACAATT AAAGATTAAA CCGGGGCATT TTAGCATCA TGAAATGTCG CCAGCAGAAA 12600

TGCCAGCCGC ATTGAGTGAA CACAGAATTA CAGGGTATTC TGTAGCCGAA CCATTCGGTG 12660

CACTGGGTGA AAAGTTAGGC AAAGGTAAGA CTTTGAAACA TGGTGATGAC GTTATACCTG 12720
ATGCGTATTG CTGTGTGCTA GTACTGAGAG GGGAATTGCT TGATCAACAC AAGGATGTAG 12780
CGCAAgCATT TGTACAAGAT TATAAAAAGT CTGGCTTTAA AATGAATGAT CGCAAGCAAA 12840

GTGTAGACAT TATGACGCAT CATTTTAAAC AAAGTCGTGA CGTTTTAACA CAGTCAGCGG 12900

CATGGACATC CTATGGTGAT TTAACAATTA AGCCATCCGG CTATCAAGAA ATTACGACAT 12960

TGGTAAAACA ACATCATTTG TTTAATCCAC CTGCATATGA TGACTTTGTT GAACCGTCAT 13020 
TGTATAAGGA GGCATCGCGT TCATGACACG TCCCACAAAT AACAAATTTA TATTACCTAT 13080 
TATCACATTT ATTATTTTCT TAGGCATTTG GGAAATGGTC ATTATTATTG GGCATTACCA 13140 

ACCTGTATTG TTACCGGGTC CTGCTCTTGT AGGAAAAAGT ATffiGGTCTT TCATTGTTAC 13200 
TGGAGAAATT TTCCAACATT TAGCAATTAG TTTATGGAGA TTTGTAGCGG GCTTTGTTGT 13260
CGCATTGTTG GTTGCTATTC CATTGGGCTT CTTGCTTGGA AGGAATCGTT GGCTATACAA 13320 
CGCTATCGAA CCGCTATTTC AATTGATTAG GCCGATATCT CCGATAGCAT GGGCACCATT 13380 
TGTTGTTCTA TGGTTTGGTA TTGGTAGTTT GCCAGCGATT GCGATTATTT TTATCGCTGC 13440
TTTTTTCCCA ATTGTGTTCA ATACTATTAA AGGCGTTAGA GACATTGAAC CTCAATATTT 13500 
AAAAATAGCA GCAAATTTAA ATTTAACTGG GTGGTCATTG TATCGCAATA TATTATTTCC 13560 

CGGGGCATTT AAACAAATCA TGGCTSGGAT ACATATGGCG GTAGGAACAA GTTGGATATT 13620
TTTAGTTTCT GGTGAAATGA TTGGTGCACA ATCGGGATTA GGTTTTTTAA TCGTTGATGC 13680

ACGAAATATG TTGAACTTAG AAGATGTTTT AGCAGCAATA TTCTTTATCG GATTATTTGG 13740 

TTTTATTATT GATCGATTCA TTAGTTATAT TGAGCAGTTT ATACTTAGAA GATTTGGTGA 13800
ATAAGGAGAG ATGATGATGA CTTTAGAAAC GCTTATCAAA GAACAATTAG ATCCTCATTT 13860

AGTAGAAGTT GATGAAGGGA CGTATTATCC GAGAACATTT ATTCAGCAAT TATTTGTAGA 13920
TGGTTATTTC GGTGAGGCGG CATTGAGAAA AAATGCTGAA GTAATCGAAG CTGTATCGCA 13980

GTCTTGTTTG ACAACAGGAT TTTGTTTATG GTGCCAATTA GCTTTTTCAA CGTATTTAGA 14040
AAATGCCACG CAGCCACATT TAAATAATGA CTTACAACAG CAATTGTTAT CTGGAGAAAT 14100
ATTAGGTGCT ACCGGATTGT CTAATCCGAT GAAGTCATTT AATGATTTAG AAAAGTTGAA 14160

CCTTGAACAC ACTTATGTTG ATGGACAATT GG?TGTCAGT GGACGTATGC CAGCTGTAAG 14220

TAATATTCAA GAAGACCATT ATTTTGGTGC GATTTCGAAA CATGAATCAT CAGATGAATT 14280 

TGTCATGTTC ATTCTACGTG CCAATCAAGA TGGTATCACT CTTGTTGAAA AAACCAATTT 14340

TTTAGGAGTC AACGGGTCAG CAACGTATCA AATCACATTG AATCAAGTCG TAGTGCOSA 14400

ATCACAAATT ATCACGCATG ATGCGAAGCA GTTTGCGGCA ACTATTCGCC CGCAATTTAT 14460

TGCTTACCAA ATTCCAATAG GATTAGGCTC AATTAAAAGT TCTTTAGAGT TAATTGATGC 14520

ATTTTCAAAT GTGCAAAACG GAATAAATCA ATATTTAGAG TATGATGTTG AAGCTTTTAA 14580 
AAAACGTTAT CGTCAACTTA GAGAGGAATA TTATGCAATA TTAGATGACG GTAACTTAAC 14640

TTCACATTTA AATGAATTAA TATCATTGAA GAAGGACATC GGCTATTTAT TGTTAGATGT 14700

AAATCAAGCT TCTGTTGTCA ATGGTGGTTC TAGAGCGTAC ACACCATATT CGCCACAAGT 14760

TCGCAAGTTA AAAGAAGGAT TCTTCTTCGC AGCATTGACA CCGACATTAA GACATTTAGG 14820

TAAACTTGAA GCAGAGTTGA AGGGGTAAGT GTGATAAGCT GATTTTTTGT TTAGATGCGT 14880

TTGTTGAAAC ATTTTTTAAA ATAATATAAA TCTTAGTTTA TAAACATTTT CTGTTAATTT 14940

GTTATATCCT TTTAACTAGG AAAATATACA TTTCGTAATA ATAATAATCG TTATCATTGA 15000

AAAAGTGTTA ATAAGGTGTA TAATGAAAAT GTGAACAATT AATGAACTTC TTATTTTAAA 15060 

GAAGGTGAAT ACTATAGATA CGCATACTAA AGAACAACAA TTCTCGAATC TAGTAAGATC 15120

TTATCGTAAA GAATACGTGG GTAAAGGACC CAATAGTATT CGAGTGTCGT TTAAAGATAA 15180

TTGGGCGATT GCACATATGA CAGGTGTTTT GAGTAAAGTT GAGAGTTTTT ACCTAAACGA 15240

CAAACGCAAT GAATCGATGC TCCATTATAC ACGCACAGAG AAGATTAAAC AGATGTATAA 15300

AGAAATAGAT GTAAATGAGA TGGAAAGTCT TGTAGGCGCT AAGTTTGTAA AATTATTTAC 15360 

AGATATTGAT TTGAATGATG ATGAAGTCAT TTCAATATTT GTTTTCBTA AGTCAATAGA 15420 

ATAAGTGTTG CTGGTGTAAG GTACACGGTG CTGTTTGCTA ACTTCGCTTT GAATTTAACA 15480 

ATAATTCAAG GGGGTGGTAT GTCAAACGGT GCCGTTTTTT TGTCATATTT TTAAAACAAG 15540 

CAACATGCAA CACGTACTTT AAGGAAGTCA AAATTTATCA TTTAGGAGAG ATGGATATGA 15600

AAATCGTAGC ATTATTTCCA GAAGCAGTAG AAGGTCAAGA AAATCAATTA CTTAATACTA 15660

AAAAAGCATT AGGATTAAAA ACATTTTTAG AGGAAAGAGG ACATGAGTTC ATTATATTAG 15720

CAGATAATGG TGAAGACTTA GATAAACATT TACCAGATAT GGATGTGATT ATTAGTGCGC 15780 

CATTTTATCC TGCATATATG ACTCGTGAA: GTATTGAAAA AGCACCGAAC TTGAAATTAG 15840 

CAATTACAGC AGGTGTAGGA TCTGACCATG TAGATTTAGC GGCAGCAAGT GAACACAATA 15900

TTGGTGTCGT TGAAGTTACA GGAAGTAATA CAGTTAGTGT GGCAGAACAT GCGGTTATGG 15960

ATTTATTAAT ACTTCTTAGA AACTATGAAG AAGGTCATCG TCAATCAGTA GAAGTGAAT 16020

GGAACTTGTC TCAAGTAGGT AATCATGCGC ATGAATTACA ACACAAAACA ATTGGTATTT 16080 

TTGGATTTGG TCGAATTGGA CAACTTGTTG CTGAAAGATT AGCGCCATTT AATGTAACAT 16140 

TACAACACTA TGATCCAATC AATCAACAAG ACCATAAATT GTCTAAATTT GTAAGCTTTG 16200 

ATGAACTTGT TTCAACAAGT GATGCGATTA CAATTCATGC ACCATTAACA CCAGAAACTG 16260

ATAACTTATT TGATAAAGAT GTTTTAAGTC GTATGAAAAA ACACAGTTAT TTAGTGAATA 16320

CTGCACGTGG TAAAATTGTA AATCGCGATG CGTTAGTTGA AGCGTTAgCA TCCGAGCATT 16380
TACAAGGATA TGCTGGTGAT GTTTGGTATC CaCAACtGC ACCTGCTGAT CATCCATGGA 16440

GAACAATGCC TAGAAATGCT ATGACGGTTC ACTATTCAGG TATGACTTTA GAAGCACAAA 16500 

AACGTATTGA AGATGGAGTT AAAGATATTT TAGAGCGTTT CTTCAATCAT GAACCTTTCC 16560

AAGATAAAGA TATTATTGTT GCAAGTGGTC GTATTGCTAG TAAAAGTTAT ACAGCTAAAT 16620

AGAATAAGGA TGCTGGGCTA GCGATTAACG CTTTCAATTT TATATAAATG AATCATATAA 16680

GCACTACTGC TGTTGTAAAG ATGGCAGTAG TTTTTTTATG ATTACATCTA AGTATAGTCA 16740 

CGGCTATGTT AGGACAATGA TTTAACATTT ACGCACATAT GTGTTCACTT ACGCAATTAT 16800 

TGAnAAATnT CATTCATGTG GnAATC 16826
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4012 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

TTCAATGAGA GTAGTGGGCT GATGTTTAGC GATATCGCGT AAGATTAACC ATTGGCCATA 60 

ATATATATTG TGTTTTTCTA AAATCGGCTC GGCTAATTTT AAATAGGGGC GATATATTGT 120 

TATAAAACTA TTGAAAAATT CTTGTGATAG CATAGTGACA TCTCCTAAGA CAAAATAGTT 180

AGCTTAGCTA mCCTTTTTAC AACAATAGTA ATTATAAAAC GGGAGCAATT AGAAATCAAT 240

ATATAATTAT TAAGAGCAAA AATAATTATA CTTTGTTAAA ATAAGCGTAA TTACATGTAA 300

ATAGGGGGAT ACTAATGATA TTGAAATTTG aTCACATCAT TCATTATATA GATCAGTTAG 360

ATCGGTTTAG TTTTCCAGGA GATGTTATAA AATTACATTC AGGTGGGTAT CATCATAAAT 420

ATGGAACATT CAATAAATTA GGTTATATCA ATGAAAATTA TATTGAGCTA CTAGATGTAG 480

AAAATAATGA AAAGTTGAAA AAGATGGCAA AAACGATAGA mGGCGGAGTC GCTTTTGCTA 540

CTCAAATTGT TCAAGAGAAG TATGAGCAAG GCTTTAAAAA TATTTGTTTG CGTACAAATG 600

ATATAGAGGC AGTTAAAAAT AAACTACAAA GTGAGCAGGT TGAAGTAGTA GGGCCGATTC 660

AAATGGAAAG AGATACACAT AAAGATGGTA AGGTAAAGTG GCAATTGCTT TATATAATGA 720 

ATCAGGATGA TGATGAAATT AAGCCACCAT TTTTTATTCA ATGGGAASA AGTGATTCCA 780 

TGCGTACTAA AAAATTGCAA AAATATTTTC AAAAACAATT TTCAATTGAA ACTGTTATTG 840

TGAAAAGTAA AAACCGATCA CAAACAGTAT CGAATTGGTT GAAATGGTTT GATATGGACA 900

TTGTAGAAGA GAATGACCAT TACACAGATT TGATTTTAAA AAATGATGAT ATTTATTTTA 960

GAATTGAAGA TGGTAAAGTT TCAAAATATC ATTCGGTTAT CATAAAAGAC GCACAAGCAA 1020



CTTCACCATA TTCAATTTTT ATCAGAGGTG CTATTTATCG CTTTGAACCA TTAGTATAAA 1080 

TATACGTAAG TGCTATGAGC GAGAATGCCC ATATGAATAA TGACAAGCAC AATGGAAAGA 1140

ATCGTTAATA TATTATTTAA TCGTGATGAC TTAATTAAAA TGAAAAAGAT TGATAATATA 1200

AATGTGAAAA AGATAAGTAT AACCCGTAAA CTAAAGTAAT TCACGGTGAG AGGTTGACTC 1260

AATGTCATAA TGATTGCAAC GATGTTCATA ATTATAAATA GACTTAAAAT AATTGTTCTC 1320

ATATCAAACA CCTCATTGTT AGATTATTGA CATTATAACA GGGGTAATTG TATAGAACA 1380 

TTAATGTGGT TGCTTGAGGA AAAATTTATT CATTGAAGTC AAGTTGGTTC ATTTTAGAAA 1440

TGAATATCGT GTTAGATGAT GAAAGTATAT TGAAGTATAG GTAACTAGTT GAAAAGTATT 1500

AATTGTACGA TAACATTAAA TTTAACACGA AACATAGATA TAAAATGATT CACAATTAAA 1560

ATGGGTAAAT TTGAACTTGC TAAACTATTA ATTGGAGCAT GGACATTTCA AAAATAAGAG 1620

TTCAAATCTT ACACAAGCTC TGAATCGACA CTATAAGATA CAAACTGTAT AATTAAAGGT 1680

ATTGTTAAAT AGAAGGAGAT ATCATAAATC ATGGAAAAGA TGCATATCAC TAATCAGGAA 1740

CATGACGCAT TTGTTAAATC CCACCCAAAT GGAGATTAT TACAATTAAC GAAATGGGCA 1800 

GAAACAAAGA AATTAACTGG ATGGTACGCG CGAAGAATCG CTGTAGGTCG TGACGGTGAA 1860

GTTCAGGGTG TTGCGCAGTT ACTTTTTAAA AAAGTACCTA AATTACCTTA TACGCTATGT 1920

TATATTTCGC GTGGTTTTGT TGTTGATTAT AGTAATAAAG AAGCGTTAAA TGCATTGTTA 1980

GACAGTGCAA AAGAAATTGC TAAAGCTGAG AAAGCGTATG CAATTAAAAT CGATCCTGAT 2040

GTTGAAGTTG ATAAAGGTAC AGATGCTTTG CAAAATTTGA AAGCGCTTGG TTTTAAACAT 2100 

AAAGGATTTA AAGAAGGTTT ATCAAAAGAC TACATCCAAC CACGTATGAC TATGATTACA 2160

CCAATTGATA AAAATGATGA TGAGTTATTA AATAGTTTTG AACGCCGAAA TCGTTCAAAA 2220

GTGCGCTTGG CTTTAAAGCG AGGTACGACA GTAGAACGAT CTGATAGAGA AGGTTTAAAA 2280

ACATTTGCTG AGTTAATGAA AATCACTGGG GAACGCGATG GCTTCTTAAC GCGTGATATT 2340

AGTTACTTTG AAAATATTTA TGATGCGTTG CATGAAGATG GAGffiGCTGA ACTATTTTTA 2400

GTAAAGTTGG ATCCAAAAGA AAATATAGCG AAAGTAAATC AAGAATTGAA TGAACTTCAT 2460

GCCGAAATTG CTAAATGGCA GCAGAAGATG AAAACATCTG AAAAGCAAGC TAAAAAAGCG 2520

CAAAATATGA TTAATGATGC GCAAAATAAA ATTGCTAAAA ATGAAGATTT AAAACGAGAC 2580

CTAGAAGCTT TAGAAAAGGA ACATCCTGAA GGTATTTATC TTTCTGGTGC ACTATTAATG 2640

TTTGCTGGCT CAAAATCATA TTACTTATAT GGTGCGTCTT CTAATGAATT TAGAGATTTT 2700

TTACCAAATC ATCATATGCA GTATACGATG ATGAAGTATG CACGTGAACA TGGTGCAACA 2760

ACTTACGATT TCGGTGGTAC AGATASTGAT CCAGATAAAG ACTCAGAACA TTATGGATTA 2820
TGGGCATTTA AAAAAGTGTG GGGAACATAC TTAAGTGAAA AGATTGGTGA ATTTGATTAT 2880

GTATTGAATC AGCCATTGTA CCAATTAATT GAGCAAGTTA AACCGCGTTT AACAAAAGCT 2940

AAAATTAAAA TATCTCGTAA ATTAAAACGA AAATAGATTA ACGACTGAAA CCTGAACGCT 3000

CATAAGACTG TCATTTGCGT TCAGATTTTT TTACACAATA TAGAATGGTT GAGTAAAATA 3060

TTTTTGAATA TAGTGAAAGA GGGGGAAGTA CTGTGATAAA AAAGCTATTA CAATTTTCTT 3120

TAGGGAATAA GTTTGCTATC TTTTTAATGG TTGTTTTAGT TGTCTTGGGC GGTGTATATG 3180 

CGAGTGCTAA ATTGAAATTA GAATTACTAC CAAATGTACA AAATCCAGTT ATTTCAGTTA 3240

CAACAACAAT GCCGGGTGCA ACGCCACAAA GTACCCAAGA TGAAATAAGT AGTAAAATTG 3300

ACAATCAAGT AAGATCATTG GCATATGTGA AAAATGTTAA AACGCAATCC ATACAAAATG 3360

CTTCAATTGT AACAGTTGAA TATGAAAATA ATJCAGATAT GGATAAAGCA GAAGAACAGC 3420

TTAAAAAAGA AATCGATAAA ATTAAATTTA AAGATGAAGT TGGTCAACCA GAATTAAGAC 3480

GTAATTCGAT GGATGCTTTT CCGGTTTTAG CATATTCATT TTCAAATAAA GAGAATGACT 3540

TGAAAAAAGT AACGAAAGTA CTGAATGAAC AATTAATACC AAAATTGCAA ACGGTAGACT 3600

GTGTGCAAAA TGCGCAATTA AATGGGCAGA CGAACCGTGA AATCACCCTT AAATTTAAGC 3660

AAAATGAACT TGAAAAATAT GGGTTGACTG CTGATGATGT AGAAAACTAT CTAAAAACGG 3720 

CAACAAGAAC AACGCCACTT GGATTGTTCC AATTTGGTGA TAAAGATAAT CAATTGTTGT 3780

TGATGGTCAA TATCRATCTG TTGATGCTTT TAAAAACATA AATATTCCAT TAACGTGGCA 3840 

GGAGGACCAA GGGCATCTCA TCCCAAAGTG ACCATAAACC AAATTCAGCC ATGTCAGACG 3900

TTATCAGGCA TCACCACAGC AAATTCAAAG CGTCAGCnCC AATATATAGT GGATGCCGCA 3960 

nGAACTAGGG GTTTAGCGnT ATCAGTGGTG TGGCGACTCT ATTCTAAACG AT 4012
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7778 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

CAATATAGGT CGCCGAGTTT CAACTaCATC AACTGGTTCA GTTACATTAG ATAATGCGCT 60 

AGGTGTAGGT GGCTATCCTA AAGGACGAAT TATTGAAATT TATGGTCCTG AAAGTTCTGG 120 

TAAGACAACA GTAGCGCTTC ACGCTATTGC TGAAGTACAA AGTAATGGCG GGGTGGCAGC 180 

ATTTATCGAT GCTGAACATG CTTTAGATCC AGAATATGCT CAAGCATTAG GCGTAGATAT 240

CGATAATTTA TATTTATCGC AACCGGATCA TGGTGAACAA GGTCTTGAAA TCGCCGAAGC 300 
ATTTGTTAGA AGTGGTGCAG TTGATATTGT AGTTGTAGAC TCAGTTGCTG CTTTAACACC 360

TAAAGCTGAA ATTGAAGGAG AAATGGGAGA CACTCACGTT GGTTTACAAG CTCGTTTAAT 420 

GTCACAAGCG TTACGTAAAC TTTCAGGTGC TATTTCTAAA TCAAATACAA CTGCTATTTT 480

CATCAACCAA ATTCGTGAAA AAGTTGGTGT TATGTTCGGT AATCCAGAGA CTACACCAGG 540

TGGACGTGCA TTAAAATTCT ATAGTTCAGT AAGACTAGAA GTACCECGTG CAGAACAGCT 600

TAAACAAGGA CAAGAAATTG TAGGTAATAG AACTAAAATT AAAGTCGTTA AAAATAAAGT 660

GGCACCACCA TTTAGAGTAG CTGAAGTTGA TATTATGTAT GGACAAGGTA TTTCTAAAGA 720

GGGTGAACTT ATTGATTTAG GTGTTGAAAA CGACATCGTT GaTAAATCAG GAGCATGGTA 780

TTCTTACAAT GGCGAACGAA TGGGTCAAGG TAAGGAAAAT GTTAAAATGT ACTTGAAAGA 840

AAATCCACAA ATTAAAGAAG AAATTGATCG TAAATTGAGA GAAAAATTAG GTATATCTGA 900

TGGTGATGTT GAAGAAACAG AAGATGCACC AAAGTCATTA TTTGACGAAG AATAGTACAC 960

AAATTTATAT CTATAGTTAA ACTTAGCAAA TATCCTTATA GGATTGATTG AAAGTGATAT 1020 

TCATCTCATA AAGCTAGAAT AATATCTAAC TTTATGGGAT ACACTACAAA TCGAGACTAT 1080

AAGGTTTTTT ATTTTATTTA TTATTACATT ATCAATAGTT TTATAATCGA GCTTCAAAAC 1140

TTTAGAAAAT AGTAGAAATA GCATTCAATA TAGTGCAAAA GTGCAAATTG ASACTTGAC 1200 

ACTTATCTCC TATAAACCGT ACAATTAATT TGTATGATTT ATATATAATT TCATAAAGTC 1260

ATATTGAATT TCATATAAAG AGCAAACCCT AGAAAAGGAG GTGTTTGTGT GAATTTATTA 1320

AGCCTCCTAC TCATTTTGCT GGGGATCATT CTAGGAGTTG TTGGAGGGTA TGTTGTTGCC 1380

CGAAATTTGT TGCTTCAAAA GCAATCACAA GCTAGACAAA CTGCCGAAGA TATTGTAAAT 1440

CAAGCACATA AAGAAGCTGA CAATATCAAA AAAGAGAAAT TACTTGAGGC AAAAGAAGAA 1500

AACCAAATCC TAAGAGAACA AACTGAAGCA GAACTACGAG AAAGACGTAG CGAACTTCAA 1560

AGACAAGAAA CCCGACTTCT TCAAAAAGAA GAAACTTAG AGCGCAAATC TGATCTATTA 1620 

GATAAAAAAG ATGAGATTTT AGAGCAAAAA GAATCAAAAA TTGAAGAAAA ACAACAACAA 1680

GTAGATGCAA AAGAGAGTAG TGTTCAAACG TTAATAATGA AGCATGAACA AGAATTAGAA 1740

CGCATCTCCG GTCTCACTCA AGAAGAAGCT ATTAATGAGC AACTTCAAAG AGTAGAGGAA 1800

GAACTGTCAC AAGATATTGC AGTACTTGTT AAAGAAAAAG AAAAAGAAGC TAAAGAAAAA 1860

GTTGATAAAA CAGCAAAAGA ATTATTAGCT ACAGCAGTAC AAAGATTAGC AGCAGATCAC 1920

ACAAGTGAAT CAACGGTATC AGTAGTTAAC TTACCTAATG AT GAG AT G7AA AGGTCGAATC 1980

ATTGGACGAG AAGGACGAAA CATCCGCACA CTTGAAACTT TAACTGGCAT TGATTTAATT 2040

ATTGATGACA CACCAGAAGC GGTTATATTA TCTGGTTTTG ATCCAATAAG AAGAGAAATT 2100
GCTAGAACAG CACTTGTTAA CTTAGTATCT GATGGACGTA TTCATCCAGG TAGAATTGAA 2160

GATATGGTCG AAAAAGCTAG AAAAGAAGTA GACGATATTA TAGAGAAGC AGGTGAACAA 2220

GCTACATTTG AAGTGAACGC ACATAATATG CATCCTGACT TAGTAAAAAT TGTAGGGCGT 2280

TTAAACTATC GTACGAGTTA CGGTCAAAAT GTACTTAAAC ATTCAATTGA AGTTGCGCAT 2340

CTTGCTAGTA TGTTAGCTGC TGAGCTAGGC GAAGATGAGA CATTAGCGAA ACGAGCTGGA 2400

CTTTTACATG ATGTTGGTAA AGCAATTGAT CATGAAGTAG AAGGTAGTCA TGTTGAAATC 2460

GGTGTAGAAT TAGCGAAAAA ATATGGTGAA AATGAAACAG TTATTAATGC AATCCATTCT 2520

CATCATGGTG ATGTTGAACC TACATCTATT ATATCTATCC TTGTTGCTGC TGCAGATGCA 2580

TTGTCTGCGG CTCGTCCAGG TGCAAGAAAA GAAACATTAG AGAATTATAT TCGTCGATTA 2640

GAACGTTTAG AAACGTTATC AGAAAGTTAT GATGGTGTAG AAAAAGCATT TGCGATTCAG 2700

GCAGGTAGAG AAATCCGAGT GATTGTATCT CCTGAAGAAA TTGATGATTT AAAATCTTAT 2760

CGATTGGCTA GAGATATTAA AAATCAGATT GAAGATGAAT TACAATATC TGGTCATATC 2820

AAGGTGACAG TTGTTCGAGA GACTAGAGCA GTAGAATATG CGAAATAATT TTTGTCTCCC 2880

TCACAAATTA GTGAGGGAGC TTTTTTAAGT TGTAGTCTTA AtCTAGTTAG ACAGCACTTT 2940

ATCGGTAATA ACTATATTAA ACAGTAGTTA TTTGAAAGTA AGACGGACCT TATATTAAAT 3000

AAGAAGTTAT TGCTTTTAAT AAAAATGTTT TAGGCTTCGT AATTACTATA TTTATATTAT 3060

GTAAACCTAT AAAGATGATT GGTTTTCTAT CCAATAAAAA AGAAGAGAAG ATGTAACACA 3120

TCTTCTCTTC yGCAATATTA ATTAGGATTT ATTTCTAAGT TGAGTTATTT TAATTGTAAA 3180 

TCTGTTTTCT TTAATTCTTT TATAACTTCT GCAGTATCAT AACAATTTGT TGCAATTGTT 3240 

GAATATCTCT CTGCTAAACG ATATGCATTA ATGTAAAGCT TTAAACTTTC TTTAGCTATA 3300 

TCCTCTGCAT CTTCGAATTT TGATGGGTTA GACATAACCA CTAATTCTGC AAATTTTTCT 3360

GGATCAATAT TAATAGACAT GTATTTATTT ACAACTCCTA TTTATTTTGA TGTCTAATA 3420

CTAACATATT GAAGTTTTCA GACAAAGTAA TGTCTCTCTA TAATTGAAGA AAAATAATTC 3480

AAAGAATTTA ATATTTACTT TAAATATTAG AACGTATGAA TTAAACTAGT ATTTATGAGA 3540

GGATGAACAA AACATGAGAA TAATGTTTAT AGGGGATATC GTAGGTAAAA TTGGACGAGA 3600

CGCAATTGAA ACGTACATAC CTCAACTGAA GCAAAAGTAT AAACCAACAG TTACAATTGT 3660

AAATGCTGAA AATGCAGCAC ATGGTAAAGG TTTGACTGAA AAAATATATA AACAATTACT 3720

AAGAAATGGT GTAGATTTCA TGACTATGGG TAATCACACA TATGGTCAAC GTGAAATTTA 3780

TGATTTTATA GATGAAGCAA AACGACTAGT AAGACCAGG AATTTTCCGG ATGAAGCGCC 3840

GGGAATTGGT ATGAGATTTA TACAAATTAA TGATATTAAA CTTGCAGTTA TTAATCTGCA 3900

AGGAAGAGCG TTTATGCCAG ATATTGATGA TCCTTTTAAA AAGGCAGATC AATTAGTCAA 3960
GGAAGCACAA GAACAAACTC CGTTTATATT TGTTGATTTT CATGCAGAAA CAACTTCTGA 4020
AAAGTATGCA ATGGGATGGC ATTTAGATGG TAGAsTAGCG CTGTTGTTGG AACGCATACA 4080

CACATTCAAA CAGCAGATGA ACGTATTTTA CCAAAGGGGA CAGGGTATAT AACGGATGTT 4140

GGTATGACAG GTTTTTATGA TGGCATTTTA GGAATAAATA AAACAGAGGT AATTGAGCGT 4200

TTTATCACTA GTTTGCCACA AAGACATGTT GTTCCAAATG AAGGTAGAAG TGTATTATCT 4260
GGTGTTGTTA TTGATTTAGA CAAAGAAGGT AAAACAAAGC ACATCGAACG TATATTGATA 4320

AATGATGACC ATCCATTTTC AACATTTTAA AATTACGTAA GTAAACATTC GAATTGGACC 4380

CTATCGTCCA TTAGTATGAA TTTAATATAG TACCACTGTT TACAffiGTAA ATCGGTGGTT 4440 
CTTTTTGTTA TCATTTAATA TGAAATATAT CCATAGGAGG CATATAACTA TGAAACCACA 4500

ATTATCGTGG AAAGTTGGCG GTCAACAAGG CGAAGGTATT GAATCAACTG GGGAAATCTT 4560

CGCTACGGCT ATGAATAGAA AAGGATATTA TTTATATGGA TATAGACATT TTTCAAGTCG 4620

TATCAAAGGT GGACATACGA ATAATAAAAT TAGAGTTTCT ACGACGCCTG TTCATGCAAT 4680

TAGTGATGAT TTAGATATTT TGATTGCATT TGACCAAGAA ACAATTGATG TTAACCATCA 4740

TGAAATGAGA GAAGACAGTA TTATTTTArC TGATGCCAAG GCTAAACCTG TGAAaCCAGA 4800

AGGATGTCAT GCACAGCTTA TTGAATTACC TTTTACAGCA ACCGCTAAAG AATTAGGTAC 4860

AGCATTAATG AAAAACATGG TTGCAATAGG TGCTACTAGC GCATTGATGA ATTTGAATAC 4920

AAATACATTT GAAGAACTTA TTACTAATAT GTTTTCTAAA AAAGGTGACA AGGTAGTTGA 4980

AGTCAATATC CAAGCATTAA ACGAAGGTTA TCAATTAATG CAATCTCGCT TflCTGAAAT 5040

CTACGGGGAC TTTGAATTAG AGTCAACAGA TGCACTACCA CATCTATATA TGATTGGTAA 5100

CGATGCCATT GGATTAGGTG CAATTGCTGC AGGTTCACAA TTTATGGCGG CATATCCTAT 5160

TACACCTGCG TCTGAAGTTA TGGAATATAT GATTGCCAAT ATATCTAAAG TAAACGGAGC 5220

GGTTATTCAA ACAGAAGATG AAATTGCTGC TGTAACTATG GCTATTGGTG CAAATTATGG 5280

TGGAGTCAGA GCGTTTACGG CTAGTGCTGG TCCAGGTTTA TCTTTAATGA TGGAAGCAAT 5340

TGGATTATCT GGTATGACTG AAACGCCATT AGTCATTATT AATACCCAAC GAGGTGGACC 5400

TTCTACTGGA TTACCTACGA AACAAGAACA GTCSATTTA ATGCAAATGA TTTATGGTAC 5460

ACATGGTGAT ATTCCAAAAA TTGTTGTAGC ACCAACAGAT GCAGAAGATG CATTTTATTT 5520

AACTATGGAA GCATTTAATT TAGCAGAACA ATATCAATGC CCTGTTATAG TTCTAAGTGA 5580

TTTGCAATTA TCTTTAGGTA AACAAACTGT TGAAAAATTA GATTATAATC GTATTGAAAT 5640

TAAACGTGGT GAAATCATTC AATCTGATAT TGAACGTGAA GAAGATGATA AAGGTTATTT 5700

CAAGCGTTAT GCGTtAACAT CCGATGGTGT TTCTCCTAGA CCTATCCCCG GTGTTAAAGG 5760

AGGTATTCAT CATATAACTG GTGTGGAaCa CAATGAAGAA GGTAAACCTA GTGAATCTGC 5820

GTCAAATAGA CAACAACAAA TGGAAAAACG AATGCGTAAA ATTGAGCAGT TACTAATTGA 5880

ATCGCCAGTA GAAGCTAACT TACAACATGA GGATGCAGAT ATTCTTTATA TCGGTTTTAT 5940

TTCTACAAAA GGTGCAATTC AAGAAGGTAG TAACCGTTTG AATCAACAAG GCATAAAAGT 6000 

TAACACTATA CAAATTAGAC AATTGCATCC ATTCCCAACA SCGTTATTC AAGATGCAGT 6060

TAATAAAGCG AAGAAAGTCG TTGTAGTGGA GCACAATTAT CAAGGACAAT TGGCTAGTAT 6120

TATAAAAATG AATGTCAATA TTCATGATAA GATTGAAAAT TATACAAAGT ATGATGGGAC 6180

ACCTTTCCTA CCACATGAAA TCGAAGAAAA AGGCAAAATA ATTGCTACTG AAATAAAGGA 6240

GATGGTATAG ATGGCGACAT TTAAAGATTT TAGAAATAAT GTTAAGCCTA ACTGGTGCCC 6300

CGGATGTGGC GATTTCTCAG TACAAGCTGC AATTCAAAAA GCAGCCGCAA ATATAGGGTT 6360 

AGAACCTGAA GAAGTAGCTA TCATCACCGG TATAGGATGT TCTGGCCGTC TTTCAGGATA 6420

TATTAATTCT TATGGCGTTC ATTCTATTCA CGGACGTGCA TTACCTTTAG CTCAAGGTGT 6480 

AAAAATGGCG AATAAAGATT TAACTGTTAT TGCATCGGGA GGAGATGGTG ATGGTTATGC 6540

TATAGGTATG GGGCATACAA TCCATGCTTT AAGAAGAAAT ATGAACATGA CGTATATAGT 6600

CATGGATAAT CAAATTTATG GTTTGACAAA GGGACAAACA TCGCCGTCIB CAGCAGTAGG 6660

ATTTGTTACT AAAACAACGC CAAAAGGTAA TATAGAAAAA AATGTTGCGC CTTTAGAATT 6720

AGTATTATCA TCTGGTGCCA CATTTGTAGC CCAAGGTTTT TCAAGCGATA TTAAAGGATT 6780 

AACAAAACTA ATTGAAGATG cAATTAATCA TGATGGATTT TCATTCGTTA ATGTCTTTTC 6840

ACCATGTGTG ACTTATAATA AAATTAACAC ATACGATTGG TTTaAAGAAC ATTTAACAAG 6900

TGTTGATGAc ATTGAAAATT ATGATTCTAC AGATAAACAA TTAGCGACTA AAACTGTTAT 6960

TGAACATGAA TCTTTAGTAA CTGGTATTGT TTATCaAGAT AAAGAAACAC CATCATATGA 7020

ATCtCAAATT AAAGAGTTAG ATGATmCACC ACTTGCTAAA AGAGATATCa AAATTaCTGA 7080

AGACACCTTC AATGCATTAA CTGAACAATT TATTTAATAA TTAAAACTGA GTCGTATAAA 7140

TGTATTTATA ACAGATCCAT TTATGCTACT CAGTTTTTTA CTATTACAAA AAATAAAGGA 7200 

GTTTTTAAAA ATGAAAGACA CATTAATGAG TATACAAATA ATTCCTAAAA CACCA&CAA 7260

TGACAATGTT ATACCTTACG TAGACGAGGC GATTAAAATA ATTGACGAAT CTGGTTTGCA 7320

TTTTAGAGTA GGTCCGTTAG AAACGACAGT ACAAGGAAAT ATGAATGAAT GTTTAATTTT 7380

AATACAATCA TTAAATGAAC GAATGGTGGA ACTTGAATGT CCAAGTATTA TTAGCCAAGT 7440

TAAGTTTTAT CATGTGCCAG ATGGCATCAC TATTGAAACT TTAACTGAAA AATATGATGA 7500

ATAACATTAA AAGTGAAGTA AACTGGATTT GAATTGGCTT GTTAGAGATG ACGTATAACT 7560

TTAACTGTTT TTGCACTTTA TAGTTAAATT TAATATAATT ATTAAATGAT ACGGGCAAAT 7620

AGAAAGGATT TTGTAAAGTG AACGAAGAAC AAAGAAASC AAGTTCTGTA GATGTTTTAG 7680

CTGAGAGAGA TAAGAAAGCA GAAAAAGATT ATAGTAAATA TTTTGAACAT GTTTATCAGC 7740

CGCCTAATTT AAAAGCAAGC GCAAAAAAAG AGGTnAAA 7778
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1128 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

AGATGAAGTT GTTACgAAAA TTGCGTACGC TGTTTCAGAA CATGTCAAAA TAGAAACAGG 60 

TAATCCATTC TTTCAAACAT CACATAGTGG TTGTGCGACG GGCGGATCCT GTAATTGTTC 120 

ATTATAAAAA ACATCGAGTC AGAAAAAGGT GGTTATTGAA cCACTAACTA GCATCTGACT 180

CGATGTTTTT ATTTATTCGG GATTGTTTGT TTGAATTGTT GTGCTAAATC TGGTCGATCT 240 

GTCACAATCG TGTGTGCACC TTTTTGGTAT AAATCATTCA TCAGATTTAT ACTATTTACG 300

CCATAATAGC CTGGAATGAT ATTCATATCA TTTAACCATT TGATAAAACG AGATGAAGTC 360

AAATCAATGC CTTTAAAATG AGTAGGCATT TGGAACGTTT GTGCTAATGG TTGGTAGTAC 420

CTACCACCTA ATAAATGATA TTTTAAAAAT GCTTCTGTAA CTCCTGTTG GCTAGCACCA 480

ATTGCGACGG ATCCTTGTGC AATTTTATTA AAACGAACGA TTTGTTCTTT ATAAAAACTT 540

GTCACAAGAA CGCGGTCAAA TGCTTGATTT TCTGCAATTG TATCAAACAT AATTTGTGGT 600 

GCGATTGAGC CTTCATAGGA TTCAGGAGCA TCTTTTAAGT CTACGTTTAT ATACATATCA 660

GGATATTGCT TCAGCAACTc ATCGAAGGTT AGTATAGCTG TGTGTGCATG ACCACGATAT 720

GGTGTTAGTC CATTGATATC TTTGAAGTGA TAAGCTGCGT CTAATTTCTT TAATTCTGCT 780 

AATGTATGGG CACTAACTTT TCCAGAGCCG TTCGTCGTTC TATCAACAGT TGCGTCATGA 840

AAAACGATAA GCTGTTGATC TTTTGTGAGT CTCACATCTG TTTCAAAGCC ATCAACGCCT 900 

AATTGTTTAG CATAGTCAAA TGCAAGTTGC GTTTGCTCTG GTCTTAAAGC CATACCACCG 960

CGATGCGCAA ATATATATGG TGCATTGCCT TTGAAAAAAG CAGGGATGGT TTGCTTTTTA 1020 

GTAATCACTT TATTTTTATT GATCATTAAT AGACTACTTA AAAATCCAGC ACCGACTAGT 1080

ACCGCATTTA AAATGTTTCT GTTTACnTTT TTCATAAAAA ATTCCTCC 1128

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6252 base pairs 




(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

CAAGCAAACA ATCGTCGATA AAATTGCTAA AATAATAAAA GTAATTCGAA CTTTCATCAT 60 

GATCATCCTT TGTTTATAGA GTCAATATAA GTATGGAATA TGTTAGGTAT ATAGTCAAAT 120

GCGTCAACTA ATGGGAATTT TGGCATAGAT AGAGAATTTA AGGCAATTAA AAAGGCATCA 180

AACAGTAATA TGCTGCTTGA TGCCCAAATG ATGACTTTAG CTAAATTGAT TAGTCACTTT 240

TAAAGATAAA GAATTGTCAT GAATTAAAAC TCATGTAATG ATGTGTTACA TTTCGCAATG 300 

ATGGCTTTCA GTTATTTATC GATAACATGi CTCTTGATAC CTTTAGATTT TAAGAAATCT 360

TTAATTTTAT CTTGTTGCTT TTTATTAACA TCACCGGCAT ATTTTGTTGG CACGTCGACA 420

ACATTGATTT TATTTTGCGG TTGATAGCTA AGCTTTTCAA TATCTTCATC AACATTGGCG 480 

ATTGTACTAT TTAAAGCTTT GAAGTAATTC ATCATTAATT CAACGGGTTT CTTAATTCT 540

TTAGGAATAT TGTTTTCAGT GACAAATTTC TTGAAATGCA AATCGTTTTT AACAGCTAAG 600

TTAGATAAGT GGCTAAGTGT TTCTGCTTGT TTTTCAGTCA CTTTTGTTTG ACTGTCAATT 660

TGTTTATCTA GTTTATGTTG CATAATATAT TTGTTATCAA GTATATCGCT ATTTACAGAC 720

AAATACTTTT CTATAGCTTG CTTCATCTCT GCATCACTAA TATCACTATT TTTCTTATCT 780 

GAGTTAAAGA TATCTTTTGT tTCTAATTTT TTAGCGCTTT TAGGTGCATG GATGCCAGTA 840

CTTGTATGAT GATCTTCGTT ATCAGATTGA TCGGACGCGC AACCTGTAAG AATTAATGTC 900

GATGCTAAAA ATGTACTTAG TAGTAATCTC TTTTT(ATAA TGTAATATAA CTCCTTAGTT 960 

TATCTTTAAT TGAAAAAATA TGTATTCATG TTTAATAGAG TAACATTGAA TTAGTTTGGA 1020

ATGTCACGAT GACCTTGCAA TGACCATAGA CGTAAATGAT TACGTGCATG AGTTGCTTTT 1080

TCTATCAATA ATGCATCATT TTGGACGTTG TTAAGGATAG CTTTATCTAT AAATAACTGC 1140

ATAATTGGTT GTACTAATTT AGACGTAGGT ATCGTACGTA AAAGCATAAT AATTTCGTTC 1200 

ACATACTTTT CTTTCTCAAT ATCATTTTTC ATATTGATTT GTTTGCGAGA GGTACATACT 1260

TTAAGCATTA TCGCACATCT CGTTGTATAT ATTAAGTTTA TCATAACATG ATTTTATGTC 1320

GGGATAAAAA AATAACAG2A TCTTAACAAA TGTAAGATAC TGTCAGTGAA ATGAATGAAA 1380

CTTTAGTTTC TGaTAATATA GTCAAAGGCA TTTAATGCTG CATTTGCACC AGCGCCCATT 1440

GAAATGATAA TTTGTTTGTT CTTCTGATCT GTGACATCGC CAGCAGCAAA TATTCCAGGA 1500

ACATTCGTAT TATTGTTACG ATCAATCACA ATTTCACCAC GTCGTTTAA T TCAACAGC A 1560

TCGTTTAACC ATGATGTGTT TGGAAGTAAA CCAATTTGAA CAAAGATACC ATCTAAGTTA 1620

AGTAGATGTT CTTCGCCGGT GTTCATGTCT TCGTAACGTA TACCTGTAAC ATGGTCTTCT 1680

CCGACAACTT CAGTAGTTTT GGCATTTGTT TTGATATCAA CATTTGATAA AGAACGTAAA 1740

CGATCTTGTA ACACGTTGTC TGCTTTTAAT TCGCTAGCGA ATTCGAATAA TGTAACATGA 1800 

TTAACGATAC CAGCAAGGTC AATTGCTGCT TCAACCCCAG AGTTACCGCC ACCGATAACT 1860

GCTACGTCTT TATTTTCAAA TAGAGGTCCG TCACAGTGAG GGCAGAATGC AACACCTTTA 1920 

TTAATCAATT GCTCTTCACC TGGASTGTTT AGCTTACGCC AACCTGCACC AGTAGCAATA 1980

ATGACTGTTT TACTTTCTAA GACAGCACCG TTTTCTAACG TAACTTTAAT TGCTTCGTCA 2040 

GTCTTTTCGA TATCTGTAGC ACGTATACCT GTCATTGCAT CAATGTCATA TTGATCAATG 2100 

TGCGCTGCTA AGTTAGAAGA AAATTCAGAA CCAGTTGTTT CTTTAACAGT AATGAAGTTC 2160

TCAATACCAG CAGTATCATT AACTTGGCCA CCGATACGAT CAGCAACTAT ACCAGTACGT 2220

AAACCTTTAC GTGCTGTGTA AATCGCTGCA CTACCACTAG CAGGACCACC ACCAACGATT 2280

AAGACATCAT AAGGTTCTTT ATTTTCAAAC TCAGATGCAT CTGCCGTACT GCCTAGTTTC 2340

GAAAGAATAT CTTGGATTGT CATACGACCA TTGCCAAATT CTTCGCCATT TAAAAAGACA 2400 

GCAGGGACTG CCATGATGTT TTCAGATTCT TCACGGAACA CTGCACCATC AATCATAGAA 2460

TGCGTGATGT TAGGGTTGAT CACACTCATT AAGTTAAGTG CTTGAACGAC ATCAGGACAT 2520

TTTTGACACG TTAAACTAAT GAATGTTTCA A&TGGAATG AACCTTCTAA TTTTTTAATT 2580 

TGGTCAATGA TTGACTGTTT TTCTTTAGGT GCACGACCAC TAACCTGTAA AATTGCTAAA 2640

ACAAGTGAGT TAAACTCGTG ACCTAATGGA ATACCTGCAA ATGTTACACC TGTTTCTTCG 2700 

CCAGGACGAT TGACTGAGAA ACTTGGTGTA CGTTTTAAAG ATTTTTCAGA AAGAGAT&T 2760

CTAGGTGACA TATCAGTAAT TTCTGTCAAC AAATCTTTAA GTTCTTTGGA TTTATCATCT 2820 

GAACCAAGGC TGGCAACGAA TTCAACGTTG CCCTCCATTA GTTCTAATAG TTGTTTAAGT 2880 

TGTTGTTTTA AATCAGCATT AAGCATGGTT GTAATGCCTC CTTAGATTTT ACCTACTAAA 2940

TCTAAACCAG GTTGCAATGT TTTAGCGCCT TCTTCCCATT TAGCTGGGCA TACTTCGCCA 3000 

GGGTTTTTAC GAACATATTG AGCTGCTTTG ATTTTGTGAG CTAATGTACT AGCGTCACGG 3060 

CCAATTCCGT CAGCGTTAAT TTCAGATGCT TGTACAACAC CGTCTGGGTC GATAATGAAT 3120 

GTACCACGTT GAGCTAAACC AGTAGCTTCA TCTAATACAT CAAAATTACG AGTGATTGTT 3180 

TGTGATGGGT CACCAATCAT AGTGTAAGTG ATTTTGCTAA TTGCATCTGA ATGGTCATGC 3240

CATGCTTTGT GTACGAAGTG AGTATCAGTT GATACTGAGA ATACATTTAC GCCTAATTTT 3300 

TGTAATTCTT CATATTGGTT TTGTAAGTCT TCTAATTCAG TTGGACAAAC GAATGAGAAG 3360 

TCAGCAGGAT AGAAGCATAC TACGCTCCAA GAACCTTTTA AATCTTCTTG TGTAACTTCT 3420

TTAAATTGAT CTTTTTTTGG ATCGAAArCT TGCGCTGTAA ATGGTAAGAT TTCTTTGTTA 3480

ATTAATGACA TAAATATCTT CCTCCTAAGA ATTTAAGTAT GAATTAGAAC TATCAATTGA 3540

TTGCGCTTAA TTATAATAAT TCTAATCTCT TAGTTAGCAT TATTACATTT TGATCCAGAA 3600

TAGTCAACTG GATAACTTTG TAAAGTGAAT GATTACTTTT AAAATAAAGA AAGATAATAT 3660

AAAGTGCTTT GATAATGGAT TTTGTAGTTG ATGATTTAAA AGGTTGTGTC TATATTTAAT 3720

ATCTTGATTT TAATGTAAAA AATGTAAAAA AAGAAGATTT GTATTCIAA CTAAGTCAAC 3780 

CTTATTGATA ATGGTATGAG AATATTTGTT CGAGATGGAT GAAGGTAATG AGTGAGAAAC 3840

TGGATTTTTA AAGTATGAGA CAATATTTTA AAAAGTTCAA TTATTAACTT ATAAGCAAAT 3900

AATTGCTATA AAAAAGTTTG GACGTGTACA ATTGCAATAT GAAGATTTTA AATTAATTGT 3960 

AAAGTATCGA GGAGTGGGTA ACGTGTCAGA ACATGTATAT AATCTTGTGA AAAAGCATCA 4020

TTCTGTTAGA AAATTTAAGA ATAAACCTTT AAGTGAAGAC GTTGTTAAGA AATTGGTAGA 4080 

AGCTGGACAA AGCGCTTCGA CGTCAAGTTT CCTGCAAGCA TACTCAATTA TTGGTATCGA 4140

CGATGAGAAG ATTAAAGAAA ATTTACGAGl AGTTTCTGGA CAACCTTATG TTGTAGAAAA 4200

TGGCTATTTA TTCGTCTTTG TTATTGATTA TTATCGTCAT CATTTAGTTG ATCAACATGC 4260

TGAAACTGAT ATGGAAAATG CATATGGTTC AACGGAAGGT TTGCTAGTAG GTGCAATCGA 4320

TGCAGCATTA GTTGCCGAAA ATATTGCGGT AACTGCTGAA GATATGGGGT ATGGATTGT 4380

CTTTTTAGGA TCATTAAGAA ATGATGTTGA ACGCGTTCGA GAAATTTTAG ACTTACCTGA 4440

CTATGTCTTC CCGGTATTTG GTATGGCAGT AGGGGAACCc GCAGATGACG AAAATGGTGC 4500

AGCCAAGCCA CGCTTACCAT TTGACCATGT CTTCCATCAT AATAAGTATC ATGCTGATAA 4560

GGAAACACAG TATGCACAAA TGGCAGATTA CGACCAGACA ATCAGCGAGT ACTATGATCA 4620

ACGTACAAAC GGGAATCGCA AAGAAACATG GTCGCAGCAA ATTGAGATGT TCCTAGGAAA 4680

CAAAGCAAGA TTAGATATGT TAGAACAATT GCAAAAATCA GGCTTAATAC AGCGATAgCA 4740

AGATACCAAA ATAACCCGCC CCCCTCTAGC TTAAAAGAT AAGTATAGCT AGAGGGGGCG 4800

GGTATTTCTT GCAATGAATT AGTGTGAAGT TAATGCAGCA TTATCATTTG AATCGAAAGT 4860

ATCTTTATCC CAATGTTTAG TTAACTTGGC GGTACCTGTA CCAGCTAGCA TTGAATCGTT 4920

CACGTTTAAT GCTGTTCTAC CCATGTCAAT CAATGGTTCA ACGGAGATGA GCACGCCGGc 4980

TAAAGCGACT GGCAAGTTTA ACGTTGACAA CACCAATATG GATGCAAATG TAGCCCCGCC 5040 

ACCGACGCCA GCAACGCCGA ATGAACTAAT AATCACGACA GCGATTAACG TTACAATAAA 5100

TTGTAAATCA ATTTCTACAT TAGCGACGGG TGCGACCATA ATTGCAAGCA TGGCAGGGTA 5160 

AATGCCTGCA CAACCATTTT GTCCAATCGA CAATCCAAAT GTCGCAGCGA AATTGGCAAT 5220 

ACCTTCTGGC ACGCCTAGAC GTCTTGTTTG TGTTTGTACA TTCAATGGTA AGGCACCCGC 5280

GCTTGAGCGT GATGTGAATG CAAAGATTAA TACTTCCAAA GTCTTTTTAA CATAGCGAAT 5340

TGGGCTAATA CCTAACAGGC TTAAAATAAT TAAGTGAATG ATffiACATCG TAATTAATGC 5400

AGCGTACGAT GCGATTAAGA ATTTTCCTAA AGTCCAAATG GCGCCAAAGT CACTTGTCGA 5460

TAATGTGTTG GCCATAATTG CTAATACACC GTATGGCGTT AAACGTAAGA CGAACGTCAC 5520 

AATCGCCATT ACTAGTGAAT AGATAGCGTC AATCGCACGC TTAAGCAATT CACCATGATC 5580

AGGTTGTTTG CGTnTACGCG TAAATAAGCA AATCCTATAA ACGAAGCAAA TATCACGACA 5640

GCAATCGTGG aAGTTGCACG TTGTCCaGTG AAATCTAAGA ATGGATTTTT AGGCAATAAT 5700

TCCAAAATTT GTTGTGGTAA CGTATGTGCT GTTAAATCTT TCGCTTGTTT AGCAATTTCG 5760

CTTCCACGTG CTTGTTCAGC GTTACCAAGG TTAATTGTTG ATGCATCTAA ACCAAACACC 5820 

AAGGCATACA CAACACCAAC AATCGCAGCA ATGGTGACAG TGCCAATTAA AAAGATAAAA 5880

ATGAGACTAC CAATTTTAGC AAACTTTTCT CCGATTTGAA TTTTAGTGAA TGCAGCTACA 5940

ATAGAAATGA AAATTAAAGG CATAACAATC ATTTGCAACA ATGCAACGTA ACCTTGTCCG 6000

ACAATGTTGA ACCAGTCACT TGTTGATGTA ATAACATTCG AATGTGTGCC ATAAATAAGA 6060

TGCAATAACA CACCGAATAC TATACCAATC CCTAAAGCTG TAAACACACG TTTCGCAAAA 6120

GATATATGTT TGCGAGCCAT CATGTGCAAT ATTACGATGA AAATCACCAA TACAATAATA 6180

TTAATCAGTG TAAGAAAAGC ATTCATGAAC GTCACTCCTT AAATTTTTGA ATATAATTCC 6240

GACTAGTATG CT 6252



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6730 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

ATCAAATCnC AAAATATTTA TTAATnAnAA GGGGATTATC CaTGTgAGAA ACAAAGTAAT 60 

GCTCTTTTTT TACCTCTTGT GGGTTGAAAA aTGGATCATC AGAGATAGAC TTCTTCTTTT 120 

TCGAAGATGA CATTTGATAC TTTAATCTTC TAAAACCATA ACTTGTCGCA TCAAAAATGC 180

CTTCTTGTAC AAGTAAAATC AAAAATATGC TAATAAAAAT AATTAATGAA ACATAAAACA 240

ATATATTTAA ATATGTAATG ATAGTATGGC TATTAAAAAG CCATATAATA AACGTAATA 300

TTGGCGTTAT TAGTGCCATT CCAAGCCATT TTTTCAACAT TTGATCACTC CCACTTATAG 360

AAAACTCTTA CGCATAGTTT ACATTAAAAT CAGACATTGA GGAATGATTT TTTAATTTCT 420

TCAGCTTTAT TGAAATTCTA AAATCAATCA TTCTTCATTA GTTTAAAGCA AAAAAATATT 480

GATATATAGT AAATATTGTA TATATAATAT TAGTTAAGAT TTCaGAAAAT TTTGAAGGGA 540

ATGGAAATTT AGAAATCGGA ATTTGTTAGA GGAGGGGATT AGATGGGGAA ATATATTTTC 600 

AAACGATTTA TTTATATGCT TATTTCTTTA TTTATTATTA TTACAATTAC ATTTTTCTTA 660

ATGAAATTAA TGCCAGGTTC GCCATTTAAC GATGCTAAT TAAATGCTGA ACAAAAAGAA 720

ATTTTAAATG AAAAATATGG ATTAAATGAT CCTGtAGCTA CGCAgTATTT ACATTATTTA 780

AAAAATGTTG TTACAGGCGA TTTTGGTAAT TCATTCCAGT ATCATAATCA ACCTGTGTGG 840

GATTTGATTA AACCGAGACT ACTACCTTCT TTTGAAATGG GTCTTACAGC AATGTTCaTC 900 

GGTGTGATAC TGGGACTTAT TTTAGGTGTT GCAGCAGCTA CTAAACAAAA TTCTTGGGTT 960

GACTATACAA CTACAGTTAT TTCAGTTATT GCAGTATCTG TACCATCTTT TGTACTTGCT 1020

GTACTTTTAC AATATGTATT TGCAGTTAAA TTAAGATGGT TCCCAGTAGC TGGATGGGAA 1080

GGTTTTTCGA CCGCGGTATT ACCGTCACTT GCATTATCTG CAGCTGTTTT AGCAACTGTC 1140

GCCAGATACA TAAGAGCAGA GATGATAGAG GTATTAAGTT CAGACTATAT TTTATTAGCG 1200

AGAGCTAAAG GTAATTCGAC AATGCGTGTA CTTTTTGGAC ATGCACTTAG AAATGCTTTA 1260

ATTCCAATTA TTACAATTAT CGTTCCCATG TTAGCAAGTA TTTXACAGG CACTTTAACA 1320

ATTGAAAATA TTTTTGGAGT TCCTGGATTA GGGGATCAAT TCGTACGTTC AATTACAACA 1380

AATGATTTCT CAGTAATCAT GGCAATCACA CTATTATTTA GCACACTGTT TATCGTTTCT 1440

ATTTTTATTG TAGATATTTT GTACGGTGTG ATAGATCCAC GAATTCGTGT TCcAAGgAGG 1500

TAAAAAATAA TGGCTGAAAA TAAAAACAAT TTGTCGATTA ACGACGATCA TTCTAATGCA 1560

GCTATGACGC ATACCTCTGa CgTATCGCAT CATCTGATTT TATTATTAGA GAATTAGATT 1620 

TGAATCAGGA ACCTGAAATG CAACGAGAAA GCAAAAACTT TTGGCAAGAT GCTTGGGCTC 1680

AGTTAAAACG AAATAAGTTA GCTGTISTCG GTATGATAGG TTTAATTATC ATTGTAATAT 1740

TTGCTTTTAT CGGTCCAGTT ATAAATAAAC ATGATTATGC TGAACAAAAT GTAGAACATA 1800

GAAATCTTCC GGCAAAAATA CCTGTATTAG ACAAAGTTCC ATTTTTACCT TTTGATGGTA 1860

AAGATGCAGA TGGCAAGGAT GCTTATAAAG CAGCAAATGC TAAAGAAAAT STTGGTTTG 1920

GTACTGATCA GTTGGGTCGA GATTTATGGA CAAGAACATG GAAAGGTGCT CAAATTTCAT 1980

TGTTTATCGG TGTTGTTGCA GCGATGTTAG ATATTTTTAT TGGTGTTGTA TATGGTGCGA 2040

TTTCTGGATT CTTCGGTGGA CGTGTCGATA CGATTATGCA ACGTATACTT GAAGTCATAG 2100 

CATCTATTCC GAATTTAATT GTCGTAATTT TATTTGTATT AATTTTTGAA CCATCCATTT 2160 

GGACAATTAT ATTGGCTATG TCTATCACAG GCTGGTTAGG CATGAGCAGA GTTGTACGTG 2220 

GAGAATTTTT AAAATTAAAA AATCAAGAGT TTGTCATGGC TTCGAAAACA TTGGGGGCTT 2280

CAAAATTCAA ATTGATATTT AAGCATATTT TAtCTAATAC ATTAGGTGCT ATCGTGGTTA 2340

CATCAATGTT TACAGTACCT AGTGCTATTT TCTTCGAAGC ATTTTTAAGT TTCATTGGTA 2400

TAGGTGTACC CGCACCTCAA ACATCGTTAG GGTCATTAGT AAATGATGGG CGCGCAATGT 2460

TATTAATTTA TCCACATGAA TTATTTATAC CAGCAATGAT TTTAAGTTTA TTAATTCTA 2520

TCTTTTACTT ATTTAGTGAT GGATTACGTG ATGCATTTGA TCCGAAAATG CGTAAATAAA 2580

AAGGGGGCAT AGCATATGAC TGAAAGAATA TTAGAAGTAA ATGATTTGCA TGTTTCCTTT 2640

GATATTACAG CAGGGGAAGT GCAGGCAGTG AGAGGCGTAG ATTTTTATTT GAACAAAGGG 2700

GAAACATTGG CAATTGTTGG TGAATCAGGT TCAGGTAAAT CTGTAACAAC AAAAGCAATT 2760

ACAAAATTAT TCCAAGGGGA CACAGGAAGA ATTAAAAAGG GAGAAATTTT ATTTTTAGGG 2820

GAAGATTTAG CAAAAAAACC TGAAAATGAG TTGATTAAAT TACGTGGCAA AGATATTTCA 2880

ATGATCTTTC AAGATCCAAT GACATCTTTA AACCCAACGA TGCAAATTGG TAAACAAGTC 2940

ATGGAACCAT TAATTAAGCA CAAAAATTAT AGTAAAGCAC AAGCTAAAAA GCGCGCATTG 3000
GAAATACTAA ATCTTGTAGG TTTACCAAAT GCAGAAAAAA GATTTAAAGC ATATCCTCAT 3060 
CAATTTTCAG GTGGACAAAG GCAAAGAATT GTTATTGCAA CCGCATTAGC TTGTGAACCT 3120
AAAGTGCTCA TTGCTGATGA ACCAACGACT GCATTAGACG TAACGATGCA GGCACAAATT 3180

TTAGATTTAA TGAAAGAACT ACAACAAAAA ATCGATACAG CAATTATTTT TATAACGCAT 3240

GATTTAGGGG TTGTTGCGAA TATTGCTGAT AGAGTGGCAG TTATGTATGG TGGTCAAATG 3300 

GTTGAAACAG GAGATGTTAA CG\AATATTT TATGATCCAA AGCATCCATA TACATGGGGA 3360

TTATTATCGT CAATGCCTGA TTTATCAACA ACAAATGACA CACCATTACT AGCGATTCCT 3420

GGAGCGCCAC CTGATTTATT ACACCCACCT AAAGGTGATG CATTTGCGAG ACGTAGcAAT 3480

ATGCATTAGA TATTGATTTT AAAGTAGAAC CACCGTGGTT TAAAGTTIA CCGACACATT 3540

TTGTGAAATC TTGGTTATTA GACGCACGTG CACCAAAAGT TGAACTACCC GAGCTGGTAA 3600

AACAACGTAT GAAACCGATG CCTAATAATT ATGAAAAACC ACTCAAGGTA GAAAGGGTGT 3660

CGTTCAATGA AAAATGATGA AGTGCTATTA TCTATTAAAA ATTTAAAGCA ATATTTTAAC 3720

GCAGGAAAGA AAAACGAAGT GgaGCGATTG AAAATATTTC GTTTGATATA TACAAAGGGG 3780

AAACATTAGG TTTAGTAGGA GAATCGGGGT GTGGTAAATC TACAACTGGT AAATCAATTA 3840

TTAAACTTAA TGATATTACA AGTGGAGAAA TTTTGTATGA GGGTATTGAT ATACAAAAGA 3900

TTCGTAAACG TAAAGATTTG CTTAAATTTA ATAAAAAGAT ACAGATGATT TTTCAAGACC 3960

CATATGCGTC TTTAAATCCT AGGTTAAAAG TAATGGATAT AGTAGCTGAA GGTATTGATA 4020

TCCATCATTT AGCAACTGaT AAGCGTGACC GAAAAAAACG TGTCTATGaT TTACTTGaAA 4080

CTGTTGGATT AAGTAAAGAA CATGCCAATC GCTATCCTCA TGAATTTTCA GGTGSCAAC 4140

GCCAACGTAT TGGaATTGCC CGTGcATTAG CCGTTGaACC AGAATTCATT ATCGCGGACG 4200

AACCAATATC GGCATTGGAT GTTTCAATCC AAGCTCAAGT AGTTAATTTA TTATTAAAAT 4260

TACAACGTGA AAGAGGGATT ACGTTCCTAT TTATAGCTCA TGATCTATCA ATGGTGAAGT 4320

ATATTTCAGA TCGTATTGCA GTCATGCATT TTGGGAAAAT AGTTGAAATT GGACCGGCAG 4380

AAGAAATTTA TCAAAATCCA TTACACGATT ATACTAAGTC TTTATTATCA GCCATTCCAC 4440

AACCTGATCC TGAATCAGAA CGCAGTCGCA AACGATTTAG TTATATTGAT GATGAAGCAA 4500

ATAATCATTT AAGACAATTA CATGAAATTA GACCGAffCA CTTTGTCTTT AGTACTGAAG 4560

AAGAAGCGGC ACAACTACGA GAAAATAAAT TGGTGACACA AAATTAAGGG GAAGGGGGAA 4620

ATGCAATGAC GAGAAAATTT AGAACACTTA TTTTAATTTT GATTGCTACA ATTGCATTAA 4680

GTGGTTGTGC TAATGACGAT GGTATTTATT CAGATAAAGG TCAAGTATTC AGAAAAATTT 4740

TGTCATCAGA CTTAACATCC CTTGATACAT CATTAATAAC GGATGAAATA TCTTCTGAAG 4800

TGAcTGCGCA AACATTCGAA GGTTTATACA CATTAGGAAA AGGTGACAAA CCGGTGTTAG 4860

GTGTTGCGAA AGCTTTTCCT GAAAAGAGTA AAGATGGTAA AACTTTAAAG GTTAAATTAA 4920

GAAGCGATGC TAAATGGAG3 AATGGTGACA AAGTGACTGC ACAAGACTTT GTTTATGCTT 4980

GGAGAAAAAC AGTTGACCCT AAAACAGGTT CTGAATTTGC ATACATTATG GGGGACATTA 5040

AAAATGCGAG TGATATTAGT ACTGGTAAGA AACCTGTAGA GCAATTAGGT ATCAAAGCAT 5100

TAAATGATGA AACATTACAA ATTGAATTAG AAAAGCCGGT TCCffiATATT AATCAATTAT 5160 

TAGCACTTAA TACCTTTGCA CCTCAAAATG AAAAAGTTGC CAAAAAATAT GGTAAAAATT 5220 

ACGGTACGGC AGCTGATAGA GCGGTATACA ATGGTCCaTT TAAAGTTGAT GATTGGAAAC 5280

AAGAAGATAA AACCTTACTA TCTAAAAATC AGTATTATTG GGATAAAAAG AATGTAAAAT 5340

TAGATAAAGT GAATTATAAA GTTATTAAAG ACTTACAAGC CGGTGCATCA TTGTATGATA 5400

CTGAATCAGT AGATGACGCA TTTATTACTG CAGATCAAGT AAATAAATAT AAAGACAACA 5460

AAGGATTAAA CTTTGTGTTA ACGACTGGGA CATTTTTTGT AAAAATGAAT GAAAAACAAT 5520 

ATCCTGATTT TAAAAACAAA AATTTiAGAT TGsTATCGCA CAAGCAATAG ATAAAAAAGG 5580 

ATACGTTGAT TCAGTGAAAA ACAATGGCTC AATTCCTTCC GATACACTAA CAGCCAAAGG 5640

AATTGCGAAA GCGCCTAATG GCAAAGATTA TGCGAGTACC ATGAATTCGC CTTTAAAATA 5700

TAATCCTAAA GAAGCAAGAG CACACTGGGA CAAAGCTAAA AAAGAGTTAG EAAAAATGA 5760

AGTGACATTT TCAATGAACA CAGAAGATAC ACCAGATGCA AAAATATCTG CTGAATATAT 5820

CAAATCGCAA GTTGAGAAAA ATTTACCAGG AGTTACTTTG AAAATTAAGC AATTACCGTT 5880

TAAACAAAGA GTATCACTAG AACTGAGTAA CAATTTTGAA GCATCACTTA GTGGTTGGTC 5940

TGCAGATTAC CCTGATCCTA TGGCTTATTT AGAAACAATG ACCACAGGTA GCGCACAAAA 6000

TAATACAGAC TGGGGTAATA AAGAATATGA TCAATTACTT AAAGTAGCAA GAACCAAATT 6060

GGCACTTCAA CCGAACGAAC GATATGAAAA CTTGAAAAAA GCAGAAGAAA TGTTCCTAGG 6120 

AGATGCACCG GTAGCACCAA TTTATCAAAA AGffGTtGCA CATTTaACAA aTCCTCAAGT 6180

AAAAGGATTA ATTtACCATA AATTTGGTCC AAATAACTCA CTTAAACATG TATATATTGA 6240

TAAATCGATA GATAAAGAAA CAGGTAAGAA GAAAAAATAA TATGCTTTGT AAATTAGGCT 6300 

GGAGACATAT CTCCAGTCTT TTTGTGTTGG ATAAAAaCTT TGGGAATAAA AATTTAAAA 6360

AAGTCGTTTT TTAAATTACT GAAATTGATT AAATGCATAA ATAACTGAAT ATTCTAAAAA 6420

TAAACTTGTA ATAATTTTTT CTATGAGTAA ACTAAAAAGA AAAAATTAGA TTGAAAGTAG 6480

GAGGCATATG TATGGGGAAG CTAATTAAAT ATATTTCAAT ACTTCTTATT GTCGTTTTAG 6540

TGTTGAGTGC TTGCQ3AAAA AGCAGTAATA AAGATGAAGG AGTAAAAGAT GCTACTAAAA 6600

CGGAAACCTC AAAACATAAA GGTGGTACCT TAAATGTAGC ATTAACAGCA CCGCCAAGTG 6660

GTGTTTATTC TTCGTTATTA AATAGTACAC ATGCAGATTC TGTAGTTGAG GGATATTTTA 6720

ACGAAAGCTT 6730
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6482 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

AATTTTTGTC ATTATTAAAA ACCTCGCTTT TAAAAGATTG AAAAGTAAAT GAGTGAAATT 60 

AAAGATTATG CACATTAAAA TCACGCCACA ATTTAATTGT GAAAAATATC ACAAATATAT 120

TATAACACTA AATTTCCCAA AATTCAAAAG TGTGTTTTAT TGCAGAAAAC TTATAACAyG 180 

TGCACAAGTT ATAGTGAATT GCAAACGGAT TACTTTAGTC TTTTTAAAAC ATGAAGTATA 240

ATTTGTATAG CAATAAATAT AAAAATGGGA GGCTATGTTC AATGAGCAAT ATGAATCAAA 300

CAATTATGGA TGCATTTCAT TTCAGACATG CGACTAAGCA ATTCGATCCA CAAAAGAAAG 360

TTTCGAAAGA AGATTTTGAA ACAATATTAG AGTCAGGTAG ATTGTCTCCA AGTTCTCTTG 420 

GGTTAGAACC TTGGAAGTTT GTCGTGATTC AAGATCAAGC GTTACGTGAT GAATTAAAAG 480

CGCACAGTTG GGGCGCAGCA AAACAATTAG ATACAGCGAG CCATTTTGTG CTAATTTTTG 540

CGCGTAAAAA TGTAACGTCA AGATCACCGT ATGTACAACA TATGTAAGA GATATTAAAA 600

AATATGAGGC ACAAACGATT CCAGCTGTTG AACAAAAATT CGATGCATTC CAAGCAGATT 660

TCCATATTTC TGATAATGAT CAAGCCTTGT ATGACTGGTC AAGTAAACAA ACGTATATTG 720

CATTAGGCAA TATGATGACG ACAGCCGCAT TGTTAGGTAT TGATTCATGT CCGATGGAAG 780

GTTTTAGTCT GGATACAGTG ACAGACATTT TAGCAAATAA AGGGATCTTA GATACTGAGC 840

AATTTGGTTT ATCAGTGATG GTCGCATTTG GCTACAGACA ACAAGAGCCA CCGAAAAATA 900 

AAACACGCCA AGCTTATGAA GATGTTATTG AATGGGTTGG ACCAAAAGAA TAAATAGAAT 960 

ACCGTATGTC TAAATATATA AAATTASLAA GTTAGCAATA AAAAAGCCTG CGATTACATA 1020

AATGAATCGC AGGcTTTTGC GTGAAAAAAT TGTATTAATA AAGTATGGAT GATTATTTTT 1080 

CTGGAACAAG GTCAGTATTT GAATGAACTG TGATGTCAAA CCCTTCTGGT GCCGTAAATG 1140

TATGTGTTGA GGCGTCGGGT TGATAAATAT CAACATGTGT TAATCCATAA CTTGTGAAT 1200 

TGTTTTGTCT TGCTTGATTG GATTGCCAAG TATTAGCAGC AATATGATGG TGATAATGAT 1260

TCGTTGACAT AAATAGCGCA CGTGGAAAAT CAGACACATG TTGGAATCCT AATTGTTCAA 1320

TGTAACATTG ATATGCTGCG TCTAAATCAT GTGTTTTTAA ATGTAAGTGT CCAATCATGC 1380 

CTTTTGCTGG CATTCCTTGC CAACCTTCAT CAGTACGATG TGTTAATAAG GTTTGGCTAT 1440

CAACTTCTAA AGTATCCATT TTAACTTTGC CATTTTGCCA TTCCCATGAA GATGAAGGTC 1500 

TATCGCGATA GACTTCAATA CCATTACCTT CGGGGTCGTT GAAATATAAA GCTTCACTTA 1560

CTAAATGATC ACCAGCGCCG ATGCCCATAT TTTTTGTGC CACGAAATAT AAGAAGTTAG 1620

CTAGATCCTC AGTAGTTGGT AATAAAAATG CTATATGAAA AAGCCCTGCT TcACGTGGGG 1680 

aAGTCTGACG GcCGTCTTCT AATAAATGTA ACGTTAGAGT ATGGcCACCA GTCCCAACAG 1740

ATAATACGGT TGTATTATCG TCAGAACTTT TAACGGATAG TCCTAAAATG TTTTTGTAAA 1800

ATGTTGTCAT TAAGTCTAAG TCTCTTACGT TCAGTACAAT GTTTGTCACT TGTGTTGCTG 1860

TTTTATCGTG AAATGCCATT ATGCATCGCC TCTTTTTCTA TTTTTCTATA AGTTAGTATA 1920

AAAAGTATAC CAGAAAAGAA AATGAATTGA TAGCATAAAG TTTGAAATGC AAAATAACTA 1980

GTCGTTTTGC AATTTTAtAT TGATGCGAAC AAAAAAGCGA TGGTACAGTT GCACCATCGC 2040

AAAATTTATT TAACCAAGAT ATACATCTTG ATATGAATCT TCTTTTTCTA ACATATGTTT 2100 

GGCAAATGAA CATGAGGCAA TAATTTTCAA ATTATTTTCT CGAGCGTGTT CAACAACTGc 2160 

TTTAAGTAGT TTTTTGCCAA CACCTTGACC ACCAAGTTCA CAGATACGC CTGTATGATC 2220 

AATGTTAATT TCATTATTAT CCACAAAACG GTATGTGATT TCAGCTAAAG CATTATTTTC 2280

ATCATCACCA ATATAGAATT TGTTCTCGCC TTGTTTGATT TCAAGGTTAC TCATACATAT 2340

CAACTCCTAT CATGATTGAT TATAGTATTT CCCTATTCTA TTTTAACTTA AACGAAGTCA 2400

AAGGTGCATG ACAGTCATGT GACGACATTG CCACATCTAT GTAGTCGTTT TTATTAAGCA 2460

CAGTTTGAAA TGAAGATGAA AACACGTATC TTGACATTAA ATCTATTCAG CTATATAATT 2520

TATCTCGAAA TCGAAATAAA ATAAAAAAGT TGGTGATCAT ATGGATCGAA CGAAACAATC 2580

TCTCAATGTT TTTGTCGGAA TGHVTAGGGC GTTAGACACA TTAGAGCAAA TTACAAAAGA 2640

AGACGTAAAG CGATATGGCT TAAATATTAC TGAATTTGCA GTGCTCGAGT TGCTTTATAA 2700

TAAAGGTCCG CAACCAATTC AACGTATTAG AGACCGCGTA TTAATTGCAA GTAGCAGCAT 2760

TTCATATGTT GTAAGTCAAT TAGAGGACAA AGGTTGGATT ACACGTGAA AGGATAAAGA 2820

TGATAAACGT GTATATATGG CTTGTTTAAC TGAAAAAGGT CAAAGTCAAA TGGCAGATAT 2880

TTTCCCTAAG CATGCTGAGA CATTAACAAA AGCGTTTGAT GTGTTAACAA AGGATGAATT 2940

AACAATCTTA CAACAAGCGT TTAAGAAACT AAGTGCACAA TCTACAGAAG TGTAAGGCGT 3000 

GCACTAAAAA TTTACATTAA AGTATCTCGA TTTCGAGATA AATGCACTAA AAATATAAAG 3060

AGGGTATATA AAATGATAAA TAATCATGAA TTACTAGGTA TTCACCATGT TACTGCAATG 3120

ACAGATGATG CAGAACGTAA TTATAAATTT TTTACAGAAG TACTAGGCAT GCGTTTAGTT 3180

AAAAAGACAG TCAATCAAGA TGATATTTAT ACGTATCATA CTTTTTTTGC AGATGATGTA 3240

GGTTCGGCAG GTACAGACAT GACGTTCTTT GATTTTCCAA ATATTACAAA AGGGCAGGCA 3300

GGAACAAATT CCATTACAAG ACCGTCTTTT AGAGTGCCTA ACGATGACGC ATTAACATAT 3360

TATGAACAGC GCTTTGATGA GTTTGGTGTT AAACACGAAG GTATTCAAGA ATTATTGGT 3420

AAAAAAGTGT TGCCATTTGA AGAAGTCGAT GGCCAAGTGT ATCAATTAAT TTCAGATGAG 3480

TTAAATGAAG GGGTAGCACC TGGTGTACCT TGGAAGAATG GACCGGTTCC AGTAGATAAA 3540

GCGATTTATG GATTAGGCCC CATTGAAATT AAAGTAAGTT ATTTTGACGA CTTTAAAAAT 3600

ATTTTAGAGA CTGTTTACGG TATGACAACT ATTGCGCATG AAGATAATGT CGCATTACTT 3660 

GAAGTTGGCG AAGGAGGCAA TGGTGGCCAG GTAATCTTAA TAAAAGATGA TAAAGGGCCa 3720 

GCaGCACGTC AAGGTTATGG tGAGGTACAT CATGTGTCAT TTCGTGTGAA AGATCATGAT 3780

GCAATAGAAG CGTGGGCAAC GAAATATAAA GAG G TAG G 1 A TTAATAACTC AGGCATCGTT 3840

AATCGTTTCT ATTTTGAAGC ATTATATGCA CGTGTGGGGC ATATTTTAAT AGAAATTTCA 3900 

ACAGATGGAC CAGGATTTAT GGAAGATGAA CCTTATGAAA CATTAGGCGA AGGGTTATCC 3960 

TTACCACCAT TTTTAGAAAA TAAAAGAGAA TATATTGAAT CGGAAGTTAG ACCTTTTAAT 4020

ACGAAGCGTC AACATGGTTA ATTGGAATGA GGAGGATTTG TGATGGAACA TATTTTTAGA 4080

GAAGGACAAA ATGGTGCGCC AACACTAATA TTATTGCATG GTACAGGTGG TGATGAGTTC 4140 

GATTTATTAC CGTTAGGCGA AgcATTGAAT GAAAATTATC ACTTGTTAAG TATTAGAGGA 4200

CAAGTTTCAG AAAATGGGAT GAACCGTTAT TTCAAACGTC TTGGTGAAGG TGTTTATGAT 4260

GAAGAAGATT TGGCATTTCG TGGACAAGAA TTGTTGACGT TCATTAAAGA AGCTGCTGaA 4320

CGTTATGATT TTGaTATTGA AAAAGCAGTA CTTGTTGGAT TTTCAAATGG ATCAAATATA 4380

GCGATTAACT TAATGTTGCG TTCAGAAGCA CCATTTAAAA AAGCXTGTT ATATGCACCG 4440

TTATACCCAG TTGAAGTAAC GTCAACAAAG GATTTATCAG ATGTCAGTGT GTTGCTTTCT 4500

ATGGGGAAAC ATGATCCAAT TGTGCCATTA GCTGCAAGTG AACAAGTCAT TAACTTGTTT 4560

AATACACGTG GGGCACAAGT CGAAGAAGTT TGGGTGAAGG GCCATGAAAT TACAGAAACT 4620

GGATTAACGG CTGGTCAACA AATACTTGGG AAATAACAGT TCTATTAAGA AGCGGACAGA 4680

TGGAAAAGAT TTTTACTTTT CATCTGCCCG CTTTTTTGAT TTTGAAGTGC TGTACTAAAT 4740

TTTACAATAG TATAGATATT TTAATCGATA TGAGATTTGC CGGTAATACG CTTAATTAAA 4800

CCTTTATAGA GTACAGGTAT GAGTAAGVTG AAACCGAACA ATCCCATAAT AGGGAATACT 4860

TTTCCAATTA ATGAAATGAa ACCGATAAAT GTACTAATAT AAGTGATGAC AGCCATTGTA 4920

ATAATAATGA TGAAGTAACG TCTGCTGAAT GGAACGCTGA AACGTGACGC AAATGCATAC 4980

ATTAATCCAA CAACAGTATT GTAGATGACA AGTATCATAA TGACAGACAT AEAATACCA 5040

ATTGACGGAG ACATTTGTGT CGCTAATTTT AATGTAGGTA GATCTACGTG TTTAATTTTA 5100 

TCGAATTGAG AAATTAAACC TAGATTAATC ATCATGAGTA AAAATGTAAT GATTAAACCG 5160 

CCAATCAAGC CCCCGTATAA CGTTGAGTCA CGATATTTAA CTTTACTACC CATCACTGAT 5220

AAGAAGCTGA AGGCAGCAGC AATTTGCAAG CTTGCATAGT TAATCGCATC AAACCACCAT 5280

CCAGGTGATA ATGATTTCTG CTTATGAATC TGAGCATCAT TATTAGCGGC AGTAAAATCA 5340

AGATGACTTG TTGTGAAATA GTAGACCGCA ATCATAATGA CAATCGCAAT TAAAAATGGG 5400

GTAACACCGC CAAGCACAGC AATTAAACGA TCGffiTTTTA GAAACAGTGT TGCTAAAATA 5460

AAGGCGACTA ATATGAGTGC GCTCAGCCAA TACGGTAAGT TGAAACTTTG ATGAATGGTT 5520

GACGCACCAC CTGCAGTCAT AATAATAGCT AAAGACAACA TAAACATTGT TAAAATAATA 5580

TCAAAACCTC TTGCAATAGA GGGGTATAAG AAATAGTTAA TTGAATCAGA ATGATTTCTG 5640

GACTTTAGAT GATGACCTGT ATGCATGACA ACCATTCCAC CTAAAGTAAT CAATAGTCCT 5700

GTTACAATAA TGCCTGAAAT GCTATATGCG CCATGACTTG TGAAAAACTG GAAAATTTCT 5760

TGACCAGTAG CAAAGCCGGC ACCAACGACA ACACCAACAA AGGCAAATGC CACAATAATG 5820

GACTCTTTTA AGATACGCAT GATTTAAAAA TGTCCCTTCG TAATTTTAAG TAATATAGAA 5880 

AATGTAACAT ACATGTTAAT GAAAAATATA GTACTAATAT AGTATTTTGT TAAATTGGAG 5940

TAGAAGCGAG GGTGTCGGTC ATTTCATTAA TTTATTAGTT GATTTTGCAT TTTTTTGCTG 6000

TAAAGTTGTT ATAATACAGT TAACAGGAAT TAGCATAGAT EACCAATCC CCTCACTACT 6060

CGCAATAGTG AGGGGATTTT TTTCGGTGTA GCTAGGTCGC CTATTTATCA TCGTGTTTGC 6120

GTAgCaATGC GTAAACACAG TACCACTAAA TAAGTGCACG ATACATGCAT CAAATGTCGT 6180

CTTTAGTcTA AGTAACGATC ATGCATTAAC ATTTTCAAAA TATCTATTTG AGCTTGAAGA 6240

TCTTTACCAA TATTGGTATC ACGAATCTTC TTACGTTGTA ATTCTTTATC TACGACGCGC 6300 

TTTATAGAAA GTTCATCGAT ACCTTCGGAA AGTATTTTTn CTTTAGCGTT AAATTGTTGG 6360 

TGTGCAACGA GTTGCATACC GAATGAATTA TACAATAGTG TATAGCCTGC AATGCCAGTn 6420

GTTGACTGAT AAGCTTTTGA AA2GCCACCA TCAATGACAA GCATCTTTCC ATCAGCCTTG 6480

AT 6482 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16592 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

ATTTAAGGCG ATTGCTTGTG TATTTCTCTC TTTTGTAGGC AAACCTGCAC TCGTTCCAAA 60 

AAATGTAACT TCCATATATG CCCCTCCTTT TCTTCAATTC ATTTXT CAT AAAATTTGTA 120 

TCAATTTAGT TTTAATTTAT ACTTGAATCA AAATCTGTTA AAAATTCGTC TTTGCTAGTT 180 

AATTTTTCTA ACTTTAACGT AGACATAACT ATATAAATTT TGATAATTAC GTTATACTTA 240

TCATTAATAA GTATCACATT AAACATGATA CATGAATCGA TATTTCATTT AAGACACTGC 300

ATACAGTCGA GCATATTGTA TGACCTACTG AATGGATTAT CTTATAATAA TAAATCATAT 360

ATCTAATTAA GAATTGAGGT TTTAATCTTG AGTACTAAAA ACAAACACAT CCCATGTTTA 420

ATCACAATCT TTGGTGCACT GCGTGACTTA AGCCATCGTA AGTnGTTTCC ATCAATATTC 480

CATCTCTACC AACAAGACAA TTTAGATGAA CATATTGCCA TcATCgGTAT TGGACGTCGT 540

GACATkwnTA ATGATGATTT CCGTAATCAA GTAAAATCAT CAATTCAAAA GCACGTAAAA 600

GATACAAACA AAATTGACGC GTTTATGGAA CATGTCTTCT ATCATAGACA TGATGTTAGT 660

AATGAAGAAA GCTATCAAGA ATTACTAGAT TTTAGTAATG AATTAGATAG CEATTTGAA 720 

TTAAAAGGTA ATCGACTATT CTATTTAGCA ATGGCACCAC AATTCTTTGG CGTTATTTCT 780 

GATTATCTAA AATCTTCTGG TCTTACTGAT ACAAAAGGAT TTAAACGCCT TGTTATCGAA 840

AAACCATTCG GTAGTGATTT AAAATCAGCC GAAGCATTAA ACAATCAAAT TCGTAAATCA 900 

TTTAAAGAAG AAGAAATTTA TCGTATTGAC CACTATTTAG GAAAAGACAT GGTTCAAAAT 960

ATCGAGGTAT TACGTTTTGC GAATGCGATG TTTGAACCAT TATGGAATAA CAAATATATT 1020
TCAAACATCC AAGTTACATC TTCTGAAATA CTAGGTGTTG AAGATCGTGG TGGTTATTAT 1080

GAATCAAGTG GCGCGCTAAA AGATATGGTG CAAACCACA TGTTACAAAT GGTTGcATTA 1140 

TTAGCTATGG AAGCACCTAT TAGTTTAAAT AGTGAAGATA TCCGTGCTGA GAAAGTAAAA 1200

GTACTTAAAT CACTGCGTCA TTTCCAATCT GAAGATGTTA AAAAGAACTT TGTTCGTGGT 1260

CAATATGGCG AAGGCTATAT CGATGGTAAA CAAGTTAAAG CATACCGTGA TGAAGATCGC 1320 

GTTGCAGATG ACTCTAACAC ACCTACCTTT GTTTCAGGTA AATTAACAAT TGATAACTTT 1380

AGATGGGCTG GTGTASCATT CTATATTCGT ACTGGTAAAC GTATGAAATC TAAAACAATT 1440

CAAGTTGTCG TTGAATTTAA AGAAGTACCA ATGAACTTAT ACTATGgAAA CTGaTAAACT 1500

GTTAGATTCA AACCTATTAG TAATCAATAT CCAACCTAAT GAAGGTGgTA TCTTTtACAT 1560

CtAAATGcTA AGaAAAATAC ACAAGGTATC gAAACAGrAC CTGtCCmATT GtCTTACTCm 1620

ATGaGCGcTC aAGaTAAAAT GaATACTGTA GATGCATATG AAAATCTATT ATTTGATTGT 1680 

CTTAAAGGTG ATGCCACTAA CTTCACGCAC TGGGAAGAAT EAaATCAAC ATGGAAATTT 1740

GTTGATGCAA TTCAAGATGA ATGGAATATG GTTGaTCCAG AATTCCCTAA CTATGAATCA 1800 

GGTACTAATG GTCCATTAGA AAGTGATTTA CTACTTGCTC GTGATGGTAA CCATTGGTGG 1860

GGACGATATT CAATAATTGA ATTAAAACGC ACATGTTAAA CAAAAATAAA TGAGCGAATG 1920

ACAAGGATTG ATTACGAATC AAGTAAATCA ATTGTTCTAT CATTCGCTTT TTTTCTAGAC 1980

TATATTATGA AATTATATTT TACAATGCCC AAAACTATTT TAATAATCAT TGAACAAATG 2040

GGTGTATAAT TTATAGAAAT AATGTAGAAT AAAAATAAAT GATTGAATTA ATTGGAGTGA 2100 

AAGTTTTGGA CGTTATCAAG CAS^TACAAC AGGCAATTGT TTATATTGAA GATCGTTTAT 2160 

TAGAGCCTTT CAATTTGCAA GAATTAAGTG ATTACGTTGG TCTTTCGCCA TACCATCTTG 2220 

ATCAATCATT TAAAATGATT GTCGGCTTAT CTCCAGAAGC TTATGCACGC GCGCGTAAAA 2280

TGACACTCGC TGCAAATGAT GTGATTAATG GTGCTACACG ACTTGTAGffi ATCGCTAAAA 2340

AATATCACTA TGCAAATTCA AATGATTTTG CAAATGATTT TAGTGATTTT CACGGCGTAT 2400

CACCTATTCA AGCCTCTACT AAAAAAGATG AATTACAAAT TCAAGAGCGA TTATATATCA 2460

AATTATCAAC TACTGAGAGA GCACCTTATC CATACAGATT AGAAGAGACA GATGATATTT 2520

CATTGGTTGG ATATGCACGA TTTATAGACA CTAAGTATTT GTCACATCCT TTTAATGTTC 2580

CGGATTTTTT AGAAGACTTG CTCATTGATG GTAAAATTAA AGAGTTACGA CGATATAATG 2640

ACGTTAGTCC ATTTGAACTA TTTGTTATTA GTTGTCCTCT TGAAAATGGT TTAGAAATAT 2700

TTGTAGGTGT ACCAAGTGAA CGTTATCCTG CACACTTAGA AAGTCGATTT TTACCTGGCA 2760

AACATTGTGC GAAATTCAAT TTACAAGGTG AAATTGATTA TGCAACTAAT GAAGCTTGGT 2820

ACTATATTGA ATCAAGTTTG CAGTTAACAT TGCCATATGA ACGAAATGAT TTATATGTTG 2880 






AAGTGTACCC TCTCGATATT TCATTTAATG ACCCATTCAC TAAAATTCAG CTTTGSTTC 294 0 

CTGTTAAACA GAGTCCTTAT GACGAAGATT AAATAATAAA AAACAAAGAA GCCCCCTAAT 3 000 

ATATCTATAG GTCTACAAAT GGCCTTAGAT TCTATTAGGG GGCATATTAA TATGTTAATT 30 60 

TAGTTCGATA ACACATGCTT CATATGGACG TAACTGTTTT AAATTAACTT TGGCATCATA 312 0 

ATTAAATAGC TTTACTTCTC CATGGCTTAA ATCAAATGGT ACAGTTAATT CTGCTTCGTG 318 0 

GTTAGTAAGA TTACCTACAA TAAGAACTTG CTTTTCATTT AATGTTCTCG TGTACGCAAA 324 0 

AACTTGTGAA TTTTCAGCAT CTACTAAATC AAATTGACCA TATACGTATA CATCATTAGA 3300 

CTTTCTTAAT TGAATTAAAT CTTTATAAAA TTGTAATAT GAATGCTCAT CTTCTAATTG 3360 

TTGTGCAACA TTGATAGTTT TATAATTCGG ATTCACTGGG AACCACGGTT CACCATTTGT 34 2 0 

AAATCCTCCA TTTAACGTAT CATCCCATTG CATTGGTGTG CGAGAATTAT CTCGGTTCTC 34 8 0 

ATCTTTATAT TTCGCAAGTA AAGCGTCTAC ATCTCCACCT TGAGCTTTCA CTATTTGATA 354 0 

GTCATTTTTA ACAGCAACAT CGTTAAACGT TTCAATACTT TCAAATGGAT AATTCGTCAT 3600 

ACCAATTTCT TGACCTTGAT AAATGAATGG CGTACCTTGT TGCAAGAAAT AAACAGCTGC 3660 

ATGACTTGTT GCTGATTCAT ACCAATACTT GTCATCGTCA CCCCACGTCG ATACACGTCG 372 0 

TGGTTGGTCG TGATTTTCAA TAAACAACGC ATTCCAACCT TTATTTTCAA GTTGTTTTTG 37 8 0 

CCATCTATTT AAT AC AG AT T TATACGAATT T AC AT C AAAG TGAGAATCAC CACTATTCCA 38 4 0 

CAGTCCCAAA TGTTCAAATT GGAATATCAT ATTAAATTTA CCATTTTCTT CCCCGACCCA 3 90 0 

GTCATCAGCA TCATCAGGGC TTACACCATT CGCTTCACCA ACAGtCATAA TGTCATACTT 39 60 

ACTTAATGAG CGATCTTTCA TCTCTTGTAA CCAAGTTTGT ATACCTGGCT GATTCATATC 4 02 0 

TACATCAAAT GCTGGGGCAT ATGTTTTACC CTCAGGTACA GGTAAGTCAC CCGCTTCAAA 4 08 0 

CGTCTTCTTA ATATGCGTAA TTGCATCTAC TCTAAATCCA TCAATGCCTT TATCAAACCA 414 0 

CCAGTTCATC ATTTCAAATA CAGCATCTCT AACTTCCGGA TTACCCCAAT TCAAATCAGG 4200 

TTGTTTTTTA C T G AAT AAA T GGAAATAATA TTGCTCAGTA TTAGCATCAT ATTCCCATGT 4 2 60 

AGATCCATTA AATATACTTT CCCAGTTGTT AGGTTCAGAG CCATCTGGCT TTGGATCTTG 4 32 0 

CCAAATGTAC CAATCACGTT TGGGATTSTC TTTACTAGAT TTGGATTCTA TAAACCAAGG 4 380 

ATGTTCATCA GATGTATGAT TTACAACTAA ATCTAAAATA AGCTTCATGC CTCTATCATG 4 44 0 

AACACCTTTT AATAAACGAT CAAAGTCTTC CATCGTTCCA AATTCATCCA TAATCTCTTG 4 500 

GTAGTCACTA ATATCATAAC CATTGTCATC ATTAGGTGAT TTAAACATTG GSTGAGCCA 4 5 60 

AATGACATCG ATACCGAAAT CTTTTAAGTA GTCCAATTTA T C AAT CAT T C CAGGTAAATC 4 62 0 

CCCAATACCA TCGTGATTAC TATCATTAAA ACTTCTTGGA TATACTTGAT ATGCTACTGC 4 68 0 
TTCTTTCCAC CATTGCTTAT TCATTTTAAA ACTCCTTTGC TATCGCTGTG TTGATTTTCT 4740

TATTTTTAAT TCTGTATCTA TAATGACGAG TTCAATAACA TCCTGTGCTT TGTTTTTCAA 4 8 00 
TATATTTAAA ATTGCTGCAC CAGCCTGTTG ACCTAACATT CGAGGCTTGA TGTCAATACA 4 8 60 

GGTTTGTGGT GGTGACGCAA TTTCGGTTAA ATAAGAATCA TTGAACGTTG CTGTCATTAC 4 92 0 

ATCTTTCGGA ATTTCAATAT TAAGTTCATA TAGG\CACTT AAAATCGCTA AATGTAACAT 4 980 
AGCATCTAAC GAAATGATTG CCTGTTTAAT ATTTGGGTCC TTCAAACGCG TATGTAGATT 504 0 

TTGCATGTAA TTAAAAATAA CTTCTCTTTC ATTACTAGTC TCAATAATTT GATAATTAAT 5100 

TTTATTTTGA GAAGCTATCG TTTCAAATCC TTGAATTCTA TCTTTTGAAA CTTCAAAATT 5160 
TCCTTTTTCT GTAATAAATA TTAATTCATC TACACCTTGT TCAATAACAT GTCGTGTCAA 5220 
ATTTTCAGAA GCTAATATAT TATCATTATC TATATGTGTA AATTGATGAT CTATATCCGA 5280 
TGTAGGCTTA CCAATCACAA TAAATGGCAT GCTTTCATCA ATTAACATTT GTTTAATCGG 5340 

ATCATTTTCT TTTGAATAGA GCAGTATAAA CGCATCAACC ATTCGTTGTT TAATCATTTT 54 00 
ATAAACTTCA TCCATTAAAT CATTCATATT ATTTGAGACT GTCGTTTGTG TACCATAGCC 5 4 60 

ATGCTGGTTA CACGTTTCAG AAATTCCTAG CAATACATTG ATGTAGAATG GATTCAGTCG 5 520 

AATAGGCTCC TCAGACCCTT TTAACACTAA ACCAATTTTA SLTGTTTGGT TTGTAATTAA 558 0 
AGTTCTAGCA GCGGTATTAG GAAAATAATT CAATTCTTCC ATAACTTTCT TCACTTTTGA 5 64 0 

AATTGTCGCT TCGCTAATAC GTTGATTTCC TTTTATAACT CTTGAAACTG TCGAAGGAGA 5700 

AACACCGGCT TTTAGTGCAA CATCTTTAAT CGTAACCATT TAATCACCTC CTGTTAATTT 5760
CTGCATCGGA AAACGCTTCC AACCACTGTA TAATACCAGT TTAGTCACAC TTTCTAAAAA 5 82 0 

AGTCAAAAGA TTTGTGCAAA CGATTGCATA AAACGATAAA AATAAAACCT TCATACTGAA 5 8 80 

ATTCAATCCG AAAATCAATA TAAAGGTTTG TATAAATATT AAAATCGATT GTTTAGTCAC 5 94 0 

TAACTGCAAA ATAGTTACCT TGCCCATCTT GAAAATTAAA TACACGTTGA CCATTCATTT 6000 

CTACTATATC ATGCCCAGTT AAACCTAAAT CATTTAATTT TGAGTATAAT GCATCAAAGT 6 0 60 

TTTTCTCTTT AAACATTAAA GATGGTGTTC CTAGGTTCAC TTCCGGGCTA TGCTTTTCAA 6120 

TAAATTCTTT TGCCATAATC GTCAATGACG TTTCAGCATC TTTGGTAGE GATACTTCAA 6180 

CTGCAACATA GTCCTCAGCT AACGGTGTTT CACTTACAAC AACAAATTCT AAAGTTTCTG 62 4 0 

TCCAAAATGC TTTCGCTTTT TCGACATCAT CAACATATAA CATAACTTGA TTTAACTTTT 6300 

C C AT AAAAT A GTACCTCTAT TTCTCTATAG TACATGCTAT CATAACACAG TAAATATTTT 6360 

ATTACTTCAC AAAATGCTTA AAAAT ATGGC GGGATGCTTT TAAGGTCAAG GATAATACTT 64 2 0 

GTGTAATTTT TTATAGGTTG TAGCTACTCT AT C AC AC T C T CTTTTATATT TATCAAAAGA 64 80 

TATAAAAAAG GATAGTATCT TTCAACTATC CTTTAATCAA TATTATTCTT CAATCCATTG 6540 
TGTATGGAAT ACGCCtTCTT TATCTTTTCT TTCGTACGTA TGAGCACCGA AGTAGTCACG 6600

TTGTGCTTGA ATTAAGTTTG CAGGTAAATC AGCAGCACGG TAACTATCAT AGTAATTAAT 6660 

ACTTGATGAG AAACCAGGTG TTGGTACACC ATTTTGAACA CCAGTTGCGA CAACATCACG 6720 

TAACGCATCT TGATAT TCAG TAACGATGTT TTTAAAGTAA GGATCTAGCA ATAAGTTTG 67 8 0 

TAATCCTGGA TTATTATCGT AAGCATCTTT GATCTTTTGT AAGAATTGTG CACGGATAAT 68 4 0 

GCAACCTTCT CTCCAAATCA TAGCTAAATC ACCAAGTTTT AAATTCCATT CATTATCTTC 6900 

ACTTGCTTTA CGCATTTGcG CGAAACCTTG TGCATAAGAA CAAATTTTAC TCATATATAA 6960 

TGCTTTACGA ATTTTTTCTA AAAAGTCTTT CTTGTCACCA TCAAATGATG CTTTTGGACC 7 020 

ATTTAATTCT TTAGAAGCAT TTACGCGCTC TTCTTTGaTT GAAGAGATAA AACGTGCAAA 70 80 

T AC AG AT T C A GTAATGATTG TTAATGGAAT ACCTAATTCT AATGCGTTAA TTGAAGTCCA 714 0 

TTTTCCTGTA CCTTTTTGaC CTGCAGTATC AAGAATTTT TCAACTAATG CTTCTTTATT 7200 

TTCATCTAAT T T CAT G AAAA TATCACCAGT GATTTCAATT AAATAACTTT CTAATTCACC 72 60 

AGCATTCCAG TCTTTGAACG TTTGAGCAAT GTCTTCATGA GACATGCCTA ATAATTCTTT 7 320 

CATCATAGCA TAACTTTCTG CAATTAATTG CATGTCAGCA TATTCGATAC CATTGTGTAC 7 38 0 

CATTTTCACA TAGTGTCCAG CACCATTAGG TCCAATATAA G T AAC AC AT G AAGCACCGTC 74 4 0 

TTTTGCCTTT GCAGCAATTG CAT C AAG AAT ATCTGCAACT TTGTTATAAG CTTCTTCTTG 7500 

TCCACCCGGC ATTAATGACG G AC C AG T T AA CGCTCCAATT TCACCACCAG AAACGCCCAT 7 5 60 

ACCAATAAAG TTGATTGCAC TTTGTGywAA TGCTTTATTA CGTCTGATAG TATCTTGATA 7 62 0 

GTTTGTATTA CCACCATCAA TTAAAATATC TCCATCATCT AATAAAGGTA ACAAACTATC 7 68 0 

AATCGTTGCG TCCGTAGCTT TACCTGCTTG AACCAT T AAT AAAATTTTAC GTGGTTTTTC 77 4 0 

TAAAGAATTA ACAAATTCTT CCAATGAATA CGTTGGATGA ATATTTTCC CTTTTGATTC 7 8 00 

T T C AAC CAT T AAATCAGTTT TTTCACTTGA GCGGTTAAAT ACAGATACAC TATATCCGCG 7 8 60 

TGATTCAATA TTCCAAGCTA GGTTTTTACC CATAACGGCT AAACCAATAA CTCCAATTTG 7 92 0 

TTGTGTCATA TTACTTACCT CACTTGTTGA TTTTTCATTA GTATTGTATC ACAAAATAGA 7 980 

CAT AC ACT AC ACTAAATCAT TTCGAATGTC GCGCAACTAT TTTGATTATT TCTAACACTT 804 0 

GACTTGCAAG CAAGTTCAAT GATTTAATCG GCATTCTCTC ATTTGTTGTA TGGATTTTTT 8100 

CATAACCCAC TCCTAAAATG ACTGAAGGAA TACCAAATGT ATTAATAATA CTGCCGTCTG 8160 

AACCGCCACC AGAAATAATT GTATTTG:AG ATAATCCTAA ATTACGAGCA CTTTCTTGTG 8 22 0 

CAATTTTAAC AACCGCTTCA TTATCATTAA TTTTAAATCC TGGATAACTT TGCTCCACTG 8280 

TAACTACTGC TTTCCCACCT AATTCTGATG CAGTAGTTTC AAAC AC AT C A GTCATATGTT 834 0 
TGACTTGTGT TTTTATTCTT TCTGGATCGT GAGAACGTGC CTCTGCTTCT AAATGACTT 8400

CATCTGCAAC AATATTCGTA GCTGAACCGC CAT G AAAC T T ACCAATATTG GCAGTAGTTA 84 60 

TTTCATCAAC TTGTCCTAAT TTCATTCGAC TAATTGcTTT CGCCGCAATA TTAATAGCAC 8 5 20 

TAACACCCTC TTTTGGCGTA CTTGCATGAG CCGTTTTGCC AAAAATTTTA GCTGAAATTA 8580 

ACATTTGCGT CGGTGCACCT ACAACCGTAG TACCGACATC AGCACTTGCA TCAATAGCAT 8 64 0 

AACCAAAGTC CGCGTCCAAC AACTCTGAAT TTAATTCTTT AGCACCAATT AAACCTGATT 87 00 

CTTCTCCAAC AG T AAT C AC A AATTGAATTT GTCCATGTGG GATTTGTTGT TCCTTTATCA 87 60 

CTTGCAAAAC TTCAAGCATC GCTGATAATC CTGCTTTATC ATCTGCACCT AGAATAGTCG 8 820 

TACCATCAGA GTATATGTAG CCGTCATCTT TTACAATTGG CTTTACATTA ATTGCGGGTA 88 80 

CAACAGTATC CATATGGCTC GTCAAATATA ATTTAGGTAC TTCGCCTTCT TCGATAGTAC 8 94 0 

TATTCATTGT ACACACTAGA TTATTGGCAC CTAATTTAGG ATGTTTAGCC GCTTCATCTT 9000 

CTTTAACATC TAACCCTAAT GCTATGAATT TTTCTTTTAA AATAGGTTGG ATTGTTGATT 90 60 

CATTCCCTGT CTCAGAATCG ATTTGTACAA GTTCAAAAAA CGTATTAAGT AATCTTTGCT 9120 

CATTAATCAT AATGGAACCC CTTTTATCAA AATTTTCATG TACGCCAAAC GGTAAAAATT 918 0 

GATGAAATAA AATGTTACAG TAATTGACGT TACACAGATT TATCAGGTTT GTAAATTGTG 92 4 0 

T CAT AT TAT T TTCAATTTAT TATATATAAT TATTGTAACT CAAACTAAGC TTTGTCAAAA 93 00 

ATATATTGAT TGATTTTTCA AAGATATCGT ATAATGAGGA AAATGACATA AGCAAAC T T A 93 60 

CTCATGTTTT TTATTATATT CCTTTATGAT GATTGCTAGT ATATCGTCT CAAGTTAAAA 942 0 

GTTTTATATC TTATGTCGTA AT TAT T AAT A CAAAGGTTAT TCATTTGGAG GCACACAAAA 94 80 

TGCAAAATAA AGTTTTAAGA ATTATCATTA TCGTTATGCT TGTATCAGTT GTATTAGCAT 95 4 0 

TGTTATTAAC GAGTATCATT CCAATTTTAT AAACTATATC TCAACTACCT ATACAAAATC 9600

ATACAATTAA AAATCCATCC ATTATAAACG CATGTATTAA TAAGTTATCG TATTGCAACG 9 6 60 

ATTACTTTCA AACATGGGTC ATACGGATGG ATTATTTTTT AAGCTACTTC ACTATGCATT 97 2 0 

TTCAATGAAC CAAATTGCGA TTTGATTTGT AAATATTCTT CTAATTCATT TAATATTTGA 9780 

AT AAT AC T T G CTCTCGAGTT AAGCGCTTTG TGTGTTGTTG GCAATGGCAG TTCATCCAAT 98 4 0 

TTCAAACGCG TCTCATACAA ATTGTGTAAA CGCATTGCTG TATAGTCATT ACTATTCACA 9900 

TTTAGACCAA TTTCTTTCAG CAGTGACGCA ACATCATTTA AAAGCGGATC TTTATGACAG 9 9 60 

ATACTTTCGA TGAGCGGTTT CATTCTCATT AACAATTCCA CTTGCTCTC TCGCATATCA 1002 0 

AAATAATGAT AGTATGAATT TTCGTTTCTA ACAAAATGAT TTTTAACATC TCGGAACGCG 10080 

ATAGACTtCG CCTTTTTAAT ATTTAAAAGT AACACTTCAA ATTCAATCGC AATGGTATCT 10140 

TCATATTTTT CACAAATATA ACTATATTTA CTAAAAATAT CAGCAATTTG TTGCTCAATT 102 00 
TTACATTTGT ATTCGTCtAG TTGTTTGTCT AAACTTGGCA TCATTAAATT CaTTGTAAAT 10260

GCAATGCTTA GTCCAATTAA CAGTAATAAT GTTTCATTAA CAATTAAATG TGCATCAATT 10320 

GATTTTGCAT TAAAAACATG AAGTAATATA ACGCAACTCG TAATGACACC TTCTTGTACT 1038 0 

TTTAATACGA CAGTTAATGG TATAAATAAC AATACGATAA TACCGAGTAC AATTGGACTC 10440 

TGACCTAATA AACTAAATAT TGCTGAACCT AAAAACAATA CTAAAAAACA TGATACTAAT 10500 

CTTGAAATAA TCGCTTGTAG CGAATGTACT TTTGTATGTT TAATACATAA TACGACTAAT 10560 

ATGGCGCTTG AAGCATAATT ATCTAAACCT AACAGCTTAC TAATAATTAC ACCTflaGTC 10 620 

ATACCCACTG CTGTTTTTAT TGTTCTAAAT CCAATCTTGT AAGGATTTAA CTTTAACATG 10 680 

GGTTAGCGCC TCTTATCTTT CTTCACAATA TTTATTGAAT AATGTTTGTA ATTGATTAAT 10740 

TACGTTCATC ACATCATGAC CTTCGATTTG ATGTCTTTCA ATCATTTCTG TAATCTTTCC 10800 

ATCTTTTACT AATGCAAATG ACGGACTTGA AGGCGCATAA CCTTCGAAGT ATTCACGCGC 108 60 

TCTTTGTGTC GCTTCTTTAT CTTGTCCAGC AAATACTGTC ACTAGACGAT CAGGTAATAC 10920 

GTCATAATGT AAAGCATGTG ATGCTGCTGG TCTTGCGATA CCACCTGCAC AACCACATAC 10 980 

AGAATTGATC ATAACTAGTG TTGTACCATC TTGTTTASA ACTTTGTCAA CATCTTCTGC 11040 

AGTAGTTAAT TGCTCATATC CCGCAGATTC AATTTCATTC CTTGCTTGTT CTACAACACC 11100 

GTTCATGTAT AAATCGAAAT TCATGnCCAT AAGTTCAATC ACCTATCCCT TTATATTTAA 11160 

ACTAtCCTCA TTCTACTAAT TAATAACATA TTGTTCAATA AACTAATCTG AATCACACCT 11220 

ATATTTAGAC ACAATTTTAA CAATATACCA AACATTATTG TGCTTAAAAT CATGGTAACT 11280 

AATTTGTTCA CATGTTTTCA TTAATATGTT TCAAGTATGA TGTCTTATTT TGACTTTACT 1134 0 

GCAAAAATGC ATTCAACCAT GTTGATTATT GTTCTTTATC TTTTTTGAAT AT AT T G C AC A 114 00 

TATTTTAGTG CCAAAAAATA ATACATCCAT CGACAAGAAC AAGATAAAAC AAGTTGTCGA 11460 

TAGATGCATC TATGTTATCA CTAATATATA TTTGTATTTT CTAAAGTATA CTGTTCGATA 11520 

CGCTGTTTAA TATGATTCAT ArATTTACCT GTTTGTAAAC CAT C T AAAAT AC GAT GAT C A 11580 

ATTGAAATAC ATAAATTAAC CATGTTACGA ATTGCAATCA TATCffiTAAT TACTACTGGC 11640 

TTTTTAACGA TTGATTCTAC TTGTAAAATC GCTGCTTGTG GATGATTTAT AATACCCATT 117 00 

GATGATACTG AACCAAATGT ACCAGTATTA TTTACCGTAA ATGTACCGCC CTGCATATCT 117 60 

TCAGCTGTCA ATTGCTTATT ACGCGCTTTC GTTGCTAAAG TATTAATTTC TCTAGCTATA 11820 

CCTTTGATTG ACTTTTCGTC TGCATGCTTA AT CACAGGT A CGTATAATTT ATTTTCATCA 118 80 

GCAACAGCAA T TGAAAT AT T AATGTCTTTA TGTAAGACAA TTTCATTTCC TTGCCAGCTA 11940 

CTATTTAATA AAGGAT AT G C TTTTAAAGCA TCTGCTACAG CTTTTACAAA GAAAGCAAAG 12000 
AACGTTAGAT TATATCCTTC TTTATTTTTA AAGCTGTTTT TATAATGATT TCTCGTATTC 12060

ACAAGATTTG TAGCATCTAC TTCAATCATC ATCCATGCAT GTGGAATCTC TGTTACACTA 1212 0 

TTAACCATAT TTTGCGCAAT TGCTTTACGC ACACCATTTA CTGGTATTGT GCTGTTTTCA 12180 

CTATTGTCTT C AG AT GAT T G GTTACTTGAT GTATCTACTG ATGTTGATTT TETTGAACT 1224 0 

TGTTTGTCAG ATTGAGCTGT GGTACCACCA TTTTCAATAA C T G AC AT TAT ATCCTTCTTA 12300 

GTTACACGAC CTTCAAATCC ACTACCTACA ACTTGTGATA AATCAATGTC ATGCTCTGAA 12360 

GCGAGTTTAA ATACAACAGG TGAAAAGCGA CCATTATTAC GTGGTTGATT TTGTTTAGCA 124 20 

GTAGATGTCT GTTCCACTGT TGCACTAGCT TTTTTAGTAG ATTTCTGAGT ATGCTCATCC 124 80 

ACTTTTGCTT GTATCTCTTC AGTTGTTTCA TTTGTCTTTT CATCAGCAGT TTCAATTTTA 12540 

CAGATAATTG TATCAATAGC TACTGTCTGC CCCGCTTCAA CTAAAATTTC TGTAATTGTT 12600 

CCTGATATCG TGGAAGGGAC TTCAGCTGTC ACTTATCTG TAATAACTTC ACATAATGGT 12 660 

T CAT AT T CAT CAATATGATC ACCAACAGAA ACTAACCATT GTTCAATGGT GCCTTCATGA 12720 

ACACTCTCAC CTAACTTAGG CATTGTTATT TCCATGACTT TCCCTCCCTA GAATTCTGCT 127 80 

AATTCACGCA TTTTATTTAA GATTTTTTCT GGATTCATCA TAATTTCATT TTCTAATACA 12840 

G G AG AAAAT G GCATAGATGG TACAtCTGGA GCAGCTAAAC GCATGATTGG TGCATCTAAA 12 900 

TCGAACAAGC AATGCTCTGC AATAATCGCT GACACTTCTG ACATAATACT ACCTTCTAAA 12 9 60 

TTATCTTCAG TTACAAGTAA AACTTTACCT GTATGTTTAG CACGATCAAT AATTGTTTCT 13020 

TTATCTAATG GATAAACAGT TCGTAAATCA AC G AC T T C AA CATTGATACC GTCTGCAGCT 13080 

AAAATATCCG CTGCTTGTAA ACAATAATTG ACCATTAATC CATAACAAAA TACTGTTAAA 1314 0 

TCTTCACCTT CACGTTTCAC ATCTGCTTTT CCTAAAGGTA CAGTGTAATA TTCTTCTGGC 13200 

ACTTCTTCCT TTAAGAAACG ATAAGCTTTT TTATGCTCAA J5TACAATAC TGGATCATTT 132 60 

GATTCGATAG ATGATAATAA AAGCCCTTTA GCATCATACG GTGTGGAAGG AATAACAATT 1332 0 

GTTAAACCTG GCGATGAAGC AAAT AT AC T T TCAATACTTT GTGAATGATA TAGTCCTCCG 1338 0 

TGAACACCGc CACCAAATGG TGCACGAATC GTTAATGGGC ATTGCCAATC ATTATTTGAA 13440

CGATAACGCA TTTTCGCAGC TTCACTAATA ATTTGATTTG TCGCAGGTAA AATAAAATCT 13500 

G C AAAT T G AA TTTCTGCAAT TGGTCTTTTA CCTACCATAG CTGCACCAAT GGCAGTTCCA 13560 

ACAATATTTG AC T C AG C T AA TGGCGTATCG ATAACTCTGT CTTCACCATA TTTTTGTTGC 13620 

AGTCCTTGAG TAGTACCAAA TACECCACCT TTTCTACCAA CATCTTCACC AAGAATAAAC 13 680 

ACATCTTTAT TTTGTTGTAA TGCTAAGTCT TGTGCCtGcG TATCGCCTCT AAATAAGATA 137 4 0 

ATTTAGCCAT TAGTTAAGAC TCCCTTCTTC GT AC AC AAAT GCATAGGCTT CTTCGACACT 13800 

TGGATATGGC GCGTCTTCAG CAGCCTTTGT CGCTTTATTG ATGATGTCT TnATgTCCGC 138 60 
TTCTATTTCT GCCAACCAAG CATCATCGAT AATGCCAGCT GAAAGCAACT CTTTTTTGAA 13920

CTTTTCATTG CAGTCTGCTT TTTTAAGcGT TTCACGCTCT TCTTTCGTAC GATATTGGTC 13 980 

GTCATCATCT GATGAATGAG CTGTCATACG ACTTGTTACT GCTTCAATCA AAGTTGAACC 14 040 

TTGACCAGAA ATAGCTCGAT CTCTTGCTTC TTTCATCGCT TTATACATTG CTAATGGATC 14100 

ATTACCATCT ACTTGTTCAC CATGTATACC GTAACCAAGT GCTCTATCCG ATAATTTTTC 14160 

AGCTGCGTAT TGTAATGAAT CAGGTACTGA AAT TGCATAT TTATTATTTA TAATGACACA 14220 

TACAAAAGGA AGTTTGTGTA CACCCGCGAA GTTTAAACCT TCATGGAAGT CACCTTGGTT 14 280 

TGAGCTACCT TCACCAACAG TTGCTGTTGC AATTTTCTTC TTACCATCCA TTTTTAAAGC 14 340 

TAAAGCAGCA CCAACAGCAT GGGGTATTTG AGTTGCTACC GGTGAACTTT GAGACAAAAT 14 4 00 

ATTCTTAGCT CTACTACTAA AGTGTGATGG CATTTGTTTT CCACCAGAGT TAACAOGTC 144 60 

TTTCTTTCCA AACGCTGATA AAAACGTATC ATACGCTGAG ATACCCATAT AAGTAACGAA 14520 

AGCTAGATCT CTATAATAAG GCGCTGTAAT ATCACCTTCT TCTAATGCGT ATGCCATCCC 14 580 

AATCTGAGTT GCTTCTTGTC CTTGACCACT TACAACAAAT GGAATTTTAC CTGCACGGTT 14 640 

CAATAACCAC AGTCTTTCAT CTATTTTTCT ACCTAAATCC ATCCATTTAT ATATTACTTT 14700 

TAGGTCTTCT TCGCTAAGGC CTAATGATTT ATAATCAATC ATGTTAAATC CTCCTATTTA 14760 

TACGTGAATA GCTCTACTTT CTGCTTTCAA TCCTAATTCC ATCAACACTT CAGAGATGGA 14820 

AGGATGTGCG TGTGTTGTTA GTCCTAATTC TAATGCC&G CCATTCATGA ACTGTAACAG 14880 

TGATGCCTCA TTAATCAATT CTGTTACATG TGGACCAATC AT ATT AAT AC CCACAATTTC 14 940 

TTCAGTTGAT TGATCAATCA CCATTTCGCT ATACCCTTCG TTTGTGTCAT GGCTATCAAT 15000 

CACTGCTTTA CCAATTGCTT TAAATGGTAC TTTAAAACTT TTAACTTTCA TTCCCTCTGC 15060 

CTTTGCTTGT TCAATGTTTA AACCGATAGA AGCAATTTCA GGTTGTGAAT AAATACACTT 15120 

AGGCATCATG TTATAGTTTA CTGGGATTGG GTTCCCCTCA AACATATGAT CAACAGCCAC 1518 0 

AACACCTTCT TTTGATCCAA CATGTGCCAA TTGTAATTTT CCTATACAAT CACCAGCTGC 1524 0 

ATAAATATGT TTATCTTCAG TTTGTTGAAA TTCGTTCGTT AAAATATGTC CTGATGTTGa 15300 

AAGtTTTATT TTAGTGTTGT TTAAACCAAT ATCTGATGTG TTAGGTTTTC TACCAATCGA 15360 

TAGCAACACT TTATCTACTT TAATTATGTC TGAGGAAATT TCAAACGTAA CACCATCTTC 154 2 0 

GTTAACATTT ATATCATTTT CAGAAAGTTT TATTCCCTCA TAGAffiTTAA CACCACGTGC 15480 

TGACAATGAT TTTTTTAATA GTTGTGAAGC TTGTTTACTT TCAGTTGGTA AAATTCTTTC 15540 

ACCTGCTTCT ATAACTGTTA CGTCAACACC TAAATCTATC AT C AAT GAT G CAAATTCCAT 15600 

TCCGATAACA CCACCACCAA TAATACCAAT ACTTGATGGT AACGTCTTTA ATGATAATAT 15 660 
ATCATCGCTA GATAAAATTT TATCATGATC AAATGATAAG AATGGCAACT CTGCAGGCGA 15720

AGAACCAGTT GCAATTAATA CAAATTGGTT GGGTAATAAG TCTGATTCAC CATCTTCATA 15780 

TTCGACAGAA ATTGTGCCAC TTTGAGGTGA AAATATAGAT GTACCTAGAA TACGTCCCGT 1584 0 

GCCATTATAA ATGTCAATGT GATTGTGTTG CATTAAATGC TTTACACCTT GAT AC AT T T G 15 900 

ATTAATAATG TCTTCTTTTC GTGCCAACAT ATTTTCAAAA T T AAC AT TAG CATCTTTGAC 15 960 

ATCAACGCCA AACATTGCTG CCTGTTTTAC TGTTTGAAAT ACTTCAGCAG ATTTAAGCAG 16020 

CGATTTAGTA GGAATACAAC CTTTATGGAG ACAAGTACCT CCTAATAGTT GCTGTTCTAC 16080 

TATTGCCACT TTTTTACCTA ATTGAGACGC ACGTATCGCA GCAACATATC CTGCAGTACC 1614 0 

TCCACCGAGA ACGACTAAAT CATATTGTTT CTCTGACATG TTCTTACTCC TAACTAATGA 16200 

TATATATCCA TTGAAAATTT ATTAATACAT AGTTTTCATG TCCATTAATT ACCTATTTTA 162 60 

CATGATTGTC TATTTAGTTT GAATGCACAT AAATAAATCC AT AAAT GAG T ATTCAACACA 1632 0 

CTTTGGAAAT TATCTACGTC TTTGATTGTG TTGTATCATT TCTCTTGCAT TTTCTCGAGT 16380 

TAAATCAGTA ACACTTGCAC CTGAAATCAT TCGTGCAATT TCATCTACTT TATCATCGCT 16440 

AATTAACTCT TGAACTTGTG TTGTTGTACG ATCA'CTTTT GATGATTTCG AAATTAATAA 16500 

ATGATGGTCG CTCATCGATG CAACTTGTGG TAAGTGAGAG ATACAAATAA CTTGTATATA 165 60 

TTCTGCTaTA TCTCGCATTT TCTCTGCCAT TT 16592 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13794 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

CCAATACAAC GTAAAAAGAT TGCTTGTGTT ATTAATGAGT TAGATAAAAT AATTA&GGA 60

TTTAATAAGG AAAGAGACTA CATAAAATAT CAATGGGCTC CAAAATATAG CAAAGAnTTT 120

TTTATACTTT TTATGAACAT TATGTACTCA AAAGATTTTT TAAAATATCG ATTTAATTTA 180

ACATTTCTTG ATTTATCTAT CTTATATGTA ATATCATCTC GAAAAAATGA GATACTAAAT 240

TTAAAAGATT TGTTTGAAAG TATTAGATTT ATGTATCCTC AAATTGTTAG GTCAGTTAAT 300 

AGATTAAATA ATAAAGGTAT GCTAATCAAA GAACGATCCC TTGCAGATGA AAGGATTGTG 360

TTAATCAAAA TAAATAAAAT ACAATATAAC ACTATTAAAA GCATATTCAC AGATACTTCC 420

AAGATTCTCA AACCAAGAAA ATTTTTCTTT TAAATTTAA CAGATTTACC TCTTGATAAA 480

ATAAATAAGC AATCATACTA CTTCTCAATT TAGTATAAAT AAAAATACAT AATTAACTTT 540

CTTTTGTTTT TATATTATTT CAATACCCTA CTATATATCA CAACACATAA ATTAAGCATG 600

ACACTCATTC AATTTAGTTC ACCATTTCGT GTTCCAATTT TACTGAGTAT CATGCTTTTA 660 

ATGTTATAAA CCTAATGCTT TAATAAATCG TGTTAATTCT TCTCGCATAC TGTCATCTTT 720 

CAATGCATAT TCTATGGTAG TTTTAACGAA GCCTAATTTT TCTCCAACGT CATAACGTTC 7 80 

GCCTTCGAAG TCATATGCAT ACACTTGGTT ATCATTATTC ATACGTTCAA TCGCATCTGT 840 

TAACTGAATT TCGTTACCTG CGCCTTCTTT TTGCGTTTTT AAATAATCGA AAATTTCAGG 900 

CGTTAATACA TAACGTCCCA TAATAGCTAG GTTTGATGGT GCCGTACCTT GTGCTGGCTT 960 

TTCAACAAAC TTTTTCACTT CATACTGACG TCCGTTTTTA GTTAATGGGT CAATAATTCC 1020 

ATAACGATGA GTATCTGCTT CCGGAACTTC TTGGACACCT ATAACC GAGT GCCCTGTTTC 108 0 

TTCATAAACG TCAATCAACT GTTTCACTGC TGGCACTTCA GATTCAACAA TATCGTCACC 114 0 

TAATAATACC GCAAATGGTT CATTGCCGAT AAATTGACGC GCAGAACTAA TCGCATGCCC 1200 

TAAACCTTTT TGTTCTTTCT GCCTTACATA AAAAATATTC GCAAGTTCCG TTGAATACTG 12 60 

AACTTTCTCT AGTAATTCAG ATTTACCTTT TTCTTTTAAC ACCATTTCTA ATTCTTTTTG 1320 

ACTATCAAAA TGATCTTCAA TCGCGCGTTT GTGGCGACCT GTCACTATAA TAATATCTTC 13 8 0 

AATTCCAGCT CTTGCAGCTT CTTCAACGAT ATATTGTATT GTGGGTTTAT CTAAGATAGG 14 4 0 

AAGCATTTCC TTTGGCATCG CTTTAGTTGC TGGTAAAAAT CTAGTCCCTA AACCAGCAGC 1500 

GGGAATGATT GCCTTTTTTA TTTTTTTCAA AGTTAATGTG CTCCTTTTCC TAAGTATTAA 15 60 

ATCTATGTAT CAACGTCATT TTAACACTAA TTAGAACGCC TTCATAGTGT CATTGAGTAT 162 0 

GTAATTATTT CTTGGGAAAT TTGTTTTAAT TTTAAAAAAC AGGCTTACTT CEATAATTT 168 0 

ATGAAATAAA CCTGTCAATT TTGGATTGAT TATGCTTTGT GATTCTTTTT ATTTCTGCGT 17 4 0 

AATAACGCTA AACCTAAAAT GCTAAATAAT CCGCCGAACA ACATGCCGTT GTTTGTTGAT 1800 

TCTTCTCCAC CTGTTTCAGG TAGTTCAGAT TTCTTAGATT GTGCTTTTTT AGTTGGTACC 1860

ACTGCTTTAA CCTTTTCATT GATTTCAATA ACAGGTGTTA CTACTTTACC TTGTTCCACT 192 0 

GGTTTAGAAG GTTTTTTAGG TTCTTCTTTA GCAGGTGGTA TTGGTTTACC AGGTTCAGTT 1980 

GGTACCTCTG GCGTTGGCGG TGTTGGTGTT TCCGGCTCGC TTGGTACTTC TGGTGTCGGT 2040 

GGTGTTGGTG TTTCCGGCTC GCTTGGTACT TCTGSTGTCG GTGGCGTTGG TGGCACGATT 2100 

GGAGGTGTTG TATCTTCTTC AATCGTTTGT TGACCTTCAT TATGACCACT TACTTGTGGA 2160 

AGTGTATCTT CTTCAAAGTC AACACTATTG TGTCCACCGA ATTGATAATT TGGTTTATCT 2220 

TTATTTGTAT CTTCTTCAAT AATTTCAGTG TGCTTATTGA ATCCGTGAAT ATGTGGCACA 2280 

CTGTCGAAGT CGATATCAAT GATATTACCA CCTTGTTCAT ACTTAGGTTT GTCTTTCTCT 23 4 0 
GTATCTTCTT CGAATGATTG GTTACCATTA TTTTGACCAT GAATTTGAGG TACACTATCG 2400

AAATCGATAT CTACGATATT GCCACCTTGT TCATATTTCG GTTTATCTTC TTCTGTGTCT 24 60 

TCCTCAAATG ACTGATTACC GCTATTTTGG CCACCTTCGT AACCTAATTC ACTCTTAATA 2 52 0 

TCCACGTGGC TATTTTCTTC GATTTCTTCA ATCACGCCAT AATTACCGTG ACCATTTTCA 2580 

GTTCCTAAAC CAGAATGAGA AATATGATGA TTGTTTTCAG TAATTTCCTC GATTGGTCCT 2 64 0 

TGCGCTTGAC CATGTTCTTC AGGTAGTTCA TCTACTAGTT &ATCAGATT ACTTTCAGTC 27 00 

GTATATTCTT TCGTATCTTC AATTGTTGTA TGATCGCTAA CAGCACCAGT TACAATACCT 27 60 

TTTGTAGAAT CTTCGTCAAA TTCAACTAGG TTAGACTCAG TAGTAACCTG ACCACCACCT 2820 

GGGTTTGTAT CTTCTTCATA TTCAACAACA TCAGCATGAT GTTTTGAATT TTCATGTGTC 2880

GATTCTTCAA AGTCTACATG AATAGAATCT TCTTCAGTTT CAATGGTACC TTCTGCATGA 2 94 0 

CCTTCTGCAC CTTCAACAGC TGTATGATAT TCAAAATCAA TTGGCTTAGA ATCATTACTT 3000 

TCTTCGATTG TACCAGTCAA TTCATGCTTC TCCACTGGCG GCTCTGATTT AAATTCAAGT 3060 

TCGATAGGAG TACTATGTTC TATAATAGGT TCCTTTAGTT TATCTTTGCC GTCGCCTTGA 3120 

GCGTTATTAG AGTAAAATGC AACGCCATTT TTCCaAGTTA AATTACTTGT ATAATAATAG 3180 

TTATAATATC CAAAAAGGTG TGTTTGAAAT TCTAAGTTGC TAGCATTTGA ATCATAATAC 324 0 

CCTTCATATT T T AT T AC AT A ATTTTTACTT TGGTCTAAAT TATTAAAGT TAAAGAATAA 3300 

CCACCATTAG TATCAAAATC TAAACTCATA TTATCAGTCA CATCTTCAAA TTTGCTGACA 3360 

TCATCAAGCT TTGCATAnTn AgctTTCAGC TAAATCGTCT GAACCAATGT GTTTATATAC 3420 

CTTAACTGTT GGATTATTAA CCCCTGGTTT ATTTCCTTTA GTTACTTGAC CAGTTACTGT 34 80 

CACAGAGCTT AACGACTGGT TGTTAGGTTT CATGTACGCA AAATGACTAA ATTTCCCATC 354 0 

TACTTTATTT AAAGTATCAA TTCGACCATT AGCTGTTACT CCCCAATTAT CTCTAACTCC 3600 

ACCTAAATAT TGAATATTAA ATATTTTGCT AACCGTAGTC TCACCCAATT TAACTTCAAC 3660 

ATTTTGGTTA CCTTTTTGCG TCACTGTTGT AGGATCAATA AATAGATTTA AAGATAATTC 3720 

AGCAGTTAAA TCTTTCTTTT CTTGTACATA TTCTTTAAAC GTATATCTAA CTTTTCTTTC 3780 

TCCAATTATT TCTCCTGTCG CCATAACTTG ACCATCTGTA CTTTTTATCT CCGGAACTTT 384 0 

ACGCAGTGTT GAGATACCAT GAGTTTCAAC ATTATCGCTT AATGTGAAAT CAAAASATC 3900 

TCCCGCCTTA ATTCCTTCTC CAAATTTCCA TTTATATTTC AAGGTTACTC TTTCTGCGTT 3960 

ATGAGGATTT ACAACATTCG TATCTTGTTT ATGTCCTACA ATTTCACTAC CTTCTTCTAC 4 02 0 

TTCCACTTTA TTTGTTACAT CTGTACCTGT CGCTTTAGTT TCTTCCACTA CTTCTTTCTC 4 080 

TGCAACTGCT GTAACGTCAt TGatCTTTTC ATTCTTGGTT TAATTTCTGA GACGTTACTT 414 0 

GGTTGAGCTA TGTCAACTTG AGTTCCTGTA GTTTCCTTAT CAGCAACTTT TTCCGATGGC 4 200 
AAATCAACTC GCGAAgTTTC TACTTTTGGT GCTTGCAcAG TTTTCGGTGC TTCTTCTGTT 4260

GTTACTTGTG TTGATTGTGA TGGTTGCTCA GTTGATGEG CGCTGTATGA TTGTGTTTCA 4320 

TCTATTGTAT TAACGTTATT TGTAGTTGTT TGTGTTTCGC TTGCTTTACT TTCAGTAGCT 4 38 0 

GAACTCCCAC TTTCCTCTAC TGTAGTATTG TTTTGTTCCG ATGCTGCAGC TTCTTTTTCT 44 40 

TGTCCCATTC CAACAACGAT CATTGTTCCT AAGAATACTG AGGCCGCTCC CAATTTGTGT 4 500 

TTTCTTATGC CGTATCTAAG ATTGCTTTTC ACTATAATAT TCTCCCTTAA ATGCAAAATT 4 5 60 

CATTTATTTT TAAAACTCAA TAAATGCAAT TCTATATTGT TCGGTTTTTA AAAGCAATGA 4 620 

AAAAAAGCGA GTTAATAAAA AGTTAAGATT GTTGTTAACT TTATGTATAA TGAGTTTTTT 4 680 

AT TAT T T G AA ACTCACATAT ATATTGCATA CAAAGCTCTT GAACACCTTG ATATAACAGG 4740 

CTTGTATTTT TTATACTTAC TTTTTAAATT AAATTCAAAT TCATCTAATT TAAAACAATA 4 8 00 

TACTAAACCA TACATAATAA TCGCCTGTAC AATGCATCAT TAACAAGTCA CTGAAACGCC 4 8 60 

TTTCATTGTA TTAATAACGT CACTATAATT TTTATATCGT TCGGTTTTG TTTGATTTTA 4 920 

ATGATTATTT ATACAAAAAC AGCCGTATTT CAAGCCGACA TTTTAAATTT AACTAAATTT 4 980 

GCATCTAGTT AATAATTGCA TTTATCAAAT TTGTCTTATT GATCCAATCT AATTTGTACT 504 0 

CACAAACTAG TTTAAAATTC TAACTTTATC TCTCAGTTCG TTATCAATCA T C AG AC AT AA 5100 

ACCAATGAAG C AAT C AG AAA ACACTCTAAT TTTCTATTAG AAATTTGATT TAATATAAAA 5160 

AAACAGGCTT ACTTCATATA ATTTATGAAA TAAACCCGTC AATTTTTGTT TAATTATGCT 5220 

TTGTGATTCT TTTTATTTCT GCGTAATAAT GCTAAACCTA GAATGCTGAA TAATCCGCCG 5280 

AACAACATAC CTTTGTTTGT TGATTCTTCT CCACCTGTTT CAGGTAGTTC AGATTTCTTA 5340 

GATTGTGGTT TTTTAGTTGG TGCCACTGCT TTAACCTTTT CATTGATTTC AATAACAGGT 54 00 

GTTACTACTT TACCTTGTTC CACTGGTTTA GAAGGCTTTT TAGGTTCTTC TTTGGCAGGT 5460

GGTACTGGTT TACCAGGTTC AGCTGGTACC TCTGGTGTTG GCGGTGTTGG AETTCTGGC 5520 

TCACTCGGCA CTTCTGGTGT CGGTGGTGTT GGTGTTTCCG GCTCACTTGG TACTTCTGGT 558 0 

GTTGGTGGCG TTGGTGTTTC CGGCTCACTT GGTACTTCTG GTGTCGGTGG CGTTGGTGGC 5 64 0 

ACGATTGGAG GTGTTGTATC TTCTTCAATC GTTTGTTGAC CTTCATTTTG GCCGCTTACT 5700 

TTTGGAAGTG TATCTTCTTC AAAGTCAACA CTATTGTGTC CACCGAATTG ATAACTTGGT 57 60 

TTATCTTTAT TTGTATCTTC T T C AAT AAT T TCAGTGTGCT TATTGAATCC GTGAATATGT 5820 

GGCACACTGT CGAAGTCGAT ATCAATGATG TTACCGCCAT GTTCATACTT AGGTTTGTCT 5880 

TTTTCTGTAT CTTCCTCGAA TGACTGATTA CCTTATTTT GACCATGAAT TTGAGGTACA 5 94 0 

CTATCAAAAT CGaTATCTAC GATATTGCCA CCTTGTTCAT ATTTAGGTTT GTCTTCTTCT 6000 
GTGTCTTCCT CGAATGACTG GTTACCGCTA TTTTGGCCAC CTTCATAACC TAATTCACTC 6060

TTAATATCAA CGTGGCTATT TTCTTCGATT TCTTCAATCA CGTCATAATT CCCGTGACCA 6120 

TTTTCAGTTC CTAAACCAGA ATGAGAAATA TGATGATTGT TTTTAGTAAT TTCCTCGACT 6180 

GGTCCTTGTG CTTGACCATG CTCTTCAGGT AATTCATCCA CTAATTCAAT CAGATTACTT 62 4 0 

tCAGTTGTAT ATTCTTTCGT ATCTTCAACT GTTGTATGAT CGCTCACtGC GCCAGTTACA 6300 

ATACCTTTTG TAGACTCTTC GTCAAATTCA ACTAAGTTAG ACTCAGTAGT AACCTGACCA 6360 

CCACCTGGGT TTGTATCTTC TTCATATTCA AC AAC AT C AG CGTGATGTTT TGAATTTTCA 64 2 0 

TGTGTAGATT CTTCAAAGTC AATTGGATTT GATTCCTCAG AGGACTCAGT GTATCCTCCA 64 8 0 

ACGTGACCTG ctTCGCTATC CACAGCAGTA TGGTAATCGA 21TCAATAGC TGATGAATCC 654 0 

GTTTCTTCTA TTGTTTCAAT GTATCCATCA ACATATCCAC CTCCACCATC TATAGCTGTG 6600 

TGGTAATCAA TGTCAAGAGT TGATGAATCA TATTCCTCTT CAACAGTAGT TACTAAATTC 6660 

TTATCATATT GACCTGTAAG AGTTTCTTTA ATTGTATCTT CTTTATATTC AAATTTATTA 6720

TTTTGAATAA TCGGACCATT TTTCTCATTT CCGTTCGCTT TATTACTGTA TAAAACTAAA 67 8 0 

CCATTATCCC AAGTTAAGGT ATATCCTCTA TCATAATAAT ACT TAT AAA G TTGCTCTGGA 68 4 0 

TGTCCTACCA TTTGTGTTCT AAAATCAACT TCATCAGTAC CATTTAAATA CTCTCCATCA 6900 

TAG T GAAC AA CATAAGTTTT ATCTAGATTT TCTATATTCA ATGAATAGCT TCCATTATTT 6960 

TGTAAATTCA AATTCCCACT CAT AT T AC T T GTGACTTCTT TAAATTTAGA AGTATCTGTC 7020 

GTATTTGCAT ATACACTCTT CGCTATGTCT TCATTATTAC CCAAGTATTC AAATATCCTA 7080 

ACTTTTGGTT GATTTCCATT CTGATTACTA CCTTTCATTA AAGTTCCAE AACAGTCACA 714 0 

CTTGTCGTTT TACCATTATT AGGTTTAATA AATGCAACAT GCGAAAATCT ATTATTCGCT 72 00 

TTATTAAATG TCTCAATCGA TCCATTTAAA TTGGCATAAT AATTCCCAAT ACCATCTTTA 7 2 60 

TATTTAACAT CTAATTCCTT TGAAGTTTGT TCTTCATTTA GTGTTGAAGT TATAGTTTGA 7320 

TTTCCATTAG TTTGTACAGT TTTAGGATCA ATAAATAAAT TAATTTCTAG TTCAGCCGTT 7380 

AC AT C AAC C T TATCTTCAAT ATCATTTGTA AATGTATATC TAATCTTTCC ACCTTCTAAA 74 40 

ACTTCACCTG TCGCCATTAC GACTGAACCA TTTTTAATTT CTGGTACTTT TCTAGCAGTT 7500 

GATACGCCAT GCGTATTTAC ATTATTTGAT AAAGTAAAGT CAAAGTAGTC ACCTTGATGT 7560 

AAACCATTCT CAAATTTCAA CTTATATTTT AGTACCGCTC GTTGTCCTGC ATGAGGTTCT 7 620 

ACTTTATTTG TATTGTTATG CCCCTCAATA GAACCAATTT CTACTGTAAC TTTACTTGTT 7 680 

ACATCTGTAC CCGTTTCCAC TTTCGCGTTA CTAGCTTCCT TAGCTTCCGC TACATCGCT 77 4 0 

GATCTTGTCA CACGTGGCTT ACTTTCTGAT GCCGTTCTTG GCTGTGCCAC TTCAACTTGT 78 00 

GTTTCTGCGA CTTGATTTTG TGTAGCCTTT TTAGGTGTTA AATCTACTTG TCTTTGATCT 7 8 60 
CCGCTATTGT CTTGAGATTG TGTTGTTTCC TTAACTTGAG GTTTCGCTTC TTCCTTAACT 7920

ACCTCTTCTT TAACTGTTTC TATATTTGCT GGTTGTGCAG TTTGTGGTGC TTGTACTGCT 7 980 

TTTGGTGCTT CTTCAGTTGT TACTTGTGTT GCGTTTGACG GTTGTTCTGT TACTGTTGCG 804 0 

T TAT AT GAT T GAGTTTCTTC TATATGATTA ACGTTAGTTG CAGTTGTTTG TGTTTCACTT 8100 

GTTTTATTAT CAGTAGCTGA ATTCCCATTT TCTTCTACTG TAGTTGTCTT TTGTTCTGAT 8160 

GCTGCAGCTT CTTTGTCTTG TCCCATCCCA ACAACGATCA TTGTTCCTAA GAAT AC T GAT 8220 

GCTGCTCCCA ATTTATGTTT TCTAATGCCG TACCTAAGAT TGTTTTTCAC TATAATATCT 8280 

CCCTTTAAAT GCAAAATTCA TTAATTTTTT AAACTTAATA AATGCAAGTC TATATTGTTC 8340 

GGTTTTTAAA AGCAATACAA AAAAGCGAGT TAATGAAAAG TTAAGATTGT CATTAATTTT 8400

ATGTTAATTG ATAATTTTAT TATTTGAAAT ATACCTATAA ATTGTATTCA AGTCATCAGA 8460

AACCCTTGTC ACACAAGGCT TGTATTTTTT ATACTTATTT TTTAAATTAA ATTCATCATT 8520 

ATCTAATTTA AAACAATATA CTAAACGTTT CATAATTATC GCCTGTACAA TACGCACAAA 8580

AACATGTCTT GAAACGCCTT TCATTACTCT AAAATACCCA ATATACTTTT TATATCGTTC 8640

GGATTCTGAG TATTTCAGAC GATTTTCTGC ATAAAAATAA ACGTGTTTCA AGGCAATATA 8700 

TTGCAATTAC CTAAAAACAC GTTTACTTAA TATTTAGTTA AACAATAAG CTAATGAATA 8760

AAATGAAGAT GATACCTGAA ACGGAAATAA TCGTTTCTAA TAATGACCAT GTTAAGAATG 8820 

TTTCTTTTAC AGTTAAACCA AAATATTCTT TAAACATCCA AAATCCTGCG TCATTTACAT 8880

GAGACAAAAT CACACTACCT GCACCTATCG CAAGTACAAC TAATGCAACA TTTACATCTG 8940

ATGATTGTAA TAATGGTAAG ACAATACCTG TAGTTGAAAT CGCAGCTACT GTAGCCGAAC 9000 

CTAATGCGAT ACGTAGCACA GCTGCAACAA TCCATGCTAG TAAAATCGGA GACATCTCTG 9060

TACCTTCAAA CATTTTAGCA ATTGTATTTC CGACACCGCC GTCAATTAAT ACTTGTTTAA 9120 

ATGTACCGCC ACCGCCAATA ATCAATffiCA TCATTCCGAT TGGATAAATC GCATTCGTCA 9180

CTGATTCCAT AATATGATTC ATCTTACGCT TTCTCATTAA TCCCATCGTA ACGATTGCAA 9240

ATAATACTGC TATTAGCATG GCTGTCCCTG CTGTTCCTAT CATATAAATG ATAGATTCAA 9300 

ATAGATTTGT AGGTTTGTCA TGCCCAGTTA CAAGTTGCGT TATCGTAGAC AtEAACATTA 9360 

ATATGACTGG TAATGTTGCT GTTAATAAAC TCATACCAAA TCCTGGCATC TCTTGATCCG 9420 

TAAATTCTTT TTGTGCACCT AACGCTGAAA TATCGCCTTC TCGTGTATAC GCAGACGGAA 9480

TCATTTTTTG TGCAcTTTGT TAAATATAGG CCCTGCAATG AGTGTAACTG GaATGGCAAT 9540

AATCATACCA TACAGTAATA CATCTCCAAC ATTTGCCTTT AATTCTTTTG CGATGACTAC 9600 

CGGTCCTGGA TGTGGTGGTA AAAAGCCATG TGTCACTGAT AAAGCTGTTA CCATAGGTAG 9660

TCCTAGTTTT AACACTGAAA CATTTGCGCG TTTTGCTACT GTAAATACTA ATGGAATCAG 9720

TAAGACTAAA CCTACTTCAA AGAACAATGC AAT&CGACG ATAAATGCTG CAACAAGCAT 9780

TGCCCATTGT ACATGTTTTT GACCAAATTT TTGAATCAAC GTGTCTGCGA TTCGAGTTGC 9840

ACCACCACCA TCAGCAAGCA ATTTCCCAAG TATGGCACCT AAACCGAATA TCAGTGCAAT 9900

GTGGCCGAGC GTACTGCCCA TTCCTTTCTC AATCGTCTCC ATAATTTTAG TCAATGGTAT 9960 

ACCTAGCATT AACGCTGTAA TCATCGATGT GATAATTAAT GAAATAAATG TATTTAATTT 10020 

AAACCCAATA ATTAATACTA ATAAAATAAC GATACCTAAA ACAACACTGA TTAACGGCCA 10080

TATTTCGTTA AACATGACAT TCCCCTCTTT CTCTTTTCAA TAGAATGTAA CACCGTCGTC 10140

GCGCACGTAT TCCACATAAT ATATAGTAGT CATGGCACAA CAACTTTATC TAAGATGCTT 10200 

GAGTGACGTA TTTATTGTGT TTTATTTTCA GCGATATGTT GGCGTTGAAA ATCTGCAATT 10260

TGTTCATAAT TCTCTGTTAA AGAACGACTT AAATTGATAA AAATGGATAC GATCTCTTGG 10320

TAAACAGTGA CATTTTCTTC AATCGGCGTA TGATTGTTTG EGCACCGAC CATCGATGAA 10380 

ACGATTGAAA AATCTTCAAT GTCACCTACA GCTTTAAGTC CGAGCACGCA GGCACCTAAG 10440

CATGAACTTT CATAACTTTC AGGAACCACT AACTCTGTGT CAAATATATC TGACATCATT 10500

TGACGCCATA CTTCACTTTT CGCAAAACCA CCTGTTGCTT TTATCATCTT AGGTGTTTCA 10560

TTCATTACTT CAATAAGCGC AAGATAGACG GTATACAAAT TGTAAAGAAC ACCTTCTAAT 10620

GCAGCGCGAA TCATATGTTC TTTTTTATGA GATAAAGTTA AACCGAAGAA TGAACCTCTT 10680

GCATTTGCGT TCCAAAGCGG CGCACGTTCT CCTGCTAAAT AGGGATGGAA TATTAAACCA 10740 

TCTGCACCTG GTTTAACACG CTTTGCAATT TGAGTTAAGA CATCATAAGG ATCAACACCG 10800 

AGACGTTTCG CAGTTTCGAC TTCACTCGCT AGCAACTCGT CGCGCAACCA TCTCAATACG 10860

ACACCACCAT TATTTACAGG ACCTCCGATG ACGTAGTGGT CCTCTGTTAA GACATAACAA 10920

AATATTCTAC CTTTGTAATC AGTACGCGGT TTATCTATCA CAGTACGAffi. CGCCCCAGAT 10980

GTACCGATTG TGACAGCAAC TTCTCCTTTA CCAACACTAT TGACACCTAA ATTAGAAAGG 11040

ACCCCATCAC TCGCACCAAT AACAAACGGT GTATCTTTAT TAAGCCCCAT TAATGTTGCA 11100 

TAACGTTCTT TCATACCTTT CAtCACATAC GTTGTTGGAA CTAATTCCGG CAACATTTCC 11160 

TTGGAAATAC CCAGCAGTTC TAATGCCTCA ACATCCCAAT CTAATGTTTC TAAATTAAAC 11220 

ATCCCTGTTG CGGAAGCCAT TGAATAATCA ATGATATATG TATCAAATAA ATGATAGAAA 11280

ATGTATGTTT TAATATCTGC AAACTTAGCA GTACGTTGAA ATACATCTTG CCATTCATGT 11340

TTCATCCAAA AAATCTTCGC TAATGGCGAC ATAGGATGAA TCGGTGTGCC TGTTCGCTGG 11400 

TAAATCGCAT TGCCATCATG CACTTCATTT ATTACTGTTG CATATTTTGC AGCGCGGTTA 11460

TCTGCCCAAG TAATATTATT TGTTAATCTT TGATGTTGCT GATCCATCGC AATCAAGCTA 11520

TGCATTTGCG CACTAAATGA CACAAACTTA ATGTCGTCTT TATTAACTTT GGATTECTC 11580

ATAACATATT TAATAGTCAT TAGTACTGCA TCAAATAATT CATCTGGGTT TTCTTCTGAG 11640

ACATCAACGT TTGGTGTGTG TAAATCATAG CCTATTTGAT GTTTCATGAT AAAAGTTCCA 11700 

TTTTCATCAT ATAAGACTGA CTTGGTACTC GTCGTTCCAA TGTCGACACC AATCATATAT 11760

TTCATGATAA ATCCTTCTTT CTTTCATTTT AATTCAACCA AAATCCTTCA ATATCTTTAC 11820

CAACATCGTC GAAATTTAAA TGAAACGCTT CTTTCAAAAT TTGACTGTCG TATTGTTCCA 11880

CTGCATCAAT AAACACTTGA TGATTATGAT GTATGCGTTC AAAATCTTGC GGGTTCTGTT 11940

GCATACGTTG TCGCATGGAT GTTAAAACGA GTGCCAT&T TACTGGTTTT AAATGACTCC 12000

AAAATGAGTT TAAATATTGA TGATTAGATG CTTTGATTAA TGTTTCATGA AATTCAAAGT 12060

CATGCTTCGT AAATGATTCT GCATCCTCAA ATTTTACTGC CACTTTCATC ATTTCAAGTT 12120

GTTTCTTCAT TTCTTTTACG ATAGGTAGTC GCTCTTGATT TTTAACTCTT GAAAATGCAA 12180

ATGACTCTAA CATCAGTCGC AAATCATACA TTTCTTTCTT TTCTTGTTCC CCAAACGGCA 12240

ACACATGTGC ACCCATTCTT TCTAATTGGA TGAGTTGATT TTGTTGCAAT AATTTAAATG 12300 

CATCTCGAAT TGGCGAACGA CTCACATTAA ATTGCTTTGC CATTTGATTT TCAGTGAGTA 12360

ACGTACCTTC AGCTATGTGA CCATTCACAA TGCCTAAGCG TAATTCTGCC GCGATACCTT 12420 

CTCCAGTTGT CATACCTTCC AACCATTTCT CTGGATATCC ATACATCATC AAAGTCACTC 12480

CTTCATTACA CGACATACTT GTATACAAGT ATGTTAATAT AGTTATTATG AGTTTGCAAG 12540

CGCTTTCTTT ACGAGCACTA AAATAGTGAC CACCCCTTTT CGATTAAAT TTAAAGGAAA 12600 

TGGTCACTAT CACACGAATG ATTTAATTGT TATGTTGTAT GTGGGATATT TCTAATTGTT 12660

CTGTACTCAT ATGCGCTTTA GGTACTTCAA TGCAATAATG CGTTTCATGA CAGTTTGGAC 12720 

ATTCGAATCG ACGTGTTGTC GCTGTATGTT TCGCTTTGAT AACTGCCCAC AAAGATGGTG 12780 

AGAATATATG CTGGCAGTTA GGACATAAAT AGGCAACCTT TTGTTGGTAA TAAAAAGTAA 12840

CACCAATGCC ATAACCAATC ATAAATGGTA AAGCAATTAA AAACGGCCAT TTATTTTTCA 12900

TCAAAATTGC ACTTATAATG CTAGAATATT GAATTATTCC TATAATACCA GCACTAATCC 12960

AAATGTTACG ACGAATACTT TTCATTEAG CTGATTTACT CATGACATGC TCTATGTCTT 13020 

TTAAGTGTGT GATTGGAGAC GTCGACGCTT CATTTACGTA ATATTGAACA TTTTTAATTT 13080

TGTTTAATAC CGCTTGTTGC TGTTTAACTT GTTGGTTAAT TTCTTGTTGT TTCATAGTTA 13140

GTAAAGTATT GAGCGTCTTC AAAGTACCTT CACCTTTTAG CAACATATCT ASTCGCTTA 13200 

ACGCACAACC TAAATCTTTA AGCAATAAGA TTAACTCTAA TGTTTGTCGC TGTTGTTCTG 13260

TATACACACG ACGCTTTCCT TCTGTAAATC CTTGTGGTTT CAAAATACCT TTGCGATCAT 13320






AATATTGAAT CGTTCGTGTT GTCACATTGC ATAATTTTGC GAGTTCTCCA GTCGAATAGT 13380 

TAGACATAGA TTCCACCTCC TATAATTACC ATAGTTGATG ACCCGACGTC ACGAGCAAGT 13440 

ACAATTTCCA CATTTTAAAG AAATTTATTA TACTAGGCGT CTTATTTTTA TGATTTCGTA 13500 

CCATGTTGAT TTACAAACTC ACTCAAACTA AGTAACACAC CTACTAAACA TCTACTCTGT 13560 

TATTTCAGAA TGAATTTGTT GTAATTTATC TTCMCTTCA GTAATCTCTG TCGCACATTC 13620

TTTCAGTAAA TCTCGATACT TTTCCGTCTC TGCATTGTTT TTATAACGTA TTTTATGTTC 13680

TAAACTTGcC CACATATCCA TACCTATCGT TCTAATTTGA ATTTCAACAG GCAATACCTC 13740

AACGCGTTCT GCTAAAAAGA CTGGAATAGA TACAACGATA TGTAAACTGC GGTA 13794
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1059 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55:

GGATAAGTTC AGGTAAATTC ATTTCTTTTT CAATTTTGAT TTTCATTGTT TCCGCCCTTT 60 

TAAAATAAAG TTAGTTGCTT CTGTTCCTCA TATTCCAAAT CACTTTGCTT TATATATGTT 120 

TCAAGCTCTT CCGCTGTATC AAATGTCTTT TTCACACCTT GCCAACCTGG CACGATATGA 180

CCGTGAAAGT AATAAGTGCC ATTTACTACA TGGATATGTG CCACTCGTTC GTTATCCTGA 240

TACAGATATC TCTTAGATCC AAAGAATTGA TTTAGGTATT CTTTACGCGC GCTATCTGTC 300

ATGGTCATCA CTCCTTTTAA CAATTAGGCA GACCAAACGA CATGCATTCG TCGTATAGCT 360

CTTCATTACT TATGCTTGCC TTATAGTTTT CAATCACAT GCTAACTTCT TTATGACTCA 420

TTGCTTTAAC TTGTTCGTCT GTATATTTTT CGCAGTCTTC TAATTCCAGT TGCTCCTGTA 480

ATGACATCAC ATATTCAACT TGTCTTTGGG TTGCCATCGT TAACCCTCCC ACAAGTCAAA 540

AGCTCTTTGG ACGTAAAACT TCGCCTTTGC TAAATCCTCA TGACCATTCT TTAACGGTGC 600 

TCTAGACATG TATTTGATTG CATTACCTAT TGCGAATGCT AGTTGAGGTG GATACTGTGC 660 

CGTAACCTGT TCGATAAAAT CTATAATTTC AATGTCGCCG TATGTGTAGT GCGCTGGTTG 720 

CTTAACATTG TCTTGCGCTT CGTTCATATC TACTTTTCTG TTACTGATTA CGCTCATTAT 780

GCTTCACTCC ATTTCTTGAA CATTTGGTTA TAAGTGACAT CGAACCAGTA CGGATCACGT 840

GAATGTTTTT GTGGCGTTCC ATCATAAAGC CATGGTCTTA ATCTTCTCTT TCTTTCCTGT 900 

TCATATTCCG CTCTCACATT TCGTTGGTAT CGGTTCAAAA TCGCTTTTTT TCTGATTTTT 960 

TCTCTCCCTT TTTCTTCATC TTTnATtTGA CTCTnCATAT ATTCAETTC TTCTGTAGAT 1020

nTTGAGTCCT TTCTTCCACA CAATAATTCA nCGCCGCGC 1059



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30246 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

GAAGTAAAAG AAGAATTAAA TTTAACATTA ACAATGGATG AAATTGAATA TGTCGGGACA 60 

ATTGTAGGTC CTGCATATCC ACAACAGGAT ATGTTAACTG AGTTAAATGG ATTTCGCGCA 120 

TTAACCAAAA TCGATTGGGA AAACGTAACT ATCAATAATG AAATTACGGA TATACGCTGG 180 

ATTGATAAAG ATAATGATGC GTTGATTGCG CCTGCTGTCA AAGTTTGGAT TGAAACTTAT 240

GGTGGTAAAC ATGACAAATA ATGACACCAT CATGTTACGA CATTATGTCC CACAAGATTA 300 

TTCGATGTTA GAAGCTTTTC AATTAAGTGA AAGTGATTTG AAGTTTGTTA AAACGCCAGA 360

GGAAAATATT ACAGCTGCAA TGTCTGATAA TGAAAGGTAT CCCATCGTTG TAATGGATGG 420

CAGGCAATGT GTGGCCTTTT TTACATTACA TCGTGGAAAA GGGGTCGCAC CATTTAGCGA 480 

TAACCAAGAT GCAGTATTTT TCAGGTCATT TAGTGTTGAT CAACGTTATC GTAATAGAGG 540

AATAGGTAAA GTGGTAATGG AAAAATTGGC GTCATTTATC ACTTCAACAT TTCAGGATAT 600

TAATGAGATT GTGTTAACGG TTAATACTGA CAATCCACAT GCCATGGCAC TTTATCGCCA 660

ACAAGGATAT CAATATATGG GAGATAGTAT GTTCGTCGGA AGACCTGTTC ATATTATGGC 720 

GTTAACTATA AAATAAATTA AATTTAAAAG CATCTTTACT CATCGTCGAC CACAACAATT 780 

AATGATGAAT AAAGGTGCTT TTTGTTATAG ATCATCGGAC AATTTACTAT AGTAAAAAGC 840

GACCTAGTGA ACAATTGACA TATATCCACA GGTCGCTTAA CTTAAGTTAT ATTGCTAGTT 900 

GCGATTAATT GATAGACTCA TCATTTTTGC (ETGTCGAGA TGGTCTTTTT ATTAAAAATG 960

CCGTAATCCA AGCCGTAATC GGAATACTGA TTGCAACGGC AATACCGCCT AAAATAATAG 1020

AAATAAATTC TTGGGCAAAT ATTTTCGAGT TTATAATATG ACCAAATGAA TATTTAAGTT 1080

TGAAAAACCA AATAAATAAA GCAAGTTGGC CACCAAAAAA GGCAAGGTAA ATCGTGTCG 1140

CAGATGTCGC TAAAATTTCT CTACCAACAC GCATGCCAGA TTGGAATAAT TCGTATTGCG 1200 

TAACGTTgGA TTCACTTGAT GCAATTCATA AATGGGTGAA CTAATGGTAA TTGTTAAATC 1260

TATCACAGCT GCAATAACAG CAAGAATAAT AGTGAACACC ATAAATTGAA CCATATCAAT 1320 

GCCAATATTC ATTGAATACA CATATGTTTC ATCTTGTTGT TCGGTTGaAA AGCCTTGTAG 1380

ATGACCGAAG TAGACCGATA AATAAATGAG TGTAATCAAC AATATTGTTG TAACGATAgT 1440

GCtGgATAAA TGCaGCTTGT GTTTTAACAT TGTAACTATT GAGTACGAAT AAATTACAAG 1500 

CGCCAATAAT AATGCAGAAA AAGAATGTGA CGACATAAff CGGTACGCCA AAAATAATCA 1560

ATACAATACT AATAATTAAA ATAGCGAAAT TTAAAAATAG GGTTAAATAA GAGATGAATC 1620

CCTTTTTACC TCCGAAAATT ATCATCAGAA AGAGGAGCAA TAACGCCAAT ATAAATACAG 1680 

CATTCATTGT TTCGCCCTCC TTAATGTTTC AAATATTTCC ATAAACAATA TTGTGATAGG 1740

AATTGTAAGT ACGATACCTA TACCACCTGT TAGTGCGCGC GCAATTTCTA ACGACCAATT 1800 

CATCGAAATA GTATAAGTCA CTGTATTGGC ATTTTTTAAA AAGATTAAAA ACATAGGTAG 1860

TGCACCGGAT AAATATGAGA ATAATAAGAT GTTAGTCATT GTTCCCATAA TATCTTGGCC 1920

GATGTTTCGC CCAGCAAGCG CCCATCTCCT CATTGAAATG TGTGGCGTAC GCTGTAAAAT 1980 

TTCATGCATA CCACTAGCAA TTGTAATTGC AACATCCATA ATAGCGCCAA GTGAACCTAT 2040

TAACACTGAG GCTAGGAAGA TATCTTTCGG TGGTAATGAT AAAAAGTTCA TCGTTTCATA 2100

TTTAATGCCT TTACCATCTG TCATATATAT GATTAATTCT GTTAABCTA TACTCAAAAA 2160

AGTTCCGATA ATTGTACTGG CTATGGTAAT GAGTGTACGC ATATGCCAGC CTGTAACGAG 2220

CAATAAAGTG AGTATTGTTG AACAGATCAT GGCAATGGTC ATGAGTAAGA ATAAATTAAT 2280

ATTGCTATGT TGAATATGAA TGTAAATTGC GATTAATATG GCAATAGAAT TCAAGATTAA 2340

CGATAAAATC GATTGCAGTC CGACTTTGCG ACCAACCAAT AATACAGTTA ATAAGAACAA 2400

ACCAGTGATG ATAACCGTTA AGGTATCACG CTTCTTTTCT ATAATATAAG CATCACTCGG 2460

CTTGTTAGAA ATATGTAATA ATACTTTTTC GTGTGTGCGA AATGCCTCAG AATCTGCTTG 2520 

CGATTTGACG TACTGATGAT TAATCGTGST CGTTTCTCCA GCAAATTGAC CATTTAATAT 2580 

TTTGACTTTT AATTGATTTT TATATTTAAT ATCACGATTA TTTTGTGCAT CTTTTGTAGG 2640

TGTCGAAGAA ACATGTTTGA CATCTATAAT TTGACCAATT GGTTTGTTGT AAAAGTTCTC 2700 

ATTATTGAAT GTAAATAAAA TAGCACCAAT GAATGCGATG CAGAACAAAC CTAAATTAT 2760

ATTAAATGGC TTTGTAAATA AATTTCTATA TTTCAAAAAC AAAACCCCAA TTCTATGAAT 2820 

GAATTAATAT GGTGATTATA CGCCCTTAAT TTTTTATTTT CAAAGATATT ACTGCTAAGT 2880

GTAAAACGAA AATCATCATT GATAGCATCG AATTACTTAA TGGAATGTAG ACGTTTTAGT 2940

CATTAATTGC TGAATAAGTG TTAATAATAT GCCAATATCA CTCTTTGTAT AAGGCTCCTT 3000

TGTAATAGCA CATATCGTTC TTTTTAATTC AGTATGATCT AATTTTATAT CTATCCATGA 3060

TTTAGATTCT GGTAAATGTA TATTTTGTGA TGAAATGATG TAACCTTCTT TTTGACGAAG 3120

GAGATAcTGC GCAAGTGGTT GGCTACTGAT TGTG'ATACA TCTGATTTAG TAATCTTGCG 3180 

CAATTGTTTT TTTACAGTTT CGGCAAATGG TGCCAAGCAA TAAATATGAC TATGCTCAAA 3240

CTGAATTAAT GGTGGGTGTG TCGCCATCGT AATTGGATCG TCTGAAGGCG CATATAAATG 3300

ATAGTGCTCT TCGAATAAAG GTAGCATATG TAATTGTTTG TGTTTACGTA TTTCTGGTGT 3360 

AAGTTCCGTG AAACCAATGT CTATATTCCC ATTTAATACG CTATTTATAA TTGTGTCATG 3420

TTCTAATAAG CTCGGTATGA CATGTGTATC ATTTTGTAAA TGAAACGTTT GGATAAGTGG 3480

TAGTAACATG TGGGATACGT CACTCTCATC ATAGCCAATG TAGATACTTT TATTTTTAGT 3540

TAATCCATGG CTTTGAAATT GTTCAATCGT GCTATCTAAA TGTTCAATAA TACGCAGAGC 3600

TTCATTAAAT AATAATTTCC CTTCAGATGT GAGCGTAATA TTGCGTCCTT GCTTTTTAAA 3660 

TAAAGACACA TTAAGTTCTT GTTCTAATAA TGTAATTTGA CGGCTTATCG CTGATTGAGC 3720

AATGTTTAGT TCAAGTGCTG TTTCGGAGAT ATGTTCTCTT TAGCGACCT CGATAAAATA 3780

TCTTAATTGT TTAATTTCCA TAGCGATATA GGCACCTCCA AAAATGAGTG TTTTGTAACT 3840

ATTATAGCAA TATTATTGAT AAATGTTCTA TTTTTTAGAT GAATATCTTC TATTTTATAT 3900

ATTGAACAGA TAAATTTTTT AGATTATAGT AATTATCATT AATAACTAAT ATCAGAATAT 3960

TCTAAAAAAG GGGTGTGCAT CATGCACAAT GAGAAATTAA TTAAAGGCTT ATATGACTAT 4020

CGTGAGGAAC ATGATGCGTG TGGTATTGGT TTTTATGCGA ATATGGATAA TAAAAGGTCT 4080

CACGACATCA TTGATAAATC GCTTGAAATG TTGCGACGCT TAGATCACAG GGGCGGGGTC 4140

GGCGCAGATG GCATCACTGG TGATGGCGCA GGTATTATGA CTGAAATACC TTTTGCATTT 4200 

TTCAAACAAC ATGTAACGGA CTTTGATATC CCAGGTGAAG GTGAATATGC CGTGGGGTTA 4260

TTTTTTTCCA AAGAACGCAT TTTAGGTTCT GAACATGAAG TAGTTTTTAA AAAATATTTT 4320

GAAGGCGAAG GGTTATCAAT TCTTGGTTAT CGTAATGTAC CAGTTAATAA AGATGCCATT 4380

GCTAAACATG TAGCAGATAC GATGCCAGTC ATTCAACAAG TGTTTATTGA TATTAGGGAC 4440

ATTGAAGATG TTGAAAAGCG TTTGTTTTTA GCGAGAAAAC AATTAGAGTT CTATTCGACT 4500

CAGTGCGATT TAGAATTGTA TTTTACGAGC TTATCACGCA AAACAATTGT ATATAAAGGT 4560

TGGTTACGAT CAGACCAAAT TAAAAAACTA TATACAGATT TATCGGATGA TTTATATCAA 4620

TCAAAGCTAG GGTTAGTGCA TTCGAGATTT AGTACGAATA CATTCCCGAG TTGGAAAAGG 4680

GCACATCCTA ACCGTATGTT AATGCATAAT GGTGAGATTA ACACGATTAA AGGTAATGTA 4740

AACTGGATGC GAGCACGCCA ACATAAATTA A"CGAAACAT TATTTGGCGA GGATCAACAT 4800

AAAGTGTTTC AAATTGTCGA TGAGGATGGT AGTGACTCTG CCATTGTAGA TAATGCGCTA 4860

GAGTTCTTAT CGTTAGCCAT GGAGCCAGAA AAGGCAGCGA TGTTACTCAT ACCTGAACCT 4920

TGGTTATATA ATGAAGCGAA TGATGCAAAT GTACGTGCGT TTTATGAATT TTATACTAT 4980

TTAATGGAAC CGTGGGATGG TCCTACAATG ATTTCGTTCT GTAACGGTGA CAAACTTGGC 5040

GCGCTTACAG ATAGAAATGG ATTACGTCCA GGTCGTTATA CGATTACTAA AGATAACTTT 5100

ATTGTCTTTT CATCTGAAGT GGGTGTTGTG GACGTACCTG AAAGTAATGT TGCTTTTAAA 5160 

GGTCAATTGA ATGCTGGAAA GTTATTGCTT GTTGATTTTA AACAGAATAA AGTCATTGAA 5220 

AATAATGATT TAAAAGGTGC GATTGCTGGA GAATTACCAT ATAAAGCGTG GATTGATAAC 5280 

CATAAAGTTG ACTTTGATTT TGAAAATATA CAATATCAAG ATTCGCAATG GAAAGATGAG 5340

ACGTTATTTA AATTACAACG TCAgTTTGCA TACACGAAS AAGAGATTCA TAAGTATATT 5400

CAGGAACTTG TAGAAGGTAA GAAGGATCCT ATCGGTGCAA TGGGATATGA TGCGCCAATT 5460

GCAGTGTTGA ACGAGCGACC AGAATCACTA TTTAATTACT TTAAACAGCT GTTTGCACAA 5520 

GTTACGAATC CACCAATTGA TGCGTATCGT GAAAAAATCG TAACGAGTGA ACTTTCTTAT 5580

TTAGGTGGCG AAGGTAACTT ACTAGCACCT GACGAAACGG TTTTAGATCG TATTCAATTG 5640

AAAAGGCCGG TATTGAATGA ATCACACTTA GCAGCGATTG ATCAGGAACA TTTTAAATTA 5700

ACTTATTTAT CAACGGTATA TGAAGGGGAT TTGGAAGATG CGTTAGAAGC ATTAGGCCGA 5760

GAAGCAGTGA ATGCTGTAAA GCAAGGCGCT CAAATTCTAG TGTTAGATGA TAGTGGATTA 5820

GTTGATAGCA ATGGCTTTGC AATGCCGATG TTACTCGCAA TAAGTCATGT GCATCAATTA 5880 

CTTATTAAAG CAGATTTACG TATGTCTACA AGTTTAGTCG CTAAATCTGG TGAGACACGA 5940

GAAGTGCATC ATGTTGCTTG TTTACTCGCA TATGGCGCGA ATGCASTGT GCCATACCTA 6000 

GCGCAACGTA CAGTTGAACA ACTGACATTG ACAGAAGGGT TACAAGGCAC CGTTGTCGAT 6060

AATGTTAAGA CATATACGGA TGTATTGTCA GAAGGTGTCA TTAAAGTAAT GGCTAAGATG 6120 

GGAATTTCGA CAGTGCAAAG TTATCAAGGG GCACAAATAT TTGAAGCGAT TGGCTTGTCT 6180

CATGATGTGA TTGATCGTTA TTTTACTGGG ACACAGTCTA AGTTATCTGG TATTTCGATT 6240

GATCAAATTG ATGCTGAAAA TAAAGCACGT CAACAAAGTG ATGATAATTA TCTTGCATCA 6300 

GGTAGTACAT TCCAATGGAG ACAACAAGGT CAACATCATG CTTTTAATCC GGAATCTATT 6360

TTCTTATTGC AGCACGCATG TAAAGAA^T GACTATGCGC AATTTAAAGC ATACTCTGAA 6420 

GCGGTGAACA AAAATAGAAC AGATCACATT AGACATTTAC TTGAATTTAA AGCATGTACA 6480

CCGATTGACA TCGACCAAGT TGAACCGGTA AGTGACATTG TCAAACGCTT TAATACAGGG 6540

GCGATGAGTT ATGGATCGAT TTCAGCGGAA GCACATGAAA CGTTAGCACA AGCATGAAC 6600

CAATTAGGTG GAAAGAGTAA TAGTGGTGAA GGTGGCGAAG ATGCAAAACG TTATGAAGTA 6660

CAAGTTGATG GAAGCAACAA AGTAAGTGCG ATTAAACAAG TTGCTTCTGG GCGTTTTGGT 6720 

GTAACTAGTG ATTATTTACA ACATGCCAAA GAAATTCAAA TTAAAGTTGC GCAAGGTGCA 6780

AAGCCTGGTG AAGGTGGTCA ATTACCTGGT ACTAAGGTAT ATCCGTGGAT TGCGAAGACA 6840

AGAGGGTCAA CGCCAGGTAT CGGTCTGATT TCACCACCGC CACATCATGA TATTTATTCA 6900

ATAGAAGATT TAGCGCAACT GATACATGAT TTGAAAAATG CGAATAAAGA TGCAGATATC 6960

GCGGTAAAAT TAGTTTCGAA AACAGGTGTT GGTACATTG CATCTGGGGT GGCAAAAGCA 7020

TTTGCAGATA AAATTGTCAT CAGTGGTTAC GATGGTGGTA CAGGGGCTTC ACCCAAAACG 7080 

AGTATTCAGC ATGCCGGTGT TCCTTGGGAG ATTGGTTTAG CAGAAACACA TCAAACATTA 7140

AAACTAAATG ACTTAAGAAG TCGTGTTAAG TTAGAAACAG ACGGTAAGTT ATTAACTGGT 7200

AAAGATGTAG CGTACGCATG TGCGCTTGGA GCGGAAGAAT TTGGATTTGC AACTGCACCA 7260

TTAGTGGTGT TGGGCTGTAT TATGATGCGT GTATGCCATA AAGATACATG TCCAGTAGGA 7320

GTTGCAACTC AAAACAAAGA TTTACGTGCT TTATATAGAG GTAAAGCACA TCATGTTGTT 7380 

AATTTTATGC ATTTTATTGC ACAAGAATTA AGAGAAATTT TAGCATCTTT AGGTTTGAAA 7440

CGTGTAGAAG ACTTAGTTGG AAGAACTGAT TTATTACAAC GATCATCAAC ATTAAAAGCG 7500

AATAGCAAAG CGGCTAGTAT TGATGTTGAA AAACTGTTAT GTCCTTTCGA TGGGCCAAAC 7560

ACAAAAGAAA TTCAACAAAA TCATAATCTT GAGCATGGAT TEATTTAAC AAATTTATAT 7620

GAAGTAACGA AGCCATATAT TGCTGAAGGG CGTCGCTATA CAGGTAGCTT TACAGTAAAT 7680

AATGAACAAC GTGATGTAGG GGTTATTACA GGTAGTGAGA TTTCGAAACA ATATGGAGAA 7740

GCAGGACTTC CTGAAAATAC AATTAATGTT TATACGAATG GTCATGCTGG TCAAAGTCTT 7800 

GCAGCATATG CACCGAAAGG CTTAATGATT CATCATACTG GAGATGCGAA TGACTATGTT 7860

GGTAAAGGAT TATCTGGTGG TACGGTCATT GTCAAAGCAC CTTTTGAAGA ACGACAAAAT 7920

GAAATTATTG CTGGTAACGT CTCATTCTAT GGTGCGACAA GTGGTAAGGC ATTTATTAAC 7980

GGTAGTGCAG GAGAAAGATT CTGTATTAGA AATAGTGGTG TAGATGTTGT CGTTGAAGGT 8040

ATCGGCGACC ATGGATTAGA GTATATGACT GGTGGACATG TCATTAATTT AGGTGATGTA 8100 

GGTAAGAACT TCGGTCAAGG TATGAGTGGT GGTATTGCTT ACGTTATCCC GTCTGATGTA 8160 

GAAGCTTTTG TTGAAAATAA TCAACTAGAT ACGCTTTCGT TTACAAAGAT TAAACACCAA 8220

GAAGAAAAAG CATTCATTAA GCAAATGCTG GAAGAACATG TGTCACACAC GAATAGTACG 8280

AGAGCGATTC ATGTGTTAAA ACATTTTGAT CGCATTGAAG ATGTCGTCGT TAAAGTTATT 8340

CCTAAAGATT ATCAATTAAT GATGCAAAAA ATTCATTTGC ACAAATCATT ACATGACAAT 8400

GAAGATGAAG CGATGTTAGC TGCATTTTAC GATGACAGTA AAACAATCGA TGCTAAACAT 8460

AAACCAGCCG TTGTGTATTA AGGAAAGGGG GAGATACGAT GGGTGAATTT AAAGGATTTA 8520

TGAAGTATGA CAAACAGTAC TTAGGTGAAT TATCACTGGT AGACCGTTTG AAGCATCATA 8580

AAGCATATCA ACAACGATTT ACTAAAGAAG 7TGCCTCTAT CCAAGGTGCA CGATGTATGG 8640

ATTGTGGAAC GCCGTTTTGT CAAACCGGAC AACAGTATGG TAGGGAAACA ATAGGTTGTC 8700

CAATTGGAAA CTACATTCCT GAATGGAACG ACTTAGTGTA TCATCAAGAT TTTAAAACTG 8760

CTTATGAACG CTTAAGCGAA ACAAATAACT TTCCTGACTT TACAGGGCGT GTATGTCTG 8820 

CACCATGCGA AAGTGCTTGT GTGATGAAGA TTAATAGAGA ATCGATTGCG ATTAAAGGTA 8880 

TTGAACGCAC AATTATTGAT GAAGCTTTTG AAAATGGTTG GGTAGCGCCG AAAGTTCCGA 8940

GTCGCCGTAG AGATGAAAAA GTGGCAATCG TTGGAAGCGG TCCAGCAGGA TTAGCTGCTG 9000 

CTGAAGAACT TAATCTACTA GGATATCAAG TAACTATTTA TGAACGTGCT AGAGAATCAG 9060

GCGGTTTATT AATGTATGGT ATTCCGAATA TGAAACTTGA TAAAGATGTG GTTCGACGTC 9120 

GTATTAAGTT AATGGAAGAA GCGGGCATTA CTTTCATTAA TGGTGTTGAA GTCGGTGTTG 9180 

ATATTGATAA AGCAACGTTA GAATCTGAGT ATGATGCCA1 TATATTATGT ACTGGTGCAC 9240

AAAAAGGTAG AGATTTACCT TTAGAAGGAC GCATGGGTGA TGGTATACAT TTCGCTATGG 9300

ATTATTTAAC TGAACAAACG CAGTTGTTAA ATGGAGAAAT TGATGATATA ACAATAACTG 9360

CAAAAGATAA GAATGTCATT ATCATTGGTG CTGGTGATAC AGGGGCAGAC TGTGTAGCGA 9420

CAGCATTAAG AGAAAATTGT AAATCGATTG TTCAATTTAA TAAATATACG AAATTGCCAG 9480

AAGCAATTAC ATTTACAGAA AATGCATCAT GGCCTTTAGC AATGCCGGTG TTTAAAATGG 9540

ACTATGCGCA CCAAGAGTAC GAAGCTAAGT TTGGTAAGGA ACCACGTGCA TATGGTGTTC 9600 

AAACAATGCG TTACGATGTT GACGATAAAG GACACATACG TGGTTTGTAT ACTCAAATTT 9660 

TAGAGCAAGG CGAAAATGGT ATGGTCATGA AAGAAGGACC TGAAAGATTT TGGCCTGCTG 9720 

ACCTTGTATT ATTATCAATC GGCTTCGAAG GTACAGAACC AACAGTACCG AATGCTTTTA 9780

ACATTAAAAC GGATAGAAAT CGAATCGTGG CGGATGATAC AAACTffiCAA ACTAATAATG 9840

AAAAGGTATT TGCTGCTGGA GATGCTAGAC GTGGTCAAAG TTTAGTTGTA TGGGCAATTA 9900

AAGAAGGTAG AGGCGTAGCG AAAGCAGTAG ATCAGTATTT AGCTAGTAAA GTTTGTGTAT 9960 

AATCTTTGTA TGGAAATGGT GGTTACGTTG ACGTTGTGAC ATGCTGAATC GAGTTTGAAA 10020 

AAATCTAGTA TCTATCAACG TCACATGCCA TCTTTGTAAC CTAAAAACAA AGGTTTGTAA 10080 

GACAACAAAT AGATTAATTA TAAGTAGTGA TTTTTTACAT TCGTTTATAG GTCAACTGTA 10140

GTGGAAGACA ATGATTTGTG GTAATCATGT AATGCTTAAA AACAATATTG ACTTTTACAG 10200 

AACGTTCATA TATGATAAAT ATTGTGTTTA GGAGGAATAC CCAAGTCCGG CTGAAGGGAT 10260

CGGTCTTGAA AACCGACAGG GGCTTAACGG CTCGCGGGGG TTCGAATCCC TCTTCCTCCG 10320 

CCATCAATAT TTATATTAAA TTCTATATAT AATGAAGGTA AGTGCTCAAA TTTTGAGTAT 10380

TTACCTTTTT TATTTGTCTT TGAATGGCTC GTAATTTTTG ATAATAGAAA TGAAAGGCA 10440 

TTGAGATTGG AAGGGCATTT GGCTTGTGCA ATATACATAG CTAAATGTCT TTTTTGTTTT 10500 

GTGAAATATG ATGGATGGCT TGTGTGGACA AGTTTGCTAT TTATAGATAT GCATTTTTCA 10560






ATTTAGGAGT TGGCCATGCA TCTACACTTT ATAATGGTGA GAGCGTGGTG AGGTATTGTT 10620

AATAACGCAA TTGTAGCGAG GAGTTATTGC TACATATGTC GTTATGGCTC ATTGATTTTC 10680

TGAAATGGCT ACCCCAGATA ATTGTGACAA AATAAAAATA TTTTGTTGAA AGCCTTTACA 10740 

TAACTTGTCT AGACAAGTTA TACTCGTTTT AAGACATTAA GGGAGTGAAA TATATGGCTG 10800

TAAAAAGAGA AGATGTAAAA GCCATCGTAA CCGCATTGG GGGAAAAGAA AATCTTGAAG 10860

CTGCAACGCA TTGTGTAACA CGATTACGTT TAGTGCTGAA GGATGAAAGT AAAGTTGATA 10920

AAGACGCATT AAGTAATAAC GCGTTGGTCA AGGGGCAGTT TAAAGCAGAC CATCAATATC 10980

AAATTGTCAT TGGTCCAGGA ACAGTCGATG AAGTGTATAA GCAGTTTATT GATGAAACAG 11040 

GTGCTCAAGA AGCTTCGAAA GATGAAGCGA AACAAGCAGC TGCACAAAAA GGGAATCCAG 11100

TACAACGTTT GATCAAATTG TtGGGGGATA TTTTTATACC AATATTACCT GCGATTGTGA 11160 

CAGCTGGTTT GTTAATGGGA ATCAATAATT TACTTACAAT GAAAGGTTTA TTTGGTCCAA 11220

AAGCACTTAT TGAGATGTAT CCACAAATTG CTGATATTTC AAACATCATT AATGTGATTG 11280 

CGAGTACGGC ATTTATTTTC TTACCAGCAT TAATTGGTTG GAGTAGTATG CGTGTATTTG 11340 

GTGGTAGTCC GATTCTAGGC ATAGTCTTAG GTTTGATTTT AATGCATCCG CAATTAGTAT 11400

CTCAGTATGA TTTGGCAAAA GGGAATATTC CGACGTGGAA CTATTTGGC TTAGAGATTA 11460

AGCAGTTGAA TTACCAAGGT CAAGTGTTGC CAGTtTTAAT TGCAGCTTAC GTTCTAGCTA 11520 

AAATTGAAAA AGGATTAAAT AAAGTCGTTC ACGATTCGAT AAAAATGTTG GTCGTTGGAC 11580

CCGTAGCGCT TTTAGTTACT GGATTTTTAG CATTTATTAT CATTGGACCA GTTGCGTTAT 11640 

TGaTTGGTAC AGGTATTACA TCTGGTGTTA CATTTATATT CCAACATGCA GGATGGCTTG 11700

GCGGAGCAAT ATATGGATTG TTATATGCAC CACTTGTAAT TACAGGACTA CACCATATGT 11760 

TTTTAGCAGT AGATTTCCAA TTGATGGGTA GCAGCTTAGG CGGTACGTAT TTATGGCCAA 11820 

TTGTTGCGAT TTCCAATATT TGTG^GGGCT CTGCAGCATT TGGAGCATGG TTTGTCTATA 11880

AACGTCGTAA AATGGTTAAA GAAGAAGGCT TGGCATTAAC ATCTTGTATT TCTGGTATGT 11940 

TAGGTGTTAC TGAACCAGCC ATGTTCGGTG TGAACTTACC TCTGAAATAT CCATTTATCG 12000

CTGCGATATC AACGTCTTGT GTATTGGGGG CAATCGTTGG TATGAATAAC GTACTTGGAA 12060

AAGTTGGTGT TGGTGGCGTG CCAGCATTCA TTTCAATTCA AAAAGAATTT TGGCCAGTAT 12120 

ATCTTATTGT GACAGCTATT GCTATTGTTG TACCATGTAT ACTAACAATT GTGATGTCTC 12180

ATTTTAGTAA ACAAAAAGCG AAAGAAATTG TTGAAGATTA ATAAAATAAA AAAGGGGCGT 12240

TCGTTATTTG GACGTCCTTT ATTACGTTAT AAGGTGGTAA TTGTGTGTCG AAAGAAATAG 12300 

ATTGGAGAAA ATCCGTTGTA TATCAAATTT ATCCTAAGTC GTTTAATGAT ACGACGGGGA 12360

ATGGTATAGG AGATATCAAT GGAATTATAG AAAAATTGGA TTATATCAAG TTATTGGGTG 12420

TTGATTATAT TTGGTTAACA CCAGTGTATG ffi.TCACCGAT GAATGATAAT GGCTATGATA 12480

TCAGCAATTA TTTAGAAATC aATGAAGACT TTGGAACGAT GGATGATTTT GaAAAGTTAA 12540 

TCAAAGTTGC GCATCAAAAA GACTTGAAAG TGATGTTAGA TATTGTCATT AATCaTACGT 12600

CGACGGAGCA TGaATGGTTT AAAGAAGCCC GTAAATCTAA AGATAACCCy TATAGASTT 12660

ATTACTTTTT CAGATCATCT GAAGACGGGC CGCCAACAAA TTGGCATTCT AAATTCGGTG 12720 

GTAATGCATG GAAGTATGAT TCTGAGACAG ATGAATATTA TTTACATTTA TTTGATGTCA 12780 

GTCAAGCTGA TTTAAATTGG GATAATCCGG AAGTACGTCA ATCGTTATAT CGCATAGTCA 12840

ATCATTGGAT AGACTTCGGC GTTGATGGTT TTCGATTTGA TGTCATTAAC TTAATTTCTA 12900

AAGGTGAATT TAAGGACTCT GACAAAATAG GTAAAGAATT TTATACGGAT GGTCCTAGAG 12960

TGCATGAGTT TCTGCATGAA TTAAATCGTC AAACGTTTGG TAACACTGAC ATGATGACTA 13020

TAGGAGAAAT GTCTTCGACG ACGATTGAAA ATTGTATTA GTATACACAA CCAGAACGCC 13080

AAGAATTGAA TAGTGTTTTT AATTTTCATC ATCTAAAGGT TGATTATGTT GATGGTGAAA 13140

AGTGGACAAA TGCGAgcTTG nATTTTCATA AGTTAAAGGA AATTCTGATG CAATGGCAAC 13200

GAGGTATTTA TGACGGTGGC GGATGGAACG CGATTTTCTG GTGTAATCAT GATCAGCCAC 13260

GGGTAGTGTC TAGATTTGGT GATGATACGT CGGAAGAGAT GAGGATACAA AGTGCTAAAA 13320 

TGTTAGCTAT CGCACTGCAT ATGTTGCAAG GGACGCCATA TATTTACCAA GGTGAAGAAA 13380

TTGGTATGAC GGACCCACAT TTTACATCAA TAGCACAATA TCGTGATGTT GAATCGATTA 13440

ATGCCTACCA TCAGTTGTTA AGTGAAGGGC ATGCTGAAGC GGATGTGTTA GCGATTTTAG 13500 

GACAGAAGTC ACGAGACAAT TCGAGAACGC CTATGCAATG GAGTGATGAT GTTAATGCTG 13560

GATTTACAGC TGGTAAnCCT TGGATTGATA TTTCGGAAAA TTATCATCAG GTCAACGTTA 13620

GACAAGCACT TCAGAATAAA GAGTCTATTT TCTATACGTA TCAAAATTA ATACAATTAA 13680

GACATACGCA TGATATTATT ACGTATGGAG ACATTGTGCC ACGTTTTATG GATCATGATC 13740

ATTTATTTGT TTATGAACGT CATTATAAGA ATCAACAATG GCTAGTAATT GCGAATTTCT 13800 

CAGCATCGGC TGTTGATTTG CCAGAAGGAT TGGCTAGAGA AGGTTGTGTT GTGATTCAAA 13860

CAGGCACAGT GGAAAATAAT ACGATAAGCG GGTTTGGTGC AATTGTAATC GAAACAAACG 13920 

CGTAAAATAA ATTGAGTGGA TGCGTTTATA TGGCGAAACA AAAAAAGTTT ATGAAGATTT 13980 

ATGAGGCGTT GAAAGAAGAT ATATTAAACG GGCAGATTCA ATATGGTGAA CAAATTCCGT 14040

CTGAACATGA TTTGGTGCAA TTGTACCA^T CATCTCGAGA GACCGTGCGT AAGGCATTAG 14100 

ATTTGTTGGC ATTAGACGGC ATGATTCAAA AGATTCATGG TAAAGGGTCA CTTGTCATTT 14160 

ATCAGGAGGT TACAGAGTTT CCATTTTCTG AACTTGTTAG TTTTAAAGAA ATGCAAGAAG 14220

AAATGGGCGT CGCATATTTA ACTGAAGTTG TTGTGAATGA GGTTGTTGAA GCGATGAAG 14280

TTCCAGAAGT TCAACATGCT TTAAACATCA ATTCTAGTGA ATCACTCATT CATATTGTTA 14340

GAACTCGTCG GCTTAACCAA CATGTGAAGA TTGTTGATGA AGATTATTTT CTAAAGTCGA 14400

TTGTTTCAGA TATAGGTAAT GATGTTGCGA GTGATTCTAT TTATGATTAT TTGGAAAAGG 14460

TATTAAATCT TAATATTAGT TATTCAAGTA AGTCTATTAC TTTTGAACCG TTTGATGAAC 14520

AAGCATATCA ATTGTTTGGT GATGTATCGG TGGCTTATTC AGCAACAGTT CGAAGTATTG 14580 

TGTATTTAGA AAATACAATG CCGTTTCAAT ATAATATTTC AAAACATCTT GCAAATGAAT 14640

TTAAATTTAA TGACTTCTCA AGACGTCGTA TAAACTAAAC AATGATATAA ATGATTTATA 14700

CTTGCAATTA ACTATTAAAA TATAGTAATA TATATCTTGC CGTGCTAGGT GGGGAGGTAG 14760

CGGTTCCCTG TACTCGAAAT CCGCTTTATG CGAGGCTTAA TTCCTTTGTT GAGGCCGTAT 14820 

TTTTGCGAAG TCTGCCCAAA GCACGTAGTG TTTGAAGATT TCGGTCCTAT GCAATATGAA 14880

CCCATGAACC ATGTCAGGTC CTGACGGAAG CAGCATTAAG TGGATCATCA TATGTGCCGT 14940 

AGGgTAGCCG AGATTTAGCT AACGACTTTG GTTACGTTCG TGAATTACGT TCGATGCTTA 15000 

GGTGCACGGT TTTTTATTTT TTAAATATTA AACCGATTAT TAAGAGTTGA AAATATATAA 15060 

TTATAGAAGC TACTTTCTTG AAGACAATTC AGCGTATTAT ACGTGGAACA TGTTTGTGGG 15120

AAGTAGCTTT TTTATATGTG AAGTTTGATT CAAGTGAACT CGATGTGCAG TTTGAATGAT 15180 

TTTTGTGTCA ATGAAAAGTA AGAAGTTATA ATTTGATGAT AAAGAAATGA TGGTGAAATG 15240

AGGGGGAGTA TCTTACAATA GAATTATTAA TGAGATACGT TJKGATTATT GACAATCAAA 15300

TGCCTACGGA GGACATATGC AAATATATTT AAGTACTTTA ACAGAGTTAG ATTATGATAA 15360 

ATCTTTAAAT AGTATTGAAG AAAGTTTTGA TGATAATCCT GAAACGAGTT GGCAAGCACG 15420

TGCGAAAGTA AAACATTTAA GAAAATCTCC TTGCTATAAT TTTGAATTAG AAGTAATAGC 15480

GAAAAATGAA AATAACGATG TCGTTGGACA CGTTTTATTA ATTGAAGTAG AAATTAATAG 15540

TGATGATAAG ACGTATTATG GTTTGGCGAT TGCCTCTTTA TCAGTTCATC CTGAATTACG 15600

TGGACAAAAA TTAGGTCGTG GCTTGGTTCA AGCAGTAGAA GAGCGTGCCA AAGCACAAGA 15660 

GTATAGTACG GTTGTTGTAG ACCADTGTTT TGACTACTTT GAAAAGTTGG GTTATCAAAA 15720 

TGCTGCTGAG CATGACATTA AATTAGAATC TGGTGATGCA CCGTTACTTG TAAAATATTT 15780 

ATGGGATAAT TTGACGGATG CACCACACGG AATCGTAAAA TTTCCAGAAC ATTTTTATTA 15840

ATTGTTCAAT TAAGAAGTAA AGGTATTATC ATGCTATAAT GAGAGGTAAT TGTTTATGGA 15900

GGTGCTAACT TGAATTATCA AGCCTTATAT CGTATGTACA GACCCCAAAG TTTCGAGGAT 15960 

GTCGTCGGAC AAGAACATGT CACGAAGACA TTGCGCAATG CGATTTCGAA AGAAAAACAG 16020

TCGCATGCTT ATATTTTTAG TGGTCCGAGA GGTACGGGGA AAACGAGTAT TGCCAAAGTG 16080

TTTGcTAAAG CAATCAACTG TCTAAATAGC ACTGATGGAG AACCTTGTAA TGAATGTCAT 16140

ATTTGTAAAG GCATTACGCA GGGGACTAAT TCAGATGTGA TAGAAATTGA TGCTGCTAGT 16200 

AATAATGGCG TTGATGAAAT AAGAAATATT AGAGACAAAG TTAAATATGC ACCAAGTGAA 16260

TCGAAATATA AAGTTTATAT TATAGATGAG GTGCACATGC TAACAACAGG TGCTTTTAAT 16320

GCCCTTTTAA AGACGTTAGA AGAACCTCCA GCACACGCTA TTTTTATATT GGCAACGACA 16380 

GAACCACATA AAATCCCTCC AACAATCATT TCTAGGGCAC AACGTTTTGA TTTTAAAGCA 16440 

ATTAGCCTAG ATCAAATTGT TGAACGTTTA AAATTTGTAG CAGATGCACA ACAAATGAA 16500

TGTGAAGATG AAGCCTTGGC ATTTAtcgCT AAAGCGTCTG AAGGGGGTAT GCGTGATGCA 16560

TTAAGTATTA TGGATCAGGC TATTGCATTT GGTGATGGTA CGTTAACATT GCAAGATGCG 16620 

TTGAATGTCA CAGGTAGCGT ACATGATGAA GCGTTGGATC ACTTGTTTGA TGATATTGTA 16680

CAAGGTGACG TACAAGCATC TTTTAAAAAA TACCATCAGT TTATAACAGA AGGTAAAGAA 16740

GTGAATCGCC TAATAAATGa TATGATTTAT TTTGTCaGAG ATACGATTAT GAATAAAACA 16800

TCTGAGAAAG ATACTGAGTA TCGAGCACTG ATGAACTTAG AATTAGATAT GTTATATCAA 16860

ATGATTGATC TTATTAATGA TACATTAGTG TCGATTCGT TTAGTGTGAA TCAAAACGTT 16920

CATTTTGAAG TGTTGTTAGT AAAATTAGCT GAGCAGATTA AGGGTCAACC ACAAGTGATT 16980

GCGAATGTAG CTGAACCAGC ACAAATTGCT TCATCGCCAA ACACAGATGT ATTGTTGCAA 17040

CGTATGGAAC AGTTAGAGCA AGAACTAAAA ACACTAAAAG CACAAGGAGT GAGTGTCGCT 17100

CCTGTTCAAA AATCTTCGAA AAAGCCTGCG AGAGGCATAC AAAAATCTAA AAATGCATTT 17160 

TCAATGCAAC AAATTGCAAA AGTGCTAGAT AAAGCGAATA AGGCAGATAT CAAATTGTTG 17220

AAAGATCATT GGCAAGAAGT GATTGATCAT GCCAAAAATA ATGATAAAAA ATCACTCGTT 17280

AGTTTATTGC AAAATTCGGA ACCTGTGGCG GCAAGTGAAG ATCACGTACT TGTGAAATTT 17340 

GAGGAAGAGA TCCATTGTGA AATCGTCAAT AAAGACGACG AGAAACGTAG TAGTATAGAA 17400

AGTGTTGTAT GTAATATCGT TAATAAAAAC GTTAAAGTTG TTGGTGTACC ATCAGATCAA 17460

TGGCAAAGAG TTCGAACGGA ATATTTACAA AATCGTAAAA ACGAAGCGA TGATATGCCA 17520

AAGCAACAAG CACAACAAAC AGATATTGCT CAAAAAGCAA AAGATCTTTT CGGTGAAGAA 17580

ACTGTACATG TGATAGATGA AGAGTGATAC ATGACAAGCG ATATAATCGT ATGTATAATG 17640

AAAGAAACAT CATTTTATTG ATAAATATTT ATTGATTTTC AAGGAGGAAA TGGAATATGC 17700

GCGGTGGCGG AAACATGCAA CAAATGATGA AACAAATGCA AAAAATGCAA AAGAAAATGG 17760

CTCAAGAACA AGAAAAACTT AAAGAAGAGC GTATTGTAGG AACAGCTGGC GGTGGCATGG 17820

TTGCAGTTAC TGTAACTGGT CATAAAGAAG TTGTCGACGT TGAAATCAAA GAAGAAGCTG 17880

TAGACCCAGA CGATATTGAA ATGCTACJ&G ACTTAGTGTT AGCAGCTACT AATGAAGCGA 17940

TGAATAAAGC TGATGAGCTT ACTCAAGAAC GTTTAGGTAA ACATACTCAA GGCTTAAACA 18000

TCCCTGGaAT GTGATCATAG ATGCATTATC CAGAACCTAT ATCAAAACTT ATTGATAGCT 18060

TTATGAAATT GCCAGGCATT GGTCCAAAGA CAGCCCAACG TCTGGCTTTT CAffiCCTTAG 18120 

ATATGAAAGA AGACGATGTT GTTCAGTTTG CCAAAGCATT AGTAGATGTT AAGAGAGAAT 18180

TAACATATTG TAGCGTATGT GGTCACATTA CTGAAAATGA TCCATGTTAT ATTTGTGAAG 18240

ATAAGCAAAG AGATCGTTCA GTTATTTGTG TTGTGGAAGA TGACAAAGAT GTCATAGCTA 18300

TGGAAAAAAT GAGAGAATAC AAAGGTTTAT ATCACGTTTT ACATGGGTCT ATTTCGCCTA 18360

TGGATGGCAT TGGACCAGAA GATATTAATA TTCCTTCATT GATTGAACGC TTGAAAAACG 18420

ATGAAGTTAG CGAATTAATC TTAGCTATGA ACCCGAACTT AGAGGGGGAA TCTACAGCCA 18480

TGTATATTTC TAGATTAGTT AAGCCTATAG GTATCAAAGT GACGAGATTA GCACAAGGGT 18540

TATCGGTAGG TGGCGATTTA GAGTATGCTG ACGAAGTAAC ATTATCTAAA GCAATCGCAG 18600

GTAGAACAGA AATGTAATkT CTTCTATTAA ACATTTTTGA TTTTAATACT ATAGTAAGAA 18660

AAGTCACAGT GTAATCATTG TGGCTTTTTT TATGGTGTGG TGTGATGTAC TACTTTATTT 18720

GCGGTGTGGC GGTGGTATGG TTTACCTAGT TTTACTGAGG GATGGGTAAT CTTTAGGAAG 18780 

CAAGCCGTTG GTTGTGATTT GTTACTTCTA ATAGTAATGA TGTGAATTGG ATTATCGAAT 18840

TAGATCTATG GTTATGGTGT GTTGGTGCTA TTAATTTGAT AAATGCGGTT AATGACTATG 18900

CAAATGAAAT TCTTTTGTAA TTGAAATGAT AGATGCTGGC TTAGTAAGTT GTACTTCTTT 18960

GGTCTAAAGC TTATTAAATC AGCCTGTATA GCGGTGTTTT GAGAGATTAT TTAAAACTTG 19020

TAAATTTATT TTTAATTTCT GGTAAAAAAA TAACGTTCTG TTTTGCGTTT TTTTTGATTG 19080 

ATATGGTTAG AGAAAAATCT GTTTCTTGTT CTAAAAAACG TJ5TATTTAT AAGTGGGGAT 19140 

TTTTTAAGTT CGATTTTTAG GATAAGGGCG TTCAGTACAG ATGACAAAGG TGTAATTTTT 19200

ACTGTTGTTA AGCAGTTTGA AAGCCTGTAT AGTATTTATT TGTTGAGGCA AACAAAACAA 19260

CTCAACTTAA GAAATAACTT GAATTACTAA CGAAAATTAA TTTTAAAAAG TTATTGACTT 19320 

AAATGTTAAT AAAATGTATA ATTAATTCTT GTCGGTAAGA AAAATGAACA TTGAAAACTG 19380

AATGACAATA TGTCAACGTT AATTCCAAAA AACGTAACTA TAAGTTACAA ACATTATTTA 19440 

GTATTTATGA GCTAATCAAA CATCATAATT TTTATGGAGA GTTTGATCCT GGCTCAGGAT 19500 

GAACGCTGGC GGCGTGCCTA ATACATGCAA GTCGAGCGAA CGGACGAGAA GCTTGCTTCT 19560

CTGATGTTAG CGGCGGACGG GTGAGTAACA CGTGGATAAC CTACCTATAA GACTGGGATA 19620

ACTTCGGGAA ACCGkAGCTA ATACCGGATA ATATTTTGAA CCGCATGGTT CAAAAGTGAA 19680

AGACGGTCTT GCTGTCACTT ATAGATGGAT CCGCGCTGCA TTAGCTAGTT GGTAAGGTAA 19740

CGGCTTACCA AGGCAACGAT GCATAGCCGA CCTGAGAGGG TGATCGGCCA CACTGGAACT 19800

GAGACACGGT CCAGACTCCT ACGGGAGGCA GCAGTAGGGA ATCTTCCGCA ATGGGCGAAA 19860

gCtGaCGGAG CAACGCCGCG TGAGTGATGA AGGTCTTCGG ATCGTAAAAC TCTGTTATTA 19920

GGGAAGAACA TATGTGTAAG TAACTGTGCA CATCTTGACG GTACCTAATC AGAAAGCCAC 19980

GGCTAACTAC GTGCCAGCAG CCGCGGTAAT ACGTAGGTGG CAAGCGTTAT CCGGAATTAT 20040

TGGGCGTAAA GCGCGCGTAG GCGGTTTTTT AAGTCTGATG TGAAAGCCCA CGGCTCAACC 20100 

GTGGAGGGTC ATTGGAAACT GGAAAACTTG iCTGCAGAAG AGGAAAGTGG AATTCCATGT 20160 

GTAGCGGTGA AATGCGCAGA GATATGGAGG AACACCAGTG GCGAAGGCGA CTTTCTGGTC 20220

TGTAACTGAC GCTGATGTGC GAAAgCGTGG GGATCAAACA GGATTAGATA CCCTGGTAGT 20280 

CCACGCCGTA AACGATGAGT GCTAAGTGTT AGGGGGTTTC CGCCCCTTAG TGCTGOSCT 20340 

AACGCATTAA GCACTCCGCC TGGGGAGTAC GACCGCAAGt TGAAACTCAA AGGAATTGAC 20400

GGGGACCCGC ACAAGCGGTG GAGCATGTGG TTTAATTCGA AGCAACGCGA AGAACCTTAC 20460

CAAATCTTGA CATCCTTTGA CAACTCTAGA GATAGAGCCT TCCCCTTCGG GGGACAAAGT 20520 

GACAGGTGGT GCATGGTTGT CGTCAGCTCG TGTCGTGAGA TGTTGGGTTA AGTCCCGCAA 20580 

CGAGCGCAAC CCTTAAGCTT AGTTGCCATC ATTAAGTTGG GCACTCTAAG TTGACTGCCG 20640

GTGACAAACC GGAGGAAGGT GGGGATGACG TCAAATCATC ATGCCCCTTA TGATTTGGGC 20700 

TACACACGTG CTACAATGGA CAATACAAAG GGCAGCGAA. CCGCGAGGTC AAGCAAATCC 20760

CATAAAGTTG TTCTCAGTTC GGATTGTAGT CTGCAACTCG ACTACATGAA GCTGGAATCG 20820

CTAGTAATCG TAGATCAGCA TGCTACGGTG AATACGTTCC CGGGTCTTGT ACACACCGCC 20880

CGTCACACCA CGAGAGTTTG TAACACCCGA AGCCGGTGGA GTAACCTTTT AGGAGCTAGC 20940

CGTCGAAGGT GGGACAAATG ATTGGGGTGA AGTCGTAACA AGGTAGCCGT ATCGGAAGGT 21000 

GCGGCTGGAT CACCTCCTTT CTAAGGATAT ATTCGGAACA TCTTCTTCAG AAGATGCGGA 21060

ATAACGTGAC ATATTGTATT CAGTTTTGAA TGTTTATTTA ACATTCAAAT ATTTTTTGGT 21120 

TAAAGTGATA TTGCTTATGA AAATAAAGCA GTATGCGAGC GCTTGACTAA AAAGAAATTG 21180

TACATTGAAA ACTAGATAAG TAAGTAAAAT ATAGATTTTA CCAAGCAAAA CCGAGTGAAT 21240

AAAGAGTTTT AAATAAGCTT GAATTCATAA GAAATAATCG CTAGTGTTCG AAAGAACACT 21300 

CACAAGATTA ATAACGCGTT TAAATCTTTT TATAAAAGAA CGTAATTCA TGTTAACGTT 21360 

TGACTTATAA AAATGGTGGA AACATAGATT AAGTTATTAA GGGCGCACGG TGGATGCCTT 21420

GGCACTAGAA GCCGATGAAG GACGTTACTA ACGACGATAT GCTTTGGGGA GCTGTAAGTA 21480

AGCTTTGATC CAGAGATTTC CGAATGGGGA AACCCAGCAT GAGTTATGTC ATGTTATCGA 21540

TATGTGAATA CATAGCATAT CAGAAGGCAC ACCCGGAGAA CTGAAACATC TTAGTACCCG 21600

GAGGAAGAGA AAGAAAATTC GATTCCCTTA GTAGCGGCGA GCGAAACGGG AAGAGCCCAA 21660 

ACCAACAAGC TTGCTTGTTG GGGTTGTAGG ACACTCTATA CGGAGTTACA AAGGACGACA 21720 

TTAGACGAAT CATCTGGAAA GATGAATCAA AGAAGGTAAT AATCCTGTAG TCGAAAATGT 21780

TGTCTCTCTT GAGTGGATCC TGAGTACGAC GGAGCACGTG AAATTCCGTC GGAATCTGGG 21840

AGGACCATCT CCTAAGGCTA AATACTCTCT AGTGACCGAT AGTGAACCAG TACCGTGAGG 21900 

GAAAGGTGAA AAGCACCCCG GAAGGGGAGT GAAATAGAAC CTGAAACCGT GTGTTACAA 21960 

GTAGTCAGAG CCCGTTAATG GGTGATGGCG TGCCTTTTGT AGAATGAACC GGCGAGTTAC 22020 

GATTTGATGC AAGGTTAAGC AGTAAATGTG GAGCCGTAGC GAAAGCGAGT CTGAATAGGG 22080

CGTTTAGTAT TTGGTCGTAG ACCCGAAACC AGGTGATCTA CCCTTGGTCA GGTTGAAGTT 22140

CAGGTAACAC TGAATGGAGG ACCGAACCGA CTTACGTTGA AAAGTGAGCG GATGAACTGA 22200

GGGTAGCGGA GAAATTCCAA TCGAACCTGG AGATAGCTGG TTCTCTCCGA AATAGCTTTA 22260

GGGCTAGCCT CAAGTGATGA TTATTGGAGG TAGAGCACTG TTTGGACGAG GGGCCCCTCT 22320 

CGGGTTACCG AATTCAGACA AACTCCGAAT GCCAffTAAT TTAACTTGGG AGTCAGAACA 22380

TGGGTGATAA GGTCCGTGTT CGAAAGGGAA ACAGCCCAGA CCACCAGCTA AGGTCCCAAA 22440

ATATATGTTA AGTGGAAAAG GATGTGGCGT TGCCCAGACA ACTAGGATGT TGGCTTAGAA 22500

GCAGCCATCA TTTAAAGAGT GCGTAATAGC TCACTAGTCG AGTGACACTG CGCCGAAAAT 22560 

GTACCGGGGC TAAACATATT ACCGAAGCTG TGGATTGTCC TTTGGaCAAT GGtAGGAGAG 22620

CGTTCTAAGG GCGTTGAAGC ATGATCGTAA GGACATGTGG AGCGCTTAGA AGTGAGAATG 22680 

CCGGTGTGAG TAGCGAAAGA CGGGTGAGAA TCCCGTCCAC CGATTGACTA AGGTTTCCAG 22740

AGGAAGGCTC GTCCGCTCTG GGTTAGTCGG GTCCTAAGCT GAGGCCGACA GcGTAGGCGA 22800 

TGGATAACAG GTTGATATTC CTGTACCACC TATAATCGTT TTAATCGATG GGGGGACGCA 22860

tAGGATAGGC GAAgcGTGcG ATTGGATTGC ACGTCTAAGC AGTAAGGCTG AGTATTAGGC 22920

AAATCCGGTA CTCGTTAAGG CTGAGCTGTG ATGGGGAGAA GEATTGTGT CTTCGAGTCG 22980 

TTGATTTCAC ACTGCCGAGA AAAGCCTCTA GATAGAAAAT AGGTGCCCGT ACCGCAAACC 23040 

GACACAGGTA GTCAAGATGA GAATTCTAAG GTGAGCGAGC GAACTCTCGT TAAGGAACTC 23100

GGCAAAATGA CCCCGTAACT TCGGGAGAAG GGGTGCTCTT TAGGGTTAAC GCCCAGAAGA 23160 

GCCGCAGTGA ATAGGCCCAA GCGACTGTTT ATCAAAAACA CAGGTCTCTG CTAAACCGTA 23220

AGGTGATGTA TagGGcTGAC GCCTGCCCGG TGCTGGAAGG TTAAGAGGAG TGGTTAGcTT 23280

CTGCGAAgCT ACGAATCGAA GCCCCAGTAA ACGGCGGCCG TAACTATAAC GGTCCTAAGG 23340

TAGCGAAATT CCTTGTCGGG TAAGTTCCGA CCCGCACGAA AGGCGTAACG ATTTGGGCAC 23400

TGTCTCAACG AGAGACTCGG TGAAATCATA GTACCTGTGA AGATGCAGGT TACCCGCGAC 23460

AGGACGGAAA GACCCCGTGG AGCTTTACTG TAGCCTGATA TTGAAATTCG GCACAGCTTG 23520

TACAGGATAG GTAGGAGCCT TTGAAACGTG AGCGCTAGCT TACGTGGAGG CGCTGGTGGG 23580

ATACTACCCT AGCTGTGTTG GCTTTCTAAC CCGCACCACT TATCGTGGTG GGAGACAGTG 23640

TCAGGCGGGC AGTTTGACTG GGGCGGTCGC CTCCTAAAAG GTAACGGAGG CGCTCAAAGG 23700 

TTCCCTCAGA ATGGTTGGAA ATCATTCATA GAGTGTAAAG GCATAAGGGA GCTTGACTGC 23760

GAGACCTACA AGTCGAGCAG GGTCGAAAGA CGGACTTAGT GATCCGGTGG TTCCGCATGG 23820 

AAGGGCCATC GCTCAACGGA TAAAAGCTAC CCCGGGGATA ACAGGCTTAT CTCCCCCAAG 23880

AGTTCACATC GACGGGGAGG TTTGGCACCT CGATGTCGGC TCATCGCATC CTGGGGCTGT 23940

AGTCGGTCCC AAGGGTTGGg CTGTTCGCCC iTTAAAGCGG TACGCGAGCT GGGTTCAGAA 24000 

CGTCGTGAGA CAGTTCGGTC CCTATCCGTC GTGGGCGTAG GAAATTTGAG AGGAGCTGTC 24060

CTTAGTACGA GAGGACCGGG ATGGACATAC CTCTGGTGTA CCAGTTGTCG TGCCAACGGC 24120

ATAGCTGGGT AGCTATGTGT GGACGGGATA AGTGCTGAAA GCATCTAAGC ATGAAGCCC 24180 

CCTCAAGATG AGATTTCCCA ACTTCGGTTA TAAGATCCCT CAAAGATGAT GAGGTTAATA 24240

GGTTCGAGGT GGAAGCATGG TGACATGTGG AGCTGACGAA TACTAATCGA TCGAAGACTT 24300

AATCAAAATA AATGTTTTGC GAAGCAAAAT CACTTTTACT TACTATCTAG TTTTGAATGT 24360

ATAAATTACA TTCATATGTC TGGTGACTAT AGCAAGGAGG TCACACCTGT TCCCATGCCG 24420

AACACAGAAG TTAAGCTCCT TAGCGTCGAT GGTAGTcGAA CTTACGTTCC GCTAGAGTAG 24480

AACGTTGCCA GGCAAAAAAT GGATGCGATG AGCCGCATTG AGACCGCAAG GTCTCTTTTT 24540

TTTATGTCTA AAACGTCAAA ATAAAAAGCA AACACAAAa AAAATGGCTT GGCGAAGTGA 24600

AAACGTTTGA ATCTGACGAA ACGAGAAAAG ArCGCAACGA GTTTAGTAGA GCTAAATGAG 24660

TAAGyGAGAG CCGAAGrAGA GGAAAGAAGC AAGCGATTGT CACAAGTCAA GAAAGGTTCT 24720

TAGCGAsGAT GGTAGCCAAC TTACGTTCCG CTAGAGTAGA ACTGGAAATG ATAATTTAAT 24780

AATGTACACT TTCGATTGTC TAAGTATGTA CAACTTTAAT TTTGTGTTTA TATAAATTTA 24840 

AAATGATATC ATCGAAAACA AAATATTGTA TAAATAGAGA AGAGCAGTAA GACGGTATCT 24900

AATTGAAAAT GATCTTACTG CTCTTTTATA TACTTTATTG AAATACAAAA AGGAAATTAA 24960

TTATTATACA ATAGACAAGC TATTGCATAA GTAACACTAA CTTTTATCAA AGAAGTGTTA 25020

CTTTATAATT AATGATTTTA TTAGAGCGTC TACATGCGGT TTTAAAGCAT CATCGTCTAT 25080 

ACCGCCAAAG CCTAATATAA ATTTAGGGGT TTTCTTATAG TCTTGATCAT CATCAAAATT 25140

ATAAACTTGT AATTTTAACT TTACTTTGTT TGCTCTATCA AGACACCTT GTAATGTTAA 25200

TCCATTTTTT ACTGTAATTG TAAAATGCAT ACCCGTTTCA GCACCTTGAA TATCAAGCTG 25260

CTCTTTGTAA GGTTTCAATC TTTTTAAAAT ATAGGTTAGT TTTCTACGAT AAATTCGTCT 25320 

CATTTTATTT AAATGCCTTT CAAAACCACC GGAAGATATA AACGTTGCAA TAAGGTTTTG 25380

CATATGAACA GGTACAGTGT TGCCTTCAAT GTGATTTTGA GAATGATATT TTTTCATTAT 25440 

AGAATAGGGT AACACCATAT ATGCAACTCG ACAGCTAGGA AAAATAGACT TTGAAAATGT 25500

ACTGATATAA ATCACTTTTT CTCCTCTTGA ATATAGACCT TGAATTGCTG GAATGGGTTT 25560 

GCCGAAATAT CTAAACTCGG AATCATASTC ATCTTCTATA ATAAATCGTT CTTCTTTTTC 25620

TTGAGCCCAT TGTATTAATT GAGTTCGTTT TTTTAAGTCC ATCACATATC CAGTTGGAAA 25680

TTGATGGGAA GGCGTTATAT ATACTATATT TTTTTGTGAT TTAATAACTT CATCTACGTT 25740

TATTCCATTA TCTTCAACTT CAATTTGTTC ATATTCAACT TGTTTTTTAT CTAAATATT 25800

TTTGATTGGT GGATAACTAG GTTTTTCGAT AATAAATGTT GAAGTATAAA GTAAATCGAC 25860 

TAATTGATTT ACTAATTGTT CGGTAGATGA GCCAATTATA ATTTGATTAG GATCACAAAT 25920 

TACGCCACGA TTAGTAAATA AATAAAATGC CAGTTGAAAC CGCAAATGTA ATTCTCCTTG 25980

AAAATGTCCT CTACGTAATT GATTTAAATG ATTTGTATCA TAAAGATCTT TGGAATACTT 26040

TCTGAAAAGT TCTATAGGGA AATGTTTCGT ATCTATTTCA TCCAAATTAA AAGCATAATC 26100

ATAAGCTTCA TCACTCGCTT TTGGTTTATA TGAATCATCA TCAAAAAGAG AGGGGATAGG 26160

TTGATTGTTT AAAATTGTTA AAGATTCAAT TTCGdCACA AAATATCCAG AGCGAGGTCT 26220

TGAATAAATG TAACCTTCGT CTAATAGAAG TTGATATGCA TGCTCTACGG TTGTTTGGCT 26280

AATAGATAAA TGTTTGCTTA ATTGTCTTTT AGAATAAAAT TTATCGCCTT CTTTAAATTG 26340

ACCTTCAATT ATTTGTTTTT TTAATTTTTC ATAAAGTTGA TGGTATAAAG TGTTTTTCAA 26400

TTTTATAACT GACCTCCTAA ATTTATCTTA TTTTGTACCT TTTTAAATAT CAGTTTATAC 26460

ATTACAATGT ATTTAATCAA CTTGAAAAGG GGTTTTATGT ATAATGAGTA AAATTATTGG 26520

ATCAGACAGA GTCAAAAGAG GTATGGCTGA AATGCAAAAA GGCGGCGTTA TTATGGATGT 26580

CGTTAATGCT GAGCAAGTAA GAATTGCAGA AGAAGCTGGC GCGGTAgCAG TTATGGCATT 26640

AGAACGAGTA CCTTCTGATA TTAGAGCTGC TGGTGGTGTT GCACGTATGG CAAACCCTAA 26700

AATTGTAGAA GAAGTAATGA ATGCTGTTTC TATTCCAGTC ATGGCTAAAG CACGTATTGG 26760

TCATATCACT GAAGCAAGAG TATTAGAGGC GATGGGTGTT GfiTATATTG ATGAATCAGA 26820 

AGTGTTAACA CCAGCAGATG AGGAATATCA CTTAAGAAAA GATCAATTTA CAGTACCATT 26880

TGTATGTGGA TGTCGTAATT TAGGTGAAgm TGCGCGTAGA ATTGGTGAAG GTGCTGCTAT 26940

GTTACGTACT AAAGGTGAAC CAGGTACAGG TAATATTGTT GAAGCTGTAA GACATATGAG 27000

ACAAGTTAAT TCAGAAGTTA GTCGATTGAC TGTAATGAAT GATGATGAGA TTATGACTTT 27060

TGCGAAAGAT ATCGGTGCGC CTTATGAAAT TTTAAAACAA ATTAAAGACA ATGGTCGTTT 27120 

ACCGGTAGTT AACTTTGCAG CTGGTGGCGT TGCGACTCCT CAAGATGCTG CTTTAATGAT 27180 

GGAATTAGGT GCTGACGGTG TATTCGTTGG ATCAGGTATT TTTAAATCAG AAGATCCAGA 27240

AAAATTTGCT AAAGCAATTG TTCAAGCAAC AACACATTAC CAAGACTATG AACTAATTGG 27300

AAGATTAGCA AGTGAACTTG GCACTGCTAT GAAAGGTTTA GATATCAATC AATTATCATT 27360 

AGAAGAACGT ATGCAAGAGC GTGGTTGGTA AGATATGAAA ATAGGTGTAT TAGCATTACA 27420

AGGTGCAGTA CGTGAACATA TTAGACATAT TGAATTAAGT GGTCATGAAG GTATTGCAGT 27480

TAAAAAAGTT GAACAATTAG AAGAAATCGA GGGCTTAATA TTACCTGGTG GCGAGTCTAC 27540

AACGTTACGT CGATTAATGA ATTTATATGG ATTTAAAGAG GCTTTACAAA ATTCAACTTT 27600 

ACCTATGTTT GGTACATGCG CAGGATTAAT AGTTCTAGCG CAAGATATAG TTGGTGAAGA 27660

AGGATACCTT AACAAGTTGA ATATTACTGT ACAACGAAAC TCATTCGGTA GACAAGTTGA 27720 

CAGCTTTGAA ACAGAATTAG ATATTAAAGG TATCGCTACA GATATTGAAG GTGTCTTTAT 27780 

AAGAGCCCCA CATATTGAAA AAGTAGGTCA iSGCGTAGAT ATCCTATGTA AGGTTAATGA 27840

GAAAATTGTA GCTGTTCAGC AAGGTAAATA TTTAGGCGTA TCATTCCATC CTGAATTAAC 27900

AGATGACTAT AGAGTAACTG ATTACTTTAT TAATCATATT GTAAAaAAAG CATAGCTTAA 27960

TGTATGCTAA ATCAACGAAT TATTGATATT TATAGATTTG TTGAGAAGAA AATATOTCT 28020

TCAAACTTAG CTTTGGAGGA GTTATTTTTT ATGTCAAAAT TAAAAATGAT AAAAAATAAA 28080

GCTATACATA AGAAAAAAAC CCTTCAAAGA GACTGAGAAT AGTCAAAATT TTGAAGGGGT 28140

TAATTCGATG TTGATGTATT TGTTAAATAA AGAATCcAGC GATTGCAGCT GAAATGAAAG 28200 

ATACTAGTGT tGCACCGAAT AATAATTTCA AACCAAAGCG GGCAACTGTA TCTCCTTTTT 28260

TGTCATTAAG TGATTTAATC GCACCTGAAA TAATACCGAT AGAGCTAAAG TTAGCAAATG 28320

ATACTAAGAA TACAGATGTA ACACCTTTTG CGTGTTCAGA TAAATCACTA AGTTTACCAA 28380

GTGCTTGCAT TGCTACAAAT TCGTTAGATA ATAGTTTTCT CGCCATAACT GAACCGGCTT 28440

GAACTGCATC TTGCCATGGC ACACCGACTA AGAATGCAAA TGGTGCAAAG ACAAAACCAA 28500

TTAATGTTTG GAAATCCCAA GAAATAGCGC CACCTGAAAC TGTACTAAAG ATATTGCTTA 28560

CAATTCCATT TAATAGAGCG ATAATGGCAA TGTATCCGAT TAACATTGCG CCTACAATGA 28620

CAGCTACTTT AAATCCATCT AAAATATATT CTCCTAGCAT TTCGAAGAAT GATTGTTGTC 28680

TTTCTTCAGT TTCTTCAACT AATAATTTGT CATCTTCTTC ATTAACTTTA TAAGGGTTAA 28740 

TAATTGAAGC GATGATGAAA CCACCAAATA AGTTTAAGAC AACAGCCGTT ACAACATATT 28800

TAGGTTCAAT TAAGGTAAAG TATGCACCGA TAATTGAAGC AGAAACAGTC GACATTGCTG 28860

AAGCTGTTAA TGTGTATAAA CGTTGCTTAG GTATGTATGG TAATTGTTTT TTAATTGAAA 28920

TAAATACTTC AGATTGTCCC AAAATTGCTG CAGCAACTGC ATTGTATGAT TCTAAACGTC 28980

CCATACCATT AATTTTAGAA ATTAAGAATC CTAAAACATT AATGATAAA GGTAAAATCT 29040

TTGTGTATTG AAGGATACCG ATAATCGCTG AAATAAATAC GATAGGTAAT AATACACTGA 29100 

AGAAGAATGG TGGTTGCTTA GGATCGATAT ATTGAATACC ACCGAATACA AAGTTAACAC 29160

CATCTGCTGC TTTTAATAAT AAGTAGTTAA AACCGTTTGA AATACCACCA ATAACCTTGA 29220 

TTCCCATTGT AGTTTTAAGC AAGATAAATG CAAAGATAAG CTGAATTGCA AGTAAAATTC 29280

CTACATATTT CCAGCGAATA TTTTTCCTGT CTGAGCTAAA TAGAAACGCA AGTGCTAAAA 29340

AGAAGATAAT TCCGATAATC CCAATTAGAA TATGCATATA TTTCTCATTC CTTTAGTTTT 29400

TTCTACaATc TATCATACAA TAAAATG®A GGGCTAACAT CATAAATTTT TGAAAATATA 29460

AAAACAAATT AATTGAAAAA GGTCAAAATA GGTCATATAA TATAGTCAAA GAAGGTCAAA 29520 

AAGGGGTGAT ATACATGCAC AATATGTCTG ACATCATAGA ACAATAaTCA AACGTTTATT 29580

TGAAGAGTCG AATGAAGATG TCGTTGAAAT TCAGAGAGCG AATATCGCAC AGGTTTTGA 29640

TTGCGTACCA TCACAATTAA ATTATGTAAT CAAAACACGA TTCACTAATG AACATGGTTA 29700

TGAAATCGAA AGTAAACGTG GTGGTGGTGG TTACATCCGA ATCACTAAAA TTGAAAATAA 29760

AGATGCAACA GGTTATATTA ATCATTTGCT TCAGCTGATT GGACCTTCTA TTTCTCAACA 29820

ACAAGCTTAT TATATTATTG ATGGGCTTTT AGATAAAATG TTAATAAATG AACGTGAAGC 29880

TAAAATGATT CAAGCAGTTA TTGATAGAGA AACGCTATCA ATGGATATGG TTTCTAGAGA 29940

TATTATTAGA GCAAATATTT TAAAACGTTT GTTACCAGTT ATAAATTATT ACTAAATGAA 30000 

ATGAGGTGTT GAAGTGCTTT GTGAAAATTG TCAACTTAAT GAAGCGGAAT TAAAAGTTAA 30060

AGTTACAAGT AAAAATAAAA CAGAAGAAAA AATGGTGTGT CAAACTTGTG CTGAGGGGCA 30120 

CCATCCGTGG AATCAAGCTA ATGAACAACC TGAaTATCAA GAACATCAAG ATAATTTCGA 30180 

AGAAGCATTT GTTGTTAAGC AAATTTTACA ACATTTAGCT ACGAAACATG GAATTAATTT 30240

TCAAGA 30246
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14333 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57:

TATTCCCCCA TCGGTTTATT AAATCGTCCA TTTCAATACT GTTTTTCCCC AAGATGTCGA 60

TAAATCCATT TCAAACGCTT GGACGATATC TTGCATCGTA CATACATTAA TTTCATGTCC 120

TTTTAATAAT GCTAACTTTT CAACTATGTC TGGGTACTTA CGATATAAAT CAACAACTTG 180

CTCAAAATCT TTAGAGCCGC TTCGACTACT ACCAATCAAC GTTAATCCTT TTTCAAGTAC 240

TAATCGTGTA TTCACTTCCA CGGGTAATTC ACTTACGCCT AACAAAGCAA TACTGCCTTC 300 

TGGTGAAATA TGTTCAACTA TTTGTTGAAG TGCAACTT& CTTCCTTTAC CTCCAACACA 360 

TTCAAATGCA TGATCAATTT TAAGATCATC TGGTATTTGA TTTACTGTAA AGATGTCATC 420 

TACAAATGAA AAATGACTTA ATTTATAGTC TGTCTTACCA AATACATAAG TTTTAGCTTC 480

TGGGTACAAC TTACGTAGCA AAATAGCAGT AATATAACCT AAGTTACCAT CACCCCAAAT 540 

ACCAAAGCTG GTTTTCAAAG GTATAGATTT ACGTTCAAAT CGTTGTATAG CATGATAACT 600 

TACTGACACT AACTCTGTGT ATGAAATCGT ACTCAAATCA ATGTCATTAG GCAGCGGAAC 660

GATACGATCA TGTGCCATCA CAACGTAGTC TTGCATAAAA CCATCATAAC CACTAGATCT 720

AAAATAACTA GAGGCTAAGT AATTCTCCGC AATAATATGA TGTTGCTCTG TAGGTGTATT 780

CGGTACCATT ACTACTTTCG TACCTTTTTC AAATACCCCT TTACTATCAA ATACAACTTC 840

ACCAACAGCT TCATGAACTA ATGACATTGG TAATTTTTTG CGTAGTACAT TTTCATCTCT 900 

TCGACCTGTG TAATACCTTT GATCAGCTGC ACAAATAGAC AAGTAXAAG GTCTTACGAT 960 

GACATGATTA CCATAAATAT CAACATTATT ATATGTGACG TCGAACTGTC TCGGTGCAAC 1020

GAGTTGATAT ACTTGATTAA TCATCGGCAA TATCACCTTG AATAATGGCA TTTGCTACTT 1080

TTAAATCATA CGGTGTTGTC ACTTTAATGT TGTATAGTTC TCCaCGTACC AATTTAACTG 1140

CATGTCCAGA TTCGACAATG ATTTTACATG CATCTGATAA GATTTCTTTT TGTTCACTAC 1200 

TTAAGGCGCG ATAACTATCT TGTAATAATT TAATATTAAA TGATTGTGGT GTTTGGCCTT 1260

GATACATTTC ATTCCTTACA GGGATACTGT GTATGTTCTG TTTATCTTTA GACATTACAA 1320

TCGTATCAAT TGCTTCAATG ACTGTATCTA CTGCACCATA TTTTGCTGCT ACTTCAATGT 1380 

TCTCTTTAAT AATACGTTGA GTTAAAAATG GTCTTACGGC ATCATGAGTT ACAATCACAT 1440

CATCATTATT AATTCCATTT ACATTGCGAA TATGGTCGAT AATGTTCATA ATTGTTTCAT 1500 

TTCGATCCGT ACCACCTGCA ACTACTTTGA CACGTTGATC TGTAATGTTA TATTTTTTA 1560

AAATATCCTG TGTATGGGAA ATCCACTGTG CTGGCGTTGC GATAATAATC TCATTAAATT 1620

CACTCACTAA AATGAACTTC TCAATTGTAT GGATTAAAAT CGGTTTATTA TCAATATCTA 1680

AAAATTGTTT TGGTAATGGC ACGTTCCCCA TTCTCGAACC AATACCTCCT GCTAAAATAC 1740

CTGCATAAAT CATGTTGTCC TCCATTCTGT CATTACATCA TTTCCATTTA TACATTACTG 1800

ACCTATGCCC GCACATAAGC CTAACCTATT GCTCACTTGC CTCTTTTATT AATCCAAAGA 1860



TAGTTGTCAC AATAGTGTGA TAATTTTTTA TAAAAATGTA TTTTTGTAAC TGACCATTCT 1920 

AAGTTGTTTT GCCATGCAGT TAATCATTAA CTCT<ACGAT ATTAAATTGT TAAAGGTATT 1980

AATGTTTACT CTTTTTCAAA TTCATTATTA CTGCCATCAT TTTACCATAT ATTATAATAA 2040

ATTTATCTTA TTAAGTGGCT GTACTTGATT TTCACTTTAA AAATTATCAA ATATTGCCAT 2100 

CTCATTTTAA GTATACAAAA TGCAAAACAA CCGATTCACA AGCATATTTC ACACAAGTAA 2160 

ACCGGCTATT TATCAACGTA TATTCGAAGA TGAATTATTT CGATAGTATC TATAGACCAG 2220 

ACGGCATTCG CACTTTCATA GCTATAACTA TACCAGCGTT TTCGTCCTCA AAGGTGCATA 2280 

CTAATAAATC GTAAACATGA CTTTATCAAA TCGTTCTTTC TTGTTAACTA ATTTATCAAA 2340

TGTCTCCGGG CCTTTTTCTA ACGGTAAAAA ATGAGAAATA ATAGGCTTTA CATTAATATC 2400

TTTCGTCTTC ATATAATGTA AGGTTGCCGT CCACTCTTTG CCCGGAAAAT TACTGGACAA 2460

ACAGTTCCAA GAGCCACATA CTGTCAACTC GTTACGCAGA ATTTTTTCAA AATGAACGCG 2520

ATCAATCTCA ATATCATCAT ATGGTATTCC GAGTAATACC ACTCGCCAC CTTTTTTAGG 2580

TAGCGTCAAT ATTTGACCAA TCGTAACTTT AGCACCTGAT GATTCTATAG CTAAATCGAT 2640

TTGATTGGCG TAATGATTTT CGATGAATTT CTCAAGATTT TCTTCTTTTG AATTGATTGT 2700

TTGATGTGCG CCCAATGATG TTGCAATATC TAGTTTATGC GCATCTATAT CTATAGCGAT 2760

GATATGTGCA GCACCAAATA TTCGTGCCCA TTGAATAGCT AACAAACCTA TACTGCCACA 2820

CCCCATTACT GCAACAGTCA TACCAGGTTG TATATTCGAT TTATAAAACC CATGCGCAAC 2880

AACGGCTGAT GGCTCAACCA TTGCTGCTTC AATGTAATCA ACATTGTCTG GAACCTTTAA 2940

AACATTTTGC GCTGGCAATT TGACATATTC CGCGAACGAT CCAGGTTCAT ATGAGCCAAT 3000 

GACGAATAAC TTTTCACATC GTGCATATTC ACCTTTTAAA CAATACTCGC ATTGATAACA 3060

AGGTATTGCT GGGCAACCTG TCACTTTGTC GCCCACATTA ACATGCGTAA CATCACTTCC 3120

AATGGCATCT ACTACACCTG AAAATTCATG ACCAAATGGC ATACCTTTAA TGTATGGCCC 3180

CATTTTTTTG TATCGTGACG TGTCTGAACC ACATATGCCA GTCGCTCGTA CTTTAATAAT 3240

AACGTCATTC GCACTTTCAA TGACTGGCTT TTCATTATCC TCATACCGTA AATCTTCCAC 3300 

GCCATATAAT TTCAATGCTT TCACTTGTAA ATCACCTCAA ATTTGATTTA ATTCACAACT 3360

TTTTTCTTTT TAAAAATACC TGTCGCAAAA TAACCTGCAA TGACAATGGA ATTACTTACG 3420

AGTAAATGTT CCATATAAAA ATCAGTGATT TGTCTTAATG GCCCAAGCAT AAAAGTTAGC 3480

ATTGATTCTA CCATATTAAT TCTCTCCTTC ATTTTCATCT AAATCGAGCG GCTTAAGTAA 3540

TGCTTTAATA CCTTCGCCGG ATTTTAAATG TTGATACGCC TCGTCCCATT TCGAAATATC 3600 

ATATATTTTT GTCACCAAAG CTTCAGCATT TACTAAACCA TCCGCCATAA GTTGCAATGA 3660

AGGTTCCCAA TCTGCTGGCT TTTGACTTCT ACTACCAACA ACTGTTATTT CTTTTTGAAT 3720

CACTTTTTCC ATATCAAATG GAATTTCAGC ATCCTTAAAA ATACCTATTT GACTGTGAA 3780

ACCTTTTTTG CGTAAAATAT CCAAACCTTG TCGTGCTGCT GGAACTGCAC CTGAACATTC 3840

AACAACAACA TCTGCACCGT AACCGTCTGT AATTCCATTG ATATACGTTT TTAAGTCTGT 3900 

TTGTTGTAAA TTGACTACAT AATCCATGTG CAATGCTTCT GCTTTATCTA ATCTGACTTT 3960

GTCATTGTCC AATCCAGTTA CCACAACAGT TGCGCCTTTA CTTTTTAACA CTTGTGCTAC 4020

AAGTAATCCG ATTGGCCCAG GTCCCATTAC AACTGCTACA TCGCCTGAAT TGACTTGAAT 4080

CTTAGAAACG CCATGATGTG CACATGCTAA TGGTTCTGTC ATAGCTGCAG ACTGATACGA 4140

TAtTCGTCTG GAATATGATG CAAACTTTCT TCACGTGCR TGACATAATT AGTAAATGCG 4200 

CCATCAACTT GTGTTCCAAT ACCTTTTCGA TGGTTGCATA AATTATAGTC TTTTGATTTA 4260

CAGTATTCAC ACTCATTACA AACATAGAAT GTCGTTTCAG aTGtGACACG GTCACCAACT 4320

TTAAAATCTT TAACGTCTGC TCCAACTTCA ACGATTTCAC CAGAAAATTC ATGACCTAAT 4380 

GTCACTGGAA AATTAACTTT ATAATGACCT TCATAAGTAT GAATATCTGT GCCACAAATT 4440

CCTGCATAAT GTACTTTAAT CTTTACTTTA TCATCTAGCG GTGTTGCAAC TTCTTTATCA 4500

AGAAGTTCTA AGTTGCCATG TCCTTCTCTT GTTTTTACTA AAGCTTTCAC CACAAACACC 4560

TCGATTTTTA ATTGAATAGA CTAAATAGTT TAAAGATAAG ATAGTTAACG ATATTACCAC 4620

CTTGATCAAT ACTTGAAATT TCAGATGAAC CTTTTGGCAT TTGTACATTC GTACCTTTCG 4680

CCATATCTGT GAAAATGGGT GCTACGTCTG TTGCAATATA TAGTGAAATT GCAATCATAA 4740

TCGTACCCAC AATGACAGAA TGAATAATGT TTCCTCTTGC TGCACEACA ATAAACGCGA 4800

CAACAAATGG TATCGTTGCT AAGTCACCAA AAGGTAGTAC TTGGTTTCCT GGTAAAATAA 4860

CGGCTAATAA AACAGTGATA GGTACTAAAA TTAATGCTGT CGAAATAACT GCTGGATGAC 4920

CTAATGCTAC AGCCGCATCC AATCCAATAT AAATTTCACG TTCGCCAAAA CGTTTATTTA 4980

GCCATGTTCT TGCAGACTCT GAAACTGGCA TTAAACCTTC CATTAAGATT TTTACCATTC 5040

TAGGCATTAA TACCATTACT GCAGCCATTG ACATTCCTAA ATTAATGATG TCTCCAGGTT 5100 

TGTAACCTGC TAACACACCA ATACCTAAAC CTAAAATTAA GCCGACAAAT ATAGACTCTC 5160 

CAAATGCGCC AAAACGTTTT TGAATTGTTT CAGGATCAGC ATCTAACTTA TTCAGACCGG 5220 

GTACTTTTTG TAACAATTTA ACTAAGTAAA TACCTGGTGC ATAAGAAATT GTACTTCCTG 5280

TAGCAATCGA TACACCTGGT AAATCAAAGA ACTCACTCAT CATCGGTGCT GTCCAATCTG 5340

CTACTTTCAA ACAGATAATT TGGAAAATAA CTGCTGCTAA TAACGCTTGC CAATACTGC 5400

CTGATACGGC ATAAACCATT GCTGCTGTAA ACGTATAATG CCAAAAATTC CAAATATCTA 5460

CATTCATCGT CTTTGTCACT TTAGTTACTA GCAATACAAC GTTAACTATG ATTCCGAGTG 5520

GAATAATAAA TGCTGCGACA GATGATGCCC AAGCGATAGA TGATGTTGCT GGCCAACCTA 5580

CATCAATCAC ATTCAGACTG ACGCCTAAAT TTTTAACCAT CGCTTGTGCT GCTGGCCCTA 5640

AATTTTTAAC TAATAAATCG ATGACTAAGA AAATCCCTAC AAAAGCCACA CCTATTGTTA 5700

AACCAGACCT AAATGCCGCT CCAATTTTCT GCCTAAAGAA TAGGCCAAGC AAGAATATGA 5760

CAACCGGTAA AATAACAGTt GCACCTAAAT CTAAffiATCC CCTTACAAAA TCAGTGAAGT 5820

AACTCATATT TAAACCCTCC CTGTTATATA TGCATTGTCA CGATACTTTC CGATTGTGAT 5880

TACATTTGAC GTTACAGTCA TTTCAACGAC AACCCTTGCT AAATTCGACT GCAGTCCTTT 5940 

TGAATTACAG tCACTGCGTT TCTATGTCAT CAACAATCAT TTGTCGTGAT AGTCATTTAT 6000 

ATGCAATTTG CATATATTAA TATGTTATCG ACCCACGTTA CATATCAATT CCGTTATTTT 6060

TGTAACTCTG TTAAGATTTG TTGTTTTGTT TCTTCAATAC CAATACCAGT TAAGAAATTA 6120

CGTGCGTTGA TAACTGGGAA TTTATATTCT TTTTTTGTCA TTGCAGTTGT AACTAATAAA 6180

TCTGCAGTGT CTTCATAAGG TCCAACTTCT GTAATTTTGA TTTGTTTAAT ATCTACTTTA 6240

ATATTGTGTT CCTTTGCCAT TTCTTCAATT GCATTATTTA CTACTGTTGA CGTTGCAATA 6300 

CCTGCACCAC ACGCTACTAA TACTTGTTTC ATTTTCAATT CCTCCAATTA ATTTTTAGTT 6360

ATATTCCAAA TAATCATTGA TTAGTGTTGC TAAAATTGTT TfiTCTTTCG TTCGTAGAAT 6420 

CTGCTCCAAT TTTTCTTCAC TTTGAAAAAT TTGCATCAAC TGTTGTAACA GCTTAAGTTG 6480

ATCATCTACT TTATCCATTG CTAACATAAA AACGATTTTC ACTTCTGTCT GTTGATCAAG 6540

TGTTCCCATT TCAATAAACG GCACTTCTTT TTCTAGAACA GCCACACCTA TCGTTCTATG 6600 

GTTAATATGT TCGACATCTG TATGCGGTAT AGCGACCGAA CATAGATGCG TTGGTAAACC 6660 

AGTAGCAAAT TCTTTTTCTC TGTCGATGAC TGCATCTTTA AACGTTGACT TCACGAACCC 6720 

ATTTTGAAAT AACACATCTG ACATTTGTGA CAATACGGAT TCTTTATCAG TTGCCGACAA 6780

ATTGAGCATT ATATTTTCTT TATGCACTAA TTGCTGTCCC ATCCATTTTC CCTCGCTTCT 6840

TTATTTGAAT AATTTTTTAA AATCTCATTT ACATCAGAAT TTTTGCGACT TTGTATGATG 6900

CGCTTAATTG CGTCATTGTC TTGCGCCACA TCTCTCAATT GTAGTAACGC TCTTAAGTGT 6960 

GTCACTTTAT CAACAGCAGC AATAGGTACA ATAATATGGA TTGCTGTGCC ATCTGACATG 7020

TATATTGGTT CTTGTAATAT CAACATACTC ATCGCTGTTT TATGTACATG CTTTTCAGAG 7080

TATGCATGTG GAATAGCAAT ATTTTGCATG ATAACCATAT ACGAATCATC AAATACCTGT 7140

TGCATCTCAT GAATATATTT AATATCAATA AAATGATTAG CAACTAACAC ATCACTTGCT 7200

TTAGCAATAG CTTCATCAAT ATTTTCAACA TGATGCATTC TTTTCACGTG CCTTGCCGGT 7260

ATCAAGTCAG CTAAATCTAA TGyCTwATTT tGTGtGACaA TCGATCCATT AATGGTTGAA 7320



ATTGAATTAT AATTGGCAAT AAAATCTTCT AAACCATCAC GTAGTcTGTA ATGTCATTAA 7380

CTGTCGTTGT GCGTTCAATT AATGCCATTA 7CTTGTTTAT TTCCTTATCA ATGTCAGCCG 7440

ATTCCTTATT AATGTACTTC ATCACTTCTT TACGTAACTT TCGTTGCTCA TTTTCAGATA 7500

AAGCTACTTT TGTGATAAAT AATTTTTTAT GTGTTAGGAC AAACATTGGT GAAAAGACGA 7560

TGTCATAATC TAATGTGTAA TTTTCAAATG TTCTAAGTGA AATCGCATCT AAGAAAAAA 7620

TTTCTGGAAA TAAGTTTCGC AACTCGTATA ACATCATTTG TGATACTGAC GTGCCTTGTG 7680

TACACACGAT AATAGCTTTT ATCTTGCCAT CGAAGTTTTC ATCTTGACGT CTCAAACTAC 7740

CTCCGAACAA CATGGTTAAA TATGCTATTT CATTATCAGG CAACGATTTT CCGAAATATT 7800

CAGTTAACGA TTGACATGAT TGTTTCACCA TATGAAATAA GGATTGATAA TTTCCTTGTA 7860

AAGGATTTAT TAATTCATCA CGATCCGTTA AGTTATATTT AATCCTATAA AAAGCAGGCG 7920

TTAAATGTAA CAAGAGTTGC TGTGATAATT TCTCCTTATC TTCAATGTTA ATAAAAGTGA 7980

TTTGTTCAAA ATGGTGAATC ATTTGAGCGA TGGCCATCCT TAAATTCGAT ATGCTATCTG 8040

ATTCTTGCAA ATCAGTCCAT TGCACACTTG TTGAAAGTAA GTGTAATGTC AAATATAACT 8100 

TTTCCGCTTC TGGCAAATCC GGCTCATGTT GCGTCATAAT CTCCGTTGCT TGATATTCTT 8160 

TCGTATCCCT CAAATACTGA TAATTAATAT TTAATGGATT CATCACATGA CCACTTTGAA 8220

TTCGTCTACG AATCACACAA AGGACATAAG GCAATGAACT AAGTGATTTG TCTATAAAGC 8280

GACTCTTCAA AAATTGTTCT ACCTGTTTGA TCTTGTCTTT TTGATATGCG ATATCTTCGA 8340

ATGTTAAGTT GAGCGCCTTT AAAACTTCAC TTTTAGTAAT ATCATGATTC AACCTTTGAT 8400

CAATCAACTT AATGAAGAAA CGGCGAACTT CAAATTCATC ACCAACAATT TCATAACCAT 8460

GTTTTCGAGA ATACTTAAGT GACAAACCAT GATTTTCCAA TTGCTCTTTC ACATGATTTA 8520 

TATCGTGAAT GACAGTATTT TTACTGACTT GTAAATCAAT TGAAAAATGG TTTAGAGACA 8580 

TTGCGTTTTC CTTACTAAAA AGCATGAGCA TTAAATAATA ACGACGGTT TCTATGCTAA 8640

AAATGACATT GTTGCCGTTT AACATTTGCT GCTCCGATAC ATCTCGCTTG AATAACGTCA 8700 

TGATTTCAGA ACTTACAATA AAATTTCCTT GGCTTGTTCT TTCAAGTTTT GGATAACCCT 8760

CTTGTTCAAG CCACAAATTG ATTTTTTGAA TGCGATATCC TAGTTGTCTA CGAGACAAAC 8820

CAAATATCGA TTCAAGTTCT TTACCATGAA TAGTAGGATT CAATACAATT TCTCTGAGTA 8880

TTTGCTGATT ATCCCCGTTC ATGTTCGATT CTCCTTTGAG CATCTGAGCT CAATTTATCA 8940

TCAATCGTCA CACCGATGTA CACACTTTGA ACACATATTT TCAAAATGAG CATGTACATC 9000

ATTGTGATGT TTTAACAACA TTTCAATTAT ATCTATATTT TTTGTGATTT TAATCTTTTA 9060

AAATAAAGCA ATTGAAATTT TTGCATATAT TTTTGTGTTT TGTGTTTTTT TGAAGCATTT 9120 

TTAACATACA TATCTCAATC ATTATCAAAT TGTCATGACC ATTGTAACCC AATACAAAAA 9180

CCCTAAGGAC GCTTATATCA GGCGCCTTAG GGTTAACTGT ATCTATTTAA T TAG TAT T A 9240

TTATTCGTAT GTACGTAACT TATGGTCTAT CAAGTTCCAC ACTTCTTCAA CATCAACTGC 9300 

TGTAGCAAAA TAAGCATTGG CAGGCTTACC TGTAACATGA TTTAAATCGA CAGCCATAGT 9360

GCCATAAGTT AGTGGACTTT GATGTTCAAT GTCGATATTA ACGGGTACCA TTGTAAACAA 9420

TTCTGGTTGT AACAAATACA AAATTGTACA AGCATCATGT ATTGGACCAC CATCCATATT 9480

AAAGTGAGTC TTGTATGTCT TCTTAAAGAA TTGCAATAAT TCTACGACGA ACTGTGCAAC 9540

AGGATTATTG ATACTTTCAA AGCGTTCAAT CACGTGATCG TCGGCTAAAA CTTGATGTGT 9600

TACATCTAAA CCAAACACAT TTATAGTAAT CCCACTTTCA AAAACACGCT TCGCTGCTTC 9660

AGCATCTACC CAAATATTGA ATTCTGCTGT AGGCGTCCAA TTTCCAAATG TACCACCACC 9720

CATCAAAGTA ATAGATTCAA TATGCTCAGC GATTCTTGGC TCACGAATCA ATGCCGTTGC 9780

TACATTCGTA AGAGGACCTG TCGCTACAAT TGTTACAGGT GTATCACTCG TCATCACTTT 9840

GTTTATAATC ACATCTGATG CTGGCATTGC AACTGCTTGA CGTGATGGTG TCGACGGTAG 9900

TTTCGGACCA TCTAATCCAG ATTCCCCATG TATTTCAGAA GCAAAGGCAG CTGGTTTAAT 9960 

TAACGGCCTA TCCGCACCTT TCGCTACTGC TATATCTTGG CGTCCCATAA TATCCAATAC 10020 

GTTCAAGGCG TTTGTCGTAT TCTTGTCAAC TGATTGATTA CCTGCGACTG TTGTTACAGC 10080 

TAATATCTCT AGTGGACTGT CAATTGCCCC CGCTAAAATT AATGCTATTG CATCATCGTG 10140 

TCCTGGATCA CAATCCATAA TAATCTTTCT TTTCATTTAT ATATCCACCT TTCTTAAGTT 10200 

GTTATCGATA GCTTATGTAT ATTTATTTAT GTGGTGAATC AtETTTATTT TGAAAAATAG 10260

TTTTAACTTT CTCATATTTT TGGATACAAA CACTATTTAT CTATTTTATG GCTTATAAAT 10320

TTATCCGATA TGCCTTATCA ACCTACCTCG CTAAAAATAG GATGTCTACA TATCTATACC 10380

GACTTTTGTC AACTCATTTT CACAACAATA TAAACAGCAA TTTATATGAT TGTTACATGA 10440 

TTCAAACAAT TTTTATGAAA AATATTTTCA TACACAGAAT ATATATTGAT ATTAAATTTC 10500 

TCAAAAGCTA TATTGAGAAT AATTAGGAGG GATGTTGATG AAATCTTTAT TTGAAAAAGC 10560

ACAGCAGTTC GGCAAGTCCT TTATGTTACC TATCGCAATC TTACCAGCTG CAGGTCTATT 10620

GTTGGGTATC GGTGGTGCAT TAAGTAATCC AAACACCGTT AAAGCATACC CTATTTTAGA 10680 

TATTACCTTA TTACAAAATA TTTTTACATT AATGTCAGCT GCAGGTAGTA TTGTTTTCCA 10740

AAATTTACCG GTCATCTTTG CAATTGGTGT CGCAATCGGA TTATCTAGAA GCGATAAAGG 10800 

TACTGCAGGT tTAGctGCGC TGCTCGGTTT CTTAATTATG AACGCAACTA TGAATGGCTT 10860

ATTAACTATC ACGGGCACAT TGGCAAAAGA TCAGCTTGCA CAAAATGGAC AAGGCATGGT 10920

GCTCGGTATA CAAACGGTTG AAACCGGTGT TTTTGGCGGG ATTATCACAG GTATTATGAC 10980

CGCAATACTT CACAACAAAT ATCACAAAGT GGTATTACCA CCGTATTTAG GTTTCTTTGG 11040

TGGCTCTAGA TTTGTCCCTA TTGTCACAGC ATTTGCCGCA ATCTTTTTAG GTGTATTGAT 11100 

GTTTTTCATT TGGCCAAGCA TACAAGCCGG CATTTATCAT GTTGGTGGAT TTGTAACGAA 11160 

AACAGGTGCC ATCGGTACTT TTGTTTATGG CTTCATCTTA AGATTGTTAG GTCCACTCGG 11220

TTTACACCAT ATTTTTTACT TACCGTTTTG (EAGACGGCA CTTGGTGGTA CTTTAGAAGT 11280 

CAAAGGGCAC TTAGTTCAAG GTACGCAGAA CATCTTCTTT GCTCAACTTG GTGATCCAGA 11340 

TGTGACGAAG TATTATTCAG GTGTGTCACG CTTTATGTCA GGCCGTTTTA TTACGATGAT 11400

GTTCGGCTTA TGTGGTGCCG CACTTGCAAT TTATCACACA GCTAAACCTG AACATAffiAA 11460

AGTTGTCGGC GGTTTAATGT TATCCGCTGC ACTCACTTCA TTTTTAACAG GTATTACCGA 11520

ACCTTTAGAG TTTAGTTTCT TGTTTGTCGC ACCTATTCTT TATGTAATCC ATGCCTTCTT 11580 

TGATGGATTA GCATTTATGA TGGCAGACAT TTTCAACATT ACAATTGGTC AAACCTTCAG 11640 

TGGAGGCTTT ATCGATTTCT TACTCTTTGG TGTGCTACAA GGTAATAGTA AAACAAACTA 11700

CCTATACGTC ATACCTATTG GAATTGTGTG GTTCTGTTTG TATTACATCG TTTTCAGATT 11760

CTTAATTACG AAATTTAATT TCAAAACACC TGGTCGAGAA GATAAAGCTG CAGCACAACA 11820 

AGTTGAGGCT ACTGAAAGAG CACAAACTAT TGTTGCTGCT TTGGGAGGCA AAGATAACAT 11880 

TGAAATCGTT GACTGTTGTG CAACGAGACT ACGCGTCACA CTTCATCAAA ATGACAAAGT 11940

CGATAAAGTA TTACTCGAAA GTACTGGTGC CAAAGGTGTA ATCCAGCAAG GCACTGGTGT 12000 

GCAAGTAATT TATGGGCCTC ACGTTACAGT TATCAAAAAT GAAATTGAAG AATTGCTCGG 12060

GGATTAAGAC TAACCGAAAT ATCAACAGAA CTAATGGCAA CGATGTACGA AGTAAGAAGT 12120 

GACATCGTTG CTTTTATTTT TAATGTTACA TTTGAAGCAT TAAGTTCATC ATGCACTGTA 12180 

GTGAGCCCGC AAATCGCCTC TGCTAGACAA TCATCTTAAT GCTATGATTA AAGCTTAAGT 12240

GCCAGATTTG AATTTAATTT CAACAACGAC TTTCACTACA TTAAAAATAG GGCCACTCGA 12300

CACATATAGT TGTATCAAAT AGCCCTTTAT ACAATTTTTT GGGTAAGGTT TTACAATTTT 12360

TGGGATGGTA TAGATTTTAT AAAAAGTTAT TTAAGTTCTT CTGCTTCAGC CATAATATCT 12420 

TTTAATGTTT TAGCTGAATG TGCGAACTTG CTTTGTTCTT CGTCGTTAA TGGGATTTCT 12480

ACGACGTTGC GAATACCGTT TCTATTGATG ACTGCTGGAA CACCAATATA AACATCTTCT 12540

TCCTCATATT CGCCTTCTAA TAATGCTGAT ACAGTCAATA CGGCATCTTC ATTTCTGAAA 12600

ATCGCTTCAG TAATTCTAGC TAATCCCATT GCAACACCAT AATAAGTGGC ACCTTTAGCT 12660

TGAATAATGT CATATGCTGC ATCACGTGTT TGAACAAAAA TTTGTTCAAT TTGCGCTTTG 12720 

CCCTCAGGAC GTTGTTCAAG TAATGTCTTC AAAGGTTGAC CCGCAATATT AGCGTGTGAC 12780

CATACTGGTA ATTCAGTGTC ACCATGTTCA CCAATAATTT GAGCATCGAC GCTACGTGGC 12840

GCAACATCGn AcgyTcGCTT AACAATA£TC TAAAGCGTGC AGAGTCTAAA ATTGTACCAG 12900

AACCTATAAC ACGTTCTTTA GGTAAACCAG AGAATTTCCA TGTTGCATAC GCTAAAATAT 12960 

CAACAGGATT TGTAGCTACC AAGAAAATAC CATCAAATTT TGATGCCATT ACTTCACCAA 13020 

CAATTGATTT GAATATTTTC AAGTTTTTAG ATACTAAATC TAAACGTGTT TCCTCAGGTT 13080 

TTTGTGCAGC ACCAGCACAG ATGACAACTA GATCCGCATC ATGACAATCA CTGTATTCGC 13140

CAGCTTTCAC ACGAACTGTT GTTGGAGAAT ATGGTGTGGC ATGTTTTAAA TCCATAACAT 13200 

CTCCTCGAAC TTTTTCAGTG TCTAAATCAA TGATGACTAA TTCATCAACA ATGCTTTGGT 13260

TCACTAATGA AAATGCGTAG CTTGAACCTA CTGCACCATT ACCTATTAAT ACAACTTTGT 13320 

TCCCTTTAAA TTTGTTCATT ACAAAAACTC CCTTATGATT AATTCACTAA CATACATGTA 13380

GCTTCAAATA TGTTAGTTTA ATGCTGCTTA TTGACGATAC AAAAGCAAAT AAACATCTCT 13440 

TTTATTTTCA ACGCATAACT TAAAAGGTCA TGTGTATCC GCTTTTAAGT TTGTGATTTA 13500 

TTTCACATAT AAAATGTAAC ATGCATTAAG TACTGGGTCA ATATTAAATT GTGATTTATT 13560

TCACATTTTA TTTTAATTTT TACACCTTTT TAATTTGTAT mCGATTACAT CTTAGATGTC 13620 

TTTAGTCTTC GTACTTCGCC AGTGATTATT TACACTTTCA CATTTTTATT ATCATGTTTA 13680

CTTTTTTCTA GGAAAACAAC AATGTTTTTT GAATTAGTCA AATAAATGCG CTCAATCGTC 13740 

GGTGTGCAAA CAGACAATTG TACACAATGC TTATTGATAA GTATTTAAAA AATTAAAAAT 13800

GTCATACAAT TATCAAATTT GCCATTTTAT TTATATTTTC TCAAACCAAT TAATTGAATA 13860

TCGAAATTTT TAGTAGAATA ATCAAAATAT ACAGATTAAA GGAGGAGTAT CATGCTTACA 13920

GAACAAGAGA AAGACATTAT CAAACAAACG GTGCCTTTAC TTAAAGAGAA AGGGACAGAA 13980 

ATTACGTCAA TCTTTTATCC AAAAATGTTT AAAGCGCATC CTGAACTTTT AAACATGTTT 14040

AATCAAACGA ACCAAAAACG AGGCATGCAA TCTTCAGCAT TfiCACAAGC TGTAATGGCC 14100 

GCAGCGGTTA ATATCGATAA CTTAAGTGTT ATTAAACCAG TCATTATGCC AGTCGCATAT 14160

AAACACTGCG CACTACAAGT TTATGCTGAA CATTATCCAA TTGTGGGGAA AAATTTATTA 14220 

AAAGCCATTC AAGACGTGAC AGGATTAGAA GAAAATGACC CTGTCATTCA AGCTTGGGCA 14280 

AAAGCATATG GCGTTATTGC GGATGTGTTC ATCCAAATTG AAAAAGAAAT TTA 14333

(2) INFORMATION FOR SEQ ID NO: 58:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8779 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:

GGTATTTTnG GAnGGGTACC TAAAGCAATT CCGGCAAAGG GTnAATCCAG GTACCGAAAT 60 

GGACTTCCCG TTATCGATAA TACCGACATA TATTGTGACA AGTAGATTTT ATGGACATTT 120

AGGCTTACTT TTACTTGTGA TAATTGCATG TATGTTTACT GGTATTTAtC CaTCaATACA 180

TATCATTCAA TTATTGATAT ATGTACCGTT TTGTTTTTTC TTAACTGCCt CGGTGACGTT 240

ATTAACATCA ACACTCGGTG TGTTAGTTAG AGATACACAA ATGTTAATGC AAGCAATATT 300 

AAGAATATTA TTTTACTTTT CACCAATTTT GTGGCTACCA AAGAACETG GTATCAGTGG 360

TTTAATTCAT GAAATGATGA AATATAATCC AGTTTACTTT ATTGCTGAAT CATACCGTGC 420

AGCAATTTTA TATCACGAAT GGTATTTCAT GGATCATTGG AAATTAATGT TATACAATTT 480

CGGTATTGTT GCCATTTTCT TTGCAATTGG TGCGTACTTA CACATGAAAT ATAGAGATCA 540

ATTTGCAGAC TTCTTGTAAT ATATTTATAT GACGAAACCC CGCTAACCAT TAATAAATGG 600 

AAGTGGGGTT CATTTTTGTT TATAATTTAA GTAAATAACA TATTAAGTTG GTGTATTATG 660 

AACGTTTTAA TAAAGAAATT TTATCATTTG GTAGTTCGAA TACTTTCTAA AATGATTACG 720 

CCTCAAGTGA TTGATAAACC GCATATCGTA TTTATGATGA CTTTTCCAGA AGATATTAAG 780

CCTATCATCA AAGCATTAAA TAATTCGTCG TATCAGAAAA CTGTTTTAAC AACACCAAAA 840

CAAGCGCCTT ATTTATCTGA ACTTAGCGAC GATGTTGATG TGATAGAAAT GACTAATCGA 900

ACATTGGTAA AACAAATTAA GGCTTTGAAA AGCGCGCAGA TGATTATTAT CGASATTAT 960

TACCTATTGC TAGGTGGATA TAATAAGACT TCTAATCAAC ACATTGTTCA AACGTGGCAT 1020 

GCAAGTGGTG CATTAAAAAA CTTTGGCTTA ACAGATCATC AAGTCGATGT GTCTGACAAG 1080 

GCAATGGTTC AGCAGTACCG TAAAGTTTAT CAAGCGACGG ATTTTTACTT AGTGGGTTGT 1140

GAACAAATGT CACAATGTTT TAAACAGTCT TTAGGTGCAA CAGAAGAGCA AATGCTGTAT 1200

TTTGGGCTTC CGAGAATTAA TAAATATTAC ACAGCTGATA GAGCAACGGT TAAGGCAGAG 1260

TTAAAGGATA AATATGGAAT TACAAATAAG TTGGTATTAT ATGTACCAAC ATATAGAGAA 1320

GATAAAGCAG ATAATAGGGC TATTGATAAA GCTTATTTG AAAAATGTTT ACCAGGATAT 1380

ACACTGATTA ATAAATTACA TCCATCAATT GAAGATTCAG ACATTGATGA CGTATCTTCA 1440

ATCGACACGT CTACATTAAT GCTAATGTCA GATATAATTA TTAGCGACTA TAGTTCGCTG 1500 

CCAATAGAAG CTAGCTTGTT AGATATTCCA ACTATATTTT ATGTGTATGA TGAAGGAACA 1560 

TATGATCAGG TGAGAGGCCT GAATCAATTT TACAAAGCAA TACCGGATAG CTACAAAGTG 1620 

TATACTGAAG AAGATTTAAT AATGACGATA CAAGAAAAAG AACATCTATT AAGTCCGTTA 1680

TTTAAAGATT GGCATAAGTA TAATACTGAT AAAAGTTTAC ATCAGCTCAC AGAATATATA 1740

GATAAGATGG TGACAAAATG AGGTTTACGA TAATCATACC TACATGTAAT AATGAGGCAA 1800 

CAATTCGACA ATTGTTAATA TCTATTGAGA GTAAAGAACA CTATAGAATC CTTTGTATTG 1860

ATGGTGGTTC TACTGATCAA ACAATTCCTA TGATTGAACG GTTACAAAGA GAACTCAAGC 1920

ATATTTCATT AATACAATTA CAAAATGCTT CGATAGCTAC GTGATTAAT AAAGGTTTGA 1980 

TGGATATCAA AATGACAGAT CCACATGATA GTGACGCATT TATGGTCATA AAACCAACAT 2040

CAATCGTATT GCCAGGTAAA TTAGATAGGT TAACTGCTGC TTTCAAAAAT AATGATAATA 2100 

TTGATATGGT AATAGGGCAG CGAGCTTACA ATTACCATGG TGAATGGAAA TTGAAAAGTG 2160 

CTGATGAGTT TATTAAAGAC AATCGAATCG TTACATTAAC GGAACAACCA GATTTGTTAT 2220

CAATGATGTC TTTTGACGGA AAGTTATTCA GTGCTAAATT TGCTGAATTA CAGTGTGaCG 2280

AAACTTTAGC TAACaCATAC AATCACGCAA TACTTGTCAA GGCGATGCAA AAAGCTACGG 2340

ATATACATTT AGTTTCACAG ATGATTGTCG GAGATAACGA TATAGATACA CATGCTACAA 2400

GTAACGATGA AGATTTTAAT AGATATATCA CAGAAATTAT GAAAATAAGA CAACGAGTCA 2460

TGGAAATGTT ACTATTACCT GAACAAAGGC TATTATATAG TGATATGGTT GATCGTATTT 2520

TATTCAATAA TTCATTAAAA TATTATATGA ACGAACACCC AGCAGTAACG CACACGACAA 2580

TTCAACTCGT AAAAGACTAT ATTATGTCTA TGCAGCATTC TGATTATGTA TCGCAAAACA 2640

TGTTTGACAT TATAAATACA GTTGAATTTA TTGGTGAGAA TTGGGATAGA GAAATATACG 2700

AATTGTGGCG ACAAACATTA ATTCAAGTGG GCATTAATAG GCCGACTTAT AAAAAATTCT 2760

TGATACAACT TAAAGGGAGA AAGTTTGCAC ATCGAACAAA ATCAATGTTA AAACGATAAC 2820

GTGTACATTG ATGACCATAA ACTGCAATCC TATGATGTGA CAATATGAGG AGGATAACTT 2880 

AATGAAACGT GTAATAACAT ATGGCACATA TGACTTACTT CACTATGGTC ATATCGAATT 2940

GCTTCGTCGT GCAAGAGAGA TGGGCGATTA TTAATAGTA GCATTATCAA CAGATGAATT 3000 

TAATCAAATT AAACATAAAA AATCTTATTA TGATTATGAA CAACGAAAAA TGATGCTTGa 3060 

ATCAATACGC TATGTCGATT TAGTCATTCC AGAAAAGGGC TGGGGACAAA AAGAAGACGA 3120

TGTCGAAAAA TTTGATGTAG ATGTTTTTGT TATGGGACAT GACTGGGAAG GTGAATTGA 3180 

CTTCTTAAAG GATAAATGTG AAGTCATTTA TTTAAAACGT ACAGAAGGCA TTTCGACGAC 3240

TAAAATCAAA CAAGAATTAT ATGGTAAAGA TGCTAAATAA ATTATATAGA ACTATCGATA 3300

CTAAACGATA AATTAACTTA GGTTATTATA AAATAAATAT AAAACGGACA AGTTTCGCAG 3360

CTTTATAATG TGCAACTTGT CCGTTTTTAG TATGTTTTAT TTTCTTTTTC TAAATAAACG 3420

ATTGATTATC ATATGAACAA TAAGTGCTAA TCCAGCGACA AGGCATGTAC CACCAATGAT 3480

AGTGAATAAT GGATGTTCTT CCCACATACT TTTAGCAACA GTATTTGCCT TTTGAATAAT 3540

TGGCTGATGA ACTTCTACAG TTGGAGGTCC ATAATCTTTA TTAATAAATT CTCTTGGATA 3600

GTCCGCGTGT ACTTTACCAT CTTCGACTAC AAGTTTATAA TCTTTTTTAC TAAAATCACT 3660 

TGGTAAAACA TCGTAAAGAT CATTTTCAAC ATAATATTTC TTACCATTTA TCCTTTGCTC 3720

ACCTTTAGAC AATATTTTTA CATATTTATA CTGATCAAAT GAGCGTTCCA TTAATGCATT 3780

CCCCATCATA TTACGTTGCT TCTCGCCACC AAGGTTTTTA TAGTCTCCTG CACCCATGAT 3840

AACTTGATTA ATTCTAAATT TACCTCGTTT GGTAGTAATC GTATGGTTGT AATTTGCTGT 3900 

ATCACTTGAT CCAGTTTTTA AACCATCTGT ACCCGGCAAA CTCATTTTTG CACCTTCCAA 3960 

TGAAAAGTTG AATGTGTAAT ACGTAACTGC ATGCGTTGTT GGTGCTAACT GCTTTGTAAA 4020

GTCTAATATT TTAGGTGTCT CTTTAATCAC GTGTAAATCT AAAATGGCAT AGTCTCTAGC 4080

AGTCGTTACA GTACGTTCTT GGTCTTTATA CTTTGTTGGT GCAAATGTAC GTAATCTTGA 4140

ATTTTCAGCA CCCGTTGGAT TGACGAAATG TGTATTTTTC ATTCCGXAG CTTTAGCTTT 4200 

GTTATTCATT AAATCAACGA AATCGCTGGT GTTTTTTGAA ACCTTCTTAG CTAAAATTAA 4260

TGCCGCGGCA TTACTAGAAT TAGATACTGT AATTTGTAAT AGGTCTGCGA TTGTCCATAC 4320

TTGTCCAGGA TATAGTTTCG TATTACTCAA CTCAGGTAGT GTAGACATAA TATATTCTTT 4380

GTTCGTCATT GTGACTGTGT CATCAAGTGA AAGCTGCCCC TTATTTACAG CTTCCAATGT 4440

TAAGTACATT GTCATTAATT TAGTCATAGA CGCTGGAtTC CACTTAGTAT CGATATTGTA 4500

TTGATACAGT AATTGTCCAG TTTGACTTAC ATTAACAGCA CTCGTCGGTT CGTATGCAGC 4560

CGACAAACCT GCATAACCAT ATTGATTTCE TGCTTGTACA GGGGTTACGT CACTGTTAGT 4620

AGCTTGTGCA TATGGTGTCA TAATACTTAA TGTTAAACAT AAAATGATGA TAATAGATAT 4680

TAAATTTTTC ATAAAGCGTT AATCTTCCCT TTTCCAATTC TTAAATATTC CCTAAAAGCA 4740 

ATGGTTATTC CTACTTACGG AAATCATTGC TAATTCACTT CACCTTAATT AAATGTTGA 4800

AAATAAAGTT TTCTGCAGTT AATTTGAAAA ATAATGCAAA TATATTACGT GTGTAGCTAA 4860

AGGTGTTATA ATGTTTGTAC GAAGAGCAAA CTTACTCAAA AGCGATTAAT TTTCATGTTT 4920

TAATATAAAG ACTTTGAGAA GTTATTACAA AAAATGCAAT AGAAATATTC TATCATATAA 4980

ATGTTATGAG CGGTATTTTG GGGCAACACT TTATTTGATT TTTAAAGTTT TGTTGGGAGA 5040

AAGTATATGA TAGAAATGCA TGTATCTATC TAAATGAATT AACTATAAAT TTCAAACAGA 5100

AGAGGTAAAA CTATGAAACG AGAAAATCCA TTGTTTTTCT TATTTAAAAA ACTATCATGG 5160 

CCAGTGGGTC TTATCGTTGC AGCTATCACT ATTTCffCAC TAGGGAGCTT AAGTGGACTA 5220 

TTAGTGCCAC TGTTTACTGG ACGAATTGTA GATAAATTTT CCgTGAGCCA TATCAATTGG 5280

AATCtAATCG CATTATTTGG TGGTATCTTT GTCATCAATG CTTTATTAAG CGGATTAGGT 5340

TTATATTTAT TAAGTAAAAT TGGTGAAAAG ATTATTTATG CGATACGCTC AGTTTTATGG 5400

GAGCATATCA TACAATTAAA AATGCCATTC TTTGACAAAA ATGAAAGTGG TCAATTAATG 5460

AGTCGATTAA CTGACGATAC GAAAGTGATA AATGAATTTA TTTCACAAAA GCTACCTmAC 5520

TTATTACCAT CAATCGTTAC ATtAGTTGGG TCACTAATCA TGTTATTTAT TTTAGATTGG 5580

AAAATGACAT TATTAACATT TATAACGATA CCGATATTCG TTTTaATTAT GATTCCTCTA 5640

GGTCGTATTA TGCAAAAGAT ATCGACAAGT ACACAATCTG AAATTGCAAA CTTCAGTGGT 5700

TTGTTAGGGC GTGTCCTAAC TGAAATGCGT CTTGTTAAAA TATCAAATAC AGAGCGTCTT 5760

GAATTAGATA ATGCACATAA AAATTTGAAT GAAATATATA AATAGGTTT AAAACAGGCT 5820

AAAATTGCGG CAGTTGTACA ACCAATTTCA GGTATAGTTA TGTTGCTAAC AATTGCAATT 5880

ATTTTAGGTT TTGGTGCATT AGAAATTGCG ACTGGTGCAA TCACTGCAGG TACATTAATT 5940

GCAATGATAT TTTATGTTAT TCAGTTATCT ATGCCTTTAA TCAATCTTTC CACGTTAGTT 6000 

ACAGATTATA AAAAGGCAGT CGGTGCAAGT AGTAGAATAT ACGAAATCAT GCAAGAACCT 6060

ATTGAACCGA CAGAAGCTCT TGAAGATTCT GAAAATGTAT TAATTGATGA CGGTGTATTG 6120

TCATTTGAAC ATGTAGACTT TAAATATGAT GTGAAGAAAA TATTAGATGA TGTGTCGTTC 6180 

CAAATCCCAC AAGGTCAAGT GAGTGCTTTT GTAGGCCCTT CTGGGTCTGG TAAAAGTACG 6240

ATATTTAATC TGATAGAACG TATGTATGAA ATTGAGTCAG GTGATATTAA ATATGGCCTT 6300 

GAAAGTGTCT ATGATATCCC GTTATCTAAG TGGCGACGCA AAATTGGATA TGTTATGCAA 6360

TCAAATTCGA TGATGAGTGG TACAATTAGA GACAATATTT TATACGGAAT TAATCGTCAT 6420

GTTTCAGATG AAGAACTTAT TAATTATGCT AAATTAGCGA ACTGTCATGA TTTTATCATG 6480

CAATTTGATG AAGGATATGA CACGCTTGTA GGTGAACGAG GATTGAAACT GTCTGGCGGA 6540

CAACGTCAAC GTATTGATAT TGCTAGAAGT TTTGTTAAAA ATCCTGATAT TTTGTTACTT 6600 

GATGAAGCAA CAGCTAATCT CGATAGTGAA AGTGAATTGA AAATTCAAGA AGCTTTAGAA 6660

ACATTGATGG AAGGTAGAAC AACGATTGTC ATTGCGCATC GTTTGTCTAC AATTAAAAAA 6720

GCCGGTCAAA TTATATTCTT AGACAAAGGA CAGGTAACAG GTAAAGGTAC GCATTCAGAA 6780

CTGATGGCAT CACATGCGAa GTATAAAAAC TTTGTAGTGT CTCAAAAATT AACAGATTAA 6840

TTTTATATAT ATAAGTAAGC TTGGAGCAAA TACACATATA CCATCGAGGA AATTAAAGTG 6900 

TGGCACATTG ATGGATATAG ATGTTAATAA ATTGCTTCAA GCTTTTGTCT ATTTTAAATC 6960 

ATTTGAGAAG TTACGACATA ATAATTCTTA AATTAATGAA ATCGATATTT TAAGAAASA 7020

ATGCTCATGG TATAATACAA GTTATAAGCA AACATACATA TATTAAATAC TGTAGCCACG 7080

AGTCATAATT CTTCATATTT TACATAGCAA TTTAACTGAT TTTAGAGTCC ACGGTACAGA 7140

AGTTTGATAT TTCAATGTTT CTAAATTTTT AAAAAATTAA ATCATAGGTG GGTGCCAAAT 7200 

GTTTTTATTA ATCAACATTA TTGGTCTAAT TGTATTTCTT GGTATTGCGG TATTATTTTC 7260

AAGAGATCGC AAAAATATCC AATGGCAATC AATTGGGATC TTAGTTGTTT TAAACCTGTT 7320

TTTAGCATGG TTCTTTATTT ATTTTGATTG GGGTCAAAAA GCAGTAAGAG GAGCAGCCAA 7380

TGGTATCGCT TGGGTAGTTC AGTCAGCGCA TGCTGGTACA GGTTTTGCAT TTGCAAGTTT 7440

GACAAATGTT AAAATGATGG ATATGGCTGT TGCAGCCTTA TTCCCAATAT TATTAATAGT 7500

GCCATTATTT GATATCTTAA TGTACTTTAA TATTTTACCG AAAATTATTG GAGGTATTGG 7560 

TTGGTTACTA GCTAAAGTAA CAAGACAACC TAAATTCGAG TCATTCTTTG GGATAGAAAT 7620

GATGTTCTTA GGAAATACTG AAGCATTAGC CGTATCAAGT GAGCAACTAA AACGTATGAA 7680

TGAAATGCGT GTATTAACAA TCGCAATGAT GTCAATGAGC TCTGTATCGG GAGCTATTGT 7740

AGGTGCGTAT GTACAAATGG TACCAGGAGA ACTGGTACTA ACGGCAATTC CACTAAATAT 7800

CGTTAACGCG ATTATTGTGT OVTGCTTGTT GAATCCAGTA AGTGTTGAAG AGAAAGAAGA 7860

TATTATTTAC AGTCTTAAAA ACAATGAAGT TGAACGTCAA CCATTCTTCT CATTCCTTGG 7920

AGATTCTGTA TTAGCAGCAG GTAAATTAGT ATTAATCATC ATCGCATTTG TTATTAGTTT 7980

TGTAGCGTTA GCTGATCTAT TTGATCGTTT TATCAATTTG ATTACAGAT TGATAGCAGG 8040

ATGGATAGGC ATAAAAGGTA GTTTCGGTTT AAACCAAATT TTAGGTGTGT TTATGTATCC 8100

ATTTGCGCTA TTACTCGGTT TACCTTATGA TGAAGCGTGG TTGGTAGCAC AACAAATGGC 8160 

TAAGAAAATT GTTACAAATG AATTTGTTGT TATGGGTGAA ATTTCTAAAG ATATTGCATC 8220

TTATACACCA CACCATCGTG CGGTTATTAC AACATTCTTA ATTTCATTTG CAAACTTCTC 8280 

AACGATTGGT ATGATTATCG GTACATTGAA AGGCATTGTT GATAAAAAGA CATCAGACTT 8340

TGTATCTAAA TATGTACCTA TGATGCTATT ATCAGGTATC CTAGTTTCAT TATTAACAGC 8400

AGCTTTCGTT GGTTTATTTG CATGGTAATA TGTCGAAGAG TGACTATGAT AATACATTTT 8460

AACTAATAAA TATGTCCAGG CATGTCGTCT ATTGATATAG GTGAGATGCT TGGACTTTTT 8520

TATTATTGAT ATAAAGGTAT nTAAATATTT TTAAAGTTAC CGAAATTGAA GCATTATAAA 8580

AACCAAGTGC ACATGGTAAT ACACTTGGCT TTTATGGGAA ATGAATATTA TTGSICATAT 8640

GACAGTAAGG ACTAGGTACA GTCATAGTAC TTCGAGCAAA ATTTGTTTTG TTATTATAAA 8700

CAACACAAAG GAGATAACTT CTCTAnTGAA GAAGTTAAAA ACATTATAGC AGACAATGAA 8760

ATGAAAGTAA ATTAAAAAT 8779

(2) INFORMATION FOR SEQ ID NO: 59:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31096 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:

GTTGCAGTAG TCAAAGAATT AAACAAGGTG AAGGcGTGTA GCTTGCACAC CCGAAAATGT 60 

GCGTAAGTTA aCGGATGCAG GACATAAAGT AATTGTTGAA AAAAATGCTG GCATTGGTTC 120

AGGATTTTCT AACGATATGT ATGAAAAAGA AGGCGCTAAG ATCGTAACTC ACGAACAAGC 180

ATGGGAAGCT GATCTTGTTA TCAAAGTAAA AOACCTCAT GAAAGCGAAT ATCAATATTT 240

CAAAAAGAAT CAAATTATCT GGGGATTTTT ACATCTAGCA TCTTCAAAAG AAATAGTAGA 300 

AAAAATGCAA GAAGTTGGTG TAACTGCGAT TAGTGGTGAA ACCATTATAA AAAATGGAAA 360

AGCAGAATTA TTAGCGCCAA TGAGTGCTAT AGCAGGTCAA CGCTCAGCAA TTATGGQ3C 420

TTACTACTCT GAAGCACAAC ATGGTGGTCA AGGTACTTTA GTGACTGGTG TACATGAAAA 480

TGTGGATATA CCTGGTAGTA CATATGTGAT TTTCGGTGGT GGAGTAGCAG CAACAAATGC 540

AGCAAATGTT GCCTTGGGAC TAAATGCTAA AGTAATCATT ATCGAGTTAA ACGATGACCG 600

CATTAAATAT CTTGAAGATA TGTATGCAGA AAAAGATGTC ACAGTAGTCA AATCAACACC 660

AGAAAATTTA GCAGAACAAA TTAAGAAAGC AGATGTATTT ATTTCTACAA TTTTAATTTC 720

AGGTGCGAAA CCGCCAAAAT TGGTTACTCG TGAGATGGTT AAATCAATGA AAAAAGGTTC 780

AGTATTAATC GATATAGCTA TTGACCAAGG TGGAACTATT GAAACAATTA GACCAACTAC 840 

AATTTCTGAT CCAGTGTATG AAGAAGAAGG TGTGATTCAT TATGGTGTAC CAAATCAACC 900 

AGGAGCAGTC CCAAGAACTT CAACAATGGC ATTAGCACAA GGAAATATTG ATTATATATT 960

AGAAATTTGT GACAAAGGCT TAGAACAAGC AATTAAAGAT AATGAAGCCT TAAGTACTGG 1020

TGTAAACATT TACCAAGGAC AAGTGACAAA TCAAGGATTA GCTTCATCAC ATGACCTAGA 1080 

TTATAAAGAA ATATTAAATG TTATCGAATA GATAGTAATT TAAATGAAAT TGAGTGAAAT 1140

GAATATTTTA AATATAGCAT TATAGTTTGG ACTAAAAATT TACAAAACGG AAGGATGTAA 1200

TTATTATGAC AACCAACACA GTTACATTAC AAACAGCACA TATTGTAAGT TTAGGAGATA 1260

TCGAAGAAGC TAAAGCAAGC ATTAAACCAT TTATTCGTCG AACACCTCTA ATTAAATCAA 1320

TGTATTTAAG CCAAAGTATA ACTAAAGGGA ATGTATTTCT AAAATTAGAA AATATGCAAT 1380

TCACAGGATC TTTTAAATTT AGAGGCGCTA gCAATnAAAA TTAATCETT AACAGATGAA 1440

CAAAAAGAAA AAGGCATTAT CGCAGCATCT GCTGGGgAAC CATGCACAAG GTGTTGCTTT 1500

AACAGCTAAA TTATTAGGCA TTGATGCAAC GATTGTAATG CCTGAAACAG CACCACAAGC 1560

GAAACAACAA GCAACAAAAG GCTATGGGGC AAAGGTTATT TTAAAAGGTA AAAACTTTAA 1620

CGAAACTAGA CTTTATATGG AAGAATTAGC GAAAGAAAAT GGCATGACAA TCGTTCATCC 1680

ATATGACGAT AAGTTTGTAA TGGCAGGCCA AGGAACAATT GGTTTAGAAA TTTTAGATGA 1740

TATTTGGAAT GTGAATACAG TCATCGTACC AGTTGGCGGT GGAGGATTAA TTGCAGGTAT 1800

TGCCACCGCA TTAAAATCAT TTAACCCTX: AATTCATATT ATCGGTGTTC AATCTGAGAA 1860 

TGTTCATGGT ATGGCTGAGT CTTTCTATAA GAGAGATTTA ACTGAACATC GAGTGGATAG 1920

CACAATAGCA GATGGTTGTG ATGTAAAAGT TCCTGGTGAA CAAACATATG AAGTAGTTAA 1980 

ACATTTAGTA GATGAATTTA TTCTTGTTAC TGAAGAAGAA ATTGAACATG CTAOAAAGA 2040

TTTAATGCAG CGTGCCAAAA TTATTACTGA AGGTGCAGGC GCATTACCAA CAGCTGCAAT 2100

TTTAAGTGGA AAAATAAACA ATAAATGGCT TGAAGATAAA AATGTTGTTG CATTAGTTTC 2160 

AGGCGGGAAT GTTGACTTAA CTAGAGTTTC AGGTGTCATT GAACATGGAC TGAATATTGC 2220 

AGATACAAGC AAGGGTGTGG TAGGTTAAAA CATTTAATCT TAAAAATGAG GTGTAATTAT 2280 

GTCAAATGGT AAAGAATTAC AAAAAAATAT AGGTTTCTTC TCAGCGTTTG CTATTGTTAT 2340

GGGGACAGTT ATTGGTTCAG GAGTATTCTT TAAAATATCA AACGTAACAG AAGTAACAGG 2400

AACAGCAGGA ATGGCCTTGT TTGTATGGTT CCTAGGGGC ATCATTACCA TTTGTGCGGG 2460

GTTAACAGCA GCAGAACTTG CTGCTGCAAT CCCTGAAACA GGTGGCTTAA CGAAGTATAT 2520 

AGAATATACA TACGGTGATT TCTGGGGCTT CCTATCAGGT TGGGCGCAAT CATTTATTTA 2580 

TTTTCCAGCT AACGTAGCAG CATTGTCTAT CGTATTTGCG ACACAGCTAA TTAATTTATT 2640

CCATTTATCT ATAGGTTCGT TAATACCAAT AGCAATCGCA TCTGCGTTAT CTATTGTGTT 2700 

GATAAATTTC CTAGGTTCAA AAGCAGGCGG AATTTTACAA TCAGTTACTT TAGTAATTAA 2760

ACTGATTCCA ATCATCGTTA TTGTAATTTT TGGTATTTTT CAATCTGGAG ATATCACTTT 2820 

TTCATTAATT CCAACTACAG GTAATTCaGG AAATGGCTTC TTTACAGCAA TTGGTAGTGG 2880

TTTATTAGCA ACTATGTTTG CATATGATGG TTGGATTCAT GTAGGAAATG TTGCGGGGGA 2940

ACTTAAAAAT CCTAAACGCG ATTTACCTTT AGCGATTTCA GTTGGTATCG GTTGTATTAT 3000

GGCTGTGTAT TTATTAATTA ACGCTACATT CTTATTAACG TTGCAATAG AACTACTTGC 3060

TGGTAATTTA AATGCAGCTT CAGATACATC AAAAATATTA TTTGGTGAAA ATGGCGGTAA 3120

GATTATTACA ATCGGTATAT TAATTTCTGT TTATGGTACG ATCAATGGCT ATACTATGAC 3180

TGGTATGCGC GTACCATATG CAATGGCTGA AAGAAAATTA TTGCCATTTA GCCATTTATT 3240

CGCAAAATTA ACAAAATCTG GCGCACCATG GTTTGGCGCA ATTATACAAC TTATAATCGC 3300 

TATCATCATG ATGTCAATGG GAGCATTTGA TACAATTACA AATATGTTAA TCTTTGTTAT 3360

TTGGTTGTTC TATTGTATGT CATTTGTTGC GGTAATAATT TTAAGAAAAC GTGAACCAAA 3420 

TATGGAACGA CCATATAAAG TACCGTTATA TCCGATCATA CCTTTAATTG CTATTTTGGC 3480

AGGATCATTT GTATTAATTA ATACACTGTT TACACAATTT ATATTAGCAA TCATTGGAAT 3540

TCTAATAACA GCACTTGGTA TACCAGTTTA TTACTATAAA AAGAAACAAA AAGCAGCATA 3600

AGGTAAGATA ACTAGCATTG AGAATAAATG GATGGACTAC TAATAAATTT AAAGTTTTAC 3660

ACATTAAAAT CAAAAACCAT TCAATTATTC TATGGAACAG ACAAATTTCT GTTATGGAAT 3720

TTGTCTGTTT TTCAAAAGTA TAGGGAGGCA AATAGAGATG GAAAAGCCGT CAAGAGAGGC 3780

ATTTGAAGGC AATAATAAGT TGTTAATAGG AATTGTTCTA AGTGTAATAA CGTTTTGGCT 3840

ATTTGCACAA TCATTGGTTA ATGTTGTACC AATACTTGAA GATAGTTTCA ATACAGATAT 3900

TGGAACGGTT AATATCGCCG TTAGTATAAC TGCTTTATTT TCAGGAATGT TTGTAGTAGG 3960 

AGCAGGTGGT CTTGCTGATA AATATGGCAG AATTAAACTC ACGAACATTG GTATTATCTT 4020

AAATATATTA GGTTCATTAT TAATCATTAT TX AAAT ATT CCTTTATTAC TTATTATAGG 4080

AAGATTAATT CAAGGACTTT CAGCAGCATG TATTATGCCT GCAACTTTGT CTATTATTAA 4140

GTCATATTAC ATTGGGAAAG ATAGACAACG CGCTTTAAGT TATTGGTCAA TTGGCTCATG 4200

GGGCGGCTCT GGTGTTTGTT CATTTTTTGG AGGTGCAGTT GCAACGCTTT TAGGTTGGG 4260

TTGGATTTTC ATCCTATCAA TTATAATTTC ATTAATTGCA CTGTTTCTTA TTAAAGGCAC 4320

ACCTGAAACT AAATCTAAAT CGATTTCTCT AAATAAATTT GACATTAAAG GTCTGGTTCT 4380

TTTAGTCATT ATGCTCCTCA GTTTAAATAT TTTAATTACT AAAGGATCAG AATTAGGTGT 4440 

AACCTCACTT CTTTTTATTA CTTTATTAGC TATTGCAATT GGATCTTTTA GTTTATTTAT 4500 

AGTTCTTGAA AAGCGTGCTA CAAATCCTTT AATCGATTTT AAATTATTTA AAAATAAAGC 4560

TTACACAGGT GCAACAGCTT CAAACTTTTT GTTAAATGGT GTTGCAGGAA CATTAATAGT 4620

AGCCAACACA TTTGTTCAAA GAGGTTTAGG ATATTCTTCA TTGCAAGCAG GAAGTTTATC 4680

AATCACTTAT TTAGTAATGG TACTAATTAT GATTCGTGTT GGTGAAAAGT TACTTCAAAC 4740 

ACTCGGATGC AAGAAACCAA TGTTAATTGG AACAGGAGTT CTTATTGTCG GAGAATGTCT 4800

CATTTCATTA ACTTTCTTGC CAGAAATATT CTATGTCATT TGTTGTATTA TAGGTTATTT 4860

ATTCTTTGGT TTAGGACTAG GGATATATGC TACACCATCA ACAGATACAG CAATTGCAAA 4920

TGCACCGTTA GAAAAAGTAG GCGTTGCTGC AGGTATCTAT AAAATGGCTT CTGCATTAGG 4980

TGGAGCATTT GGCGTCGCAT TGAGTGGTGC AGTATATGCA ATCGTATCAA ATATGaCAAA 5040

CATTTATACA GGTGcAATGa TTGnCATTAT GGTTaAATGC AGGTATGGGa ATATTATCaT 5100 

TCGTTATCAT TTTGtTACTT GTGcCTAAAC mAAACGACAC TCAATTATGA TAATTGAGAA 5160

TTAAATTGAA ATCATACAAG TCGCTACAAT ATTAAACAAA AATATAAACC GATTCTTATG 5220 

TGTCATTATT TTAAATGAAC ATAGGGATTG GTTTTTTATT ACTCTTTAC GCTACTTTAT 5280 

TTATAATTAT TATAAATTGT CACAAATTCA ATTTACCTTA CAATATATTT TGTGTTATTA 5340

TATTCTGGAG CATAAATAAA TTGTTCAACA CATAGTTGTA ATGTGTTTCA ATACTTTTTG 5400

GATAGATTGC GAAATTGTAT TGAATCGTCA TCGTTTTAAA TTTTTAAATG AGAATGGAAT 5460

GAGCATTACA ATACACAAGC AATCAAAAGT AAATACATTC ACAACACAAC AGAGACATAA 5520

CAACAAGATA AGGAGTGAAC AATAGCTGTG AATTATCGTG ATAAAATTCA AAAGTTTAGT 5580

ATTCGTAAAT ATACAGTTGG TACATTTTCA ACTGTCATTG CGACATTGGT ATTTTTAGGA 5640

TTCAATACAT CACAAGCACA TGCTGCTGffi ACAAATCAAC CAGCAAGCGT GGTTAAACAG 5700

AAACAACAAA GTAATAATGA ACAGACTGAG AATCGAGAAT CTCAAGTACA AAATTCTCAA 5760

AATTCACAAA ATGGTCAATC ATTATCTGCT ACTCATGAAA ATGAGCAACC AAATATTAGT 5820

CAAGCTAATT TAGTAGATCA AAAAGTAGCG CAATCATCTA CTACTAATGA TGASAACCA 5880 

GCATCTCAAA ATGTAAATAC AAAGAAAGAT TCGGCAACGG CTGCGACAAC ACAACCAGAT 5940

AAAGAACAAA GTAAGCATAA ACAAAACGAA AGTCAATCTG CTAATAAAAA TGGAAACGAC 6000 

AATAGAGCGG CTCATGTAGA AAATCATGAA GCAAATGTAG TAACAGCTTC AGATTCATCT 6060

GATAATGGTA ACGTACAACA TGACCGAAAT GAATTACAAG CGTTTTTTGA TGCAAATTAT 6120 

CATGATTATC GCTTTATTGA CCGTGAAAAT GCAGATTCTG GCACATTTAA CTATGTAAAA 6180

GGCATTTTTG ATAAGATTAA TACGTTATTA GGCAGTAATG ATCCAATAAA CAATAAAGAC 6240

TTGCAACTTG CATACAAAGA ATTGGAACAA GCTGTTSCTT TAATTCGTAC AATGCCTCAA 6300

CGTCAACAGA CTAGCCGACG TTCAAATAGA ATTCAAACGC GTTCGGTTGA GTCAAGAGCT 6360

GCAGAGCCTA GATCAGTATC AGACTATCAA AATGCAAATT CATCATATTA TGTTGAAAAT 6420

GCTAATGATG GTTCGGGCTA TCCTGTTGGT ACATATATCa ATGCTTCTAG TAAAGGGGCG 6480

CCATATAATT TACCAACTAC ACCATGGAAT ACATTGAAGG CCTCTGACTC AAAGGAAATT 6540

GCTCTTATGA CAGCGAAACA AACTGGAGAC GGGTACCAAT GGGTTATTAA GTTTAATAAA 6600 

GGACATGCTC CACATCAAAA TATGATCTTT TGGTTTGCAT TACCAGCAGA CCAAGTGCCA 6660

GTAGGAAGAA CTGACTTTGT AACAGTTAAT TCAGATGGAA CAAATGTACA ATGGAGTCAT 6720

GGAGCAGGAG CAGGTGCAAA TAAACCACTT CAACAAATGT GGGAATATGG AGTAAATGAT 6780

CCTCATCGTT CACATGACTT TAAAATAAGA AATAGAAGTG GCCAAGTAAT ATATGACTGG 6840

CCAACTGTCC ATATTTATTC TTTAGAAGAT TTATCTAGAG CGfiTGATTA TTTTAGTGAA 6900

GCTGGAGCGA CACCTGCTAC TAAAGCTTTT GGTAGACAAA ATTTTGAATA TATTAATGGT 6960

CAAAAACCTG CTGAATCACC GGGTGTTCCT AAAGTTTATA CTTTCATCGG TCAAGGTGAT 7020

GCAAGTTATA CAATTTCATT TAAAACACAA GGTCCAACTG TTAATAAATT GTACTATGCA 7080

GCAGGTGGGC GTGCTTTAGA GTACAATCAA TTATTTATGT ACAGTCAACT ATACGTCGAA 7140 

TCAACGCAAG ACCATCAACA ACGTCTTAAT GGTTTAAGAC AAGTGGTTAA TCGTACATAT 7200

CGCATAGGTA CAACTAAACG TGTAGAAGTG AGTCAAGGAA ATGTACAAAC GAAAAAGGTA 7260

TTAGAAAGTA CAAACCTAAA TATAfATGAT TTTGTTGATG ATCCTTTAAG TTATGTTAAG 7320

ACGCCGAGTA ATAAAGTGTT AGGATTTTAT TCGAATAATG CAAATACTAA TGCTTTTAGA 7380 

CCGGGTGGAG CCCAACAATT AAATGAATAT CAATTAAGTC AATTATTTAC TGATCAAAAA 7440

TTACAAGAAG CAGCAAGAAC TAGAAACCCA ATAAGATTAA TGATTGGTTT CGACTATCCT 7500

GATGCTTATG GTAATAGTGA AcTTTAGTTC CTGTTAACTT AACGGTATTA CCTGAAATCC 7560

AACATAATAt TaAATTCTTT AAAAATGACG ATACTCAAAA TATTGCTGAA AAACCATTTT 7620

CAAAACAAGC TGGGCATCCA GTTTTCTATG TATATGCAGG TAACCAAGGG AATGCTTCCG 7680

TGAATTTAGG TGGTAGCGTA ACATCTATTC AACCATTACG TATTAATTTA ACAAGTAATG 7740

AGAATTTTAC AGATAAAGAT TGGCAAATTA CAGGTATTCC GCGTACATTA CACATTGAAA 7800

ACTCGACAAA TAGACCTAAT AATGCCAGAG AACGCAATAT TGAACTTGTT GGTAACTTAT 7860

TACCAGGGGA TTACTTTGGA ACGATACGTT TTSGACGTAA AGAACAATTA TTCGAAATTC 7920

GTGTTAAACC ACATACACCA ACAATTACAA CGACAGCTGA GCAATTAAGA GGTACAGCAT 7980

TACAAAAAGT GCCTGTTAAT ATTTCGGGAA TACCGTTGGA TCCATCGGCA TTGGTTTATT 8040

TAGTTGCACC AACAAATCAA ACTACGAATG GTGGTAGTGA GGCAGATCAA ATACCATTG 8100

GTTATACGAT ACTTGCGACT GGTACACCTG ATGGGGTGCA TAATACAATT ACTATACGAC 8160

CGCAAGATTA TGTTGTATTC ATACCACCTG TAGGTAAACA AATTAGAGCA GTAGTTTATT 8220

ATAATAAAGT AGTTGCATCT AATATGAGTA ATGCTGTTAC TATTTTGCCA GATGACATTC 8280

CACCAACAAT CAATAATCCT GTTGGAATAA ATGCCAAATA CTATCGAGGC GACGAAkCAA 8340

CTTTACAATG GGTGTCTCTG ATAGACATTC TGGTATAAAA AATACAACTA TTACGACATT 8400

GCCGAATGGT TGGACATCAA ATTTAACAAA AGCGGATAAG AATAATGGCT CATTATCTAT 8460

TACAGGTAGA GTGAGTATGA ATCAGGCATT TAACAGTGAT ATTACATTTA AAGTGTCAGC 8520

GACAGaCAAT GTCAATAATA CGACAAATGA TAGTCAATCT AAACATGTTT CAATTCATGT 8580

AGGTAAAATT AGTGAAGATG CTCATCCGAT TGTATTAGGA AATACTGAGA AAGTTGTAGT 8640

AGTCAATCCG ACTGCTGTAT CTAATGATGA AAAGCAAAGC ATAATTACTG CCTTTATGAA 8700

TAAAAACCAA AATATAAGAG GATATTTAGC ATCAACTGAT CCAGTAACTG TCGATAATAA 8760

TGGTAATGTC ACATTACATT ACCGTGATGG CTCATCGACA ACGCTTGATG CTACAAATGT 8820

GATGACATAC GAACCAGTTG TGAAACCTGA ATACCAAACT GTCAATGCTG CTAAAACAGC 8880

AACGGTAACG ATTGCTAAAG GR.CAATCATT TAGTATTGGT GATATTAAAC AATATTTTAC 8940

TTTAAGTAAT GGACAACCTA TTCCAAGTGG CACATTTACA AATATTACAT CTGATAGAAC 9000

TATTCCAACT GCACAAGAAG TTAGTCAAAT GAACGCAGGC ACGCAGTTAT ACCATATAAC 9060

TGCTACAAAT GCGTATCATA AAGATAGTGA AGACTTCTAT ATTAGTTGA AAATCATCGA 9120

TGTGAAACAA CCAGAAGGCG ATCAACGTGT ATATCGTACA TCAACATATG ATTTAACTAC 9180

TGATGAAATC TCAAAAGTAA AACAAGCATT TATTAATGCA AATAGAGATG TAATTACGCT 9240

TGCCGAAGGT GATATTTCAG TTACAAATAC ACCTAATGGT GCTAATGTAA GTACTATTAC 9300

AGTAAATATT AATAAAGGTC GATTAACGAA ATCATTCGCG TCAAACCTAG CTAATATGAA 9360

TTTCTTGCGT TGGGTTAATT TCCCACAAGA TTATACAGTG ACATGGACGA ATGCAAAAAT 9420 

TGCAAACAGA CCAACAGATG GTGGTTTATC ATGGTCTGAT GACCATAAAT CTTTAATTTA 9480

TCGTTATGAT GCTACATTAG GTACTCAAJT TACGACGAAT GATATTTTAA CAATGTTAAA 9540

AGCAACAACT ACAGTGCCTG GATTGCGAAA TAACATTACT GGTAATGAAA AATCACAAGC 9600

AGAAGCTGGC GGAAGACCTA ACTTTAGAAC GACTGGTTAT TCACAATCAA ATGCGACAAC 9660

TGATGGTCAA CGTCAATTTA CGTTGAATGG TCAAGTGATT CAAGTGTTAG ACACATCAA 9720

CCCTTCAAAC GGTTATGGTG GGCAACCTGT TACAAATTCA AATACTCGTG CAAACCATAG 9780

TAACTCAACT GTTGTTAACG TAAACGAACC GGCAGCTAAT GGTGcTGGCG CATTTACAAT 9840

TGACCACGTT GTAAAAAGTA ATTCTACACA TAATGCAAGT GATGCAGTTT ATAAAGCACA 9900

GTTATACTTA ACGCCATATG GTCCAAAACA ATATGTTGAA CATTTAAATC AAAATACAGG 9960

AAATACTACT GACGCTATTA ACATTTATTT TGTACCAAGT GACTTAGTGA ATCCAACAAT 10020

TTCAGTAGGT AATTACACTA ATCATCAAGT GTTCTCAGGT GAAACATTTA CAAATACTAT 10080

TACAGCGAAT GATAACTTTG GTGTGCAATC TGTAACTGTA CCAAATACAT CACAAATTAC 10140

AGGTACTGTT GATAATAACC ATCAACATGT TTCTGCAACG GCACCAAATG TGACATCAGC 10200 

AACTAATAAG ACAATCAATT TATTAGCAAC TGATACAAGC GGCAATACAG CTACAACTTC 10260

GTTCAATGTA ACAGTGAAAC CTTTGCGTGA TAAATATCGA GTTGGTACTT CATCAACGGC 10320

TGCTAATCCT GTGAGAATTG CCAATATTTC GAATAATGCG ACAGTATCAC AAGCTGATCA 10380 

AACGACAATT ATTAATTCGT TAACGTTTAC TGAAACAGTA CCAAATAGAA GTTATGCAAG 10440

AGCAAGTGCG AATGAAATCA CTAGTAAAAC AGTTAGTAAT GTCAGTCGTA CTGGAAATAA 10500

TGCCAATGTg cACAGTAACT GTTACTTATC AAGATGGAAC AACATCAACA GTGACTGTAC 10560 

CTGTAAAGCA TGTCATTCCA GAAATCGTTG CACATTCGCA TTACACTGTA CAAGGCCAAG 10620

ACTTCCCAGC AGGTAATGGT TCTAGTGCAT CAGATTACTT TAAGTTATCT AATGGTAGTG 10680

ACATTGCAGA TGCAACTATT ACATGGGTAA GTGGACAAGC GCfiAATAAA GATAATACAC 10740

GTATTGGTGA AGATATAACT GTAACTGCAC ATATCTTAAT TGATGGCGAA ACAACGCCGA 10800

TTACGAAAAC AGCAACATAT AAAGTAGTAA GAACTGTACC GAAACATGTC TTTGAAACAG 10860

CCAGAGGTGT TTTATACCCA GGTGTTTCAG ATATGTATGA TGCGAAACAA TATGTTAAGC 10920

CAGTAAATAA TTCTTGGTCG ACAAATGCGC AACATATGAA TTTCCAATTT GTTGGAACAT 10980

ATGGTCCTAA CAAAGATGTT GTAGGCATAT CTACTCGTCT TATTAGAGTG ACATATGATA 11040 

ATAGACAAAC AGAAGATTTA ACTATTTTAT CTAAAGTTAA ACCTGACCCA CCTAGAATTG 11100 

ACGCAAACTC TGTGACATAT AAAGCAGGTC TTACAAACCA AGAAATTAAA GTTAATAACG 11160

TATTAAATAA CTCGTCAGTA AAATTATTTA AAGCAGATAA TACACCATTA AATGTCACAA 11220 

ATATTACTCA TGGTAGCGGT TTTAGTTCGG TTGTGACAGT AAGTGACGCG TTACCAAATG 11280

GCGGAATTAA AGCAAAATCT TCAATTTCAA TGAACAATGT GACGTATACG ACGCAAGACG 11340

AACATGGTCA AGTTGTTACA GTAACAAGAA ATGAATCTGT TGATTCAAAT GACAGTGCAa 11400

CAGTAACAGT GACACCACAA TTACAAGCAA CTACTGAAGG CGCTGTATTT ATTAAAGGTG 11460

GCGACGGTTT TGATTTCGGA CACGTAGAAA GATTTATTCA AAACCCGCCA CATGGGGCAA 11520

CGGTTGCATG GCATGATAGT CCAGATACAT GGAAGAATAC AGTCGGTAAC ACTCATAAAA 11580

CTGCGGTTGT AACATTACCT AATGGTCAAG GTACGCGTAA TGTTGAAGTT CCAGTCAAAG 11640 

TTTATCCAGT TGCTAATGCA AAGGCGCCAT CACGTGATGT GAAAGGTCAA AATTTGACTA 11700

ATGGAACGGA TGCGATGAAC TACATTACAT TT5ATCCAAA TACAAACACA AATGGTATCA 11760

CTGCAGCATG GGCAAATAGA CAACAACCAA ATAACCAACA AGCAGGCGTG CAACATTTAA 11820 

ATGTCGATGT CACATATCCA GGTATTTCAG CTGCTAAACG AGTTCCTGTT ACTGTTAATG 11880 

TATATCAATT TGAATTCCCT CAAACTACTT ATACGACAAC GGTTGGAGGC ACTTTAGEA 11940

GTGGTACGCA AGCATCAGGA TATGCACATA TGCAAAATGC TACTGGTTTA CCAACAGATG 12000

GATTTACGTA TAAATGGAAT CGTGATACTA CAGGTACAAA TGACGCAAAC TGGTCAGCTA 12060 

TGAATAAACC GAATGTGGCT AAAGTCGTTA ACGCAAAATA TGACGTCATC TATAACGGAC 12120

ATACTTTTGC AACATCTTTA CCAGCGAAAT TTGTAGTAAA AGATGTGCAA CCAGCGAAAC 12180 

CAACTGTGAC TGAAACAGCG GCAGGAGCGA TTACAATTGC ACCTGGAGCA AACCAAACAG 12240 

TGAATACACA TGCCGGTAAC GTAACGACAT ACGCTGATAA ATTAGTTATT AAACGTAATG 12300

GTAACGTTGT GACGACATTT ACACGTCGCA ATAATACGAG TCCATGGGTG AAAGAAGCAT 12360

CTGCAGCAAC TGTAGCAGGT ATTGCTGGAA CTAATAATGG TATTACTGTT GCAGCAGGTA 12420

CTTTCAACCC TGCTGATACA ATTCAAGTTG TTGCAACGCA AGGAAGCGGA GAGACAGTGA 12480 

GTGATGAGCA ACGTAGTGAT GATTTCACAG TTGTCGCACC ACAACCGAAC CAAGCGACTA 12540

CTAAGATTTG GCAAAATGGT CATATTGATA TCACGCCTAA TAATCCATCA GGACATTTAA 12600

TTAATCCAAC TCAAGCAATG GATATTGCTT ACACTGAAAA AGTGGGTAAT GGTGCAGAAC 12660

ATAGTAAGAC AATTAATGTT GTTCGTGGTC AAAATAATCA ATGGACAATT GCGAATAAGC 12720

CTGACTATGT AACGTTAGAT G3ACAAACTG GTAAAGTGAC GTTCAATGCC AATACTATAA 12780

AACCAAATTC ATCAATCACA ATTACTCCGA AAGCAGGTAC AGGTCACTCA GTAAGTAGTA 12840

ATCCAAGTAC ATTAACTGCA CCGGCAGCTC ATACTGTCAA CACAACTGAA ATTGTGAAAG 12900

ATTATGGTTC AAATGTAACA GCAGCTGAAA TTAACAATGC AGTTCafiTT GCTAATAAAC 12960

GTACTGCAAC GATTAAAAAT GGCACAGCAA TGCCTACTAA TTTAGCTGGT GGTAGCACAA 13020 

CGACGATTCC TGTGACAGTA ACTTACAATG ATGGTAGTAC TGAAGAAGTA CAAGAGTCCA 13080 

TTTTCACAAA AGCGGATAAA CGTGAGTTAA TCACAGCTAA AAATCATTTA GATGATCCAG 13140

TAAGCACTGA AGGTAAAAAG CCAGGTACAA TTACGCAGTA CAATAATGCA ATGCATAATG 13200 

CGCAACAACA AATCAATACT GCGAAAACAG AAGCACAACA AGTGATTAAT AATGAGCGTG 13260

CAACACCACA ACAAGTTTCT GACGCACTAA CTAAAGTTCG TGCAGCACAA ACTAAGATTG 13320

ATCAAGCTAA AGCATTACTT CAAAATAASS AAGATAATAG CCAATTAGTA ACGTCTAAAA 13380 

ATAACTTACA AAGTTCTGTG AACCAAGTAC CATCAACTGC TGGTATGACG CAACAAAGTA 13440 

TTGATAACTA TAATGCGAAG AAGCGTGAAG CAGAAACTGA AATAACTGCA GCTCAACGTG 13500 

TTATTGACAA TGGCGATGCA ACTGCACAAC AAATTTCAGA TGAAAAACAT CGTECGATA 13560

ACGCATTAAC AGCATTAAAC CAAGCGAAAC ATGATTTAAC TGCAGATACA CATGCCTTAG 13620

AGCAAGCAGT GCAACAATTG AATCGCACAG GTACAACGAC TGGTAAGAAG CCGGCAAGTA 13680

TTACTGCTTA CAATAATTCG ATTCGTGCAC TTCAAAGTGA CTTAACAAGT GCTAAAAATA 13740 

GCGCTAATGC TATTATTCAA AAGCCAATAA GAACAGTACA AGAAGTGCAA TCTGCGTTAA 13800 

CAAATGTAAA TCGTGTCAAT GAGCGATTAA CGCAAGCAAT TAATCAATTA GTACCTTTAG 13860

CTGATAATAG TGCTTTAAAA ACTGCTAAGA CGAAACTTGA TGAAGAAATC AATAAATCAG 13920 

TAACTACTGA TGGTATGACA CAATCATCAA TCCAAGATA TGAAAATGCT AAACGTGCGG 13980

GTCAAACAGA ATCAACAAAT GCACAAAATG TTATTAACAA TGGTGATGCG ACTGACCAAC 14040

AAATTGCCGC AGAAAAAACA AAAGTAGAAG AAAAATATAA TAGCTTAAAA CAAGCAATTG 14100 

CTGGATTAAC TCCAGACTTG GCACCATTAC AAACTGCAAA AACTCAGTTG CAAAATGATA 14160 

TTGATCAGCC AACGAGTACG ACTGGTATGA CAAGCGCATC TATTGCAGCA TTTAATGAAA 14220

AACTTTCAGC AGCTAGAACT AAAATTCAAG AAATTGATCG TGTATTAGCC TCACATCCAG 14280

ATGTTGCGAC AATACGTCAA AACGTGACAG CAGCGAATGC CGCTAAATCA GCACTTGATC 14340

AAGCACGTAA TGGCTTAACA GTCGATAAAG CGCCTTTAGA AAATGCGAAA AATCAACTAC 14400

AACATAGTAT TGACACGCAA ACAAGTACAA CTGGTATGAC ACAAGACTCT ATAAATGCAT 14460

ACAATGCGAA GTTAACAGCT GCACGTAATA AGATTCAACA AATCAATCAA GTATTAGCAG 14520

GTTCACCGAC TGTAGAACAA ATTAATACAA ATACGTCTAC AG&AATCAA GCTAAATCTG 14580
ATTTAGATCA TGCACGTCAA GCTTTAACAC CAGATAAAGC GCCGCTTCAA ACTGCGAAAA 14640

CGCAATTAGA ACAAAGCATT AATCAACCAA CGGATACAAC AGGTATGACG ACCGCTTCGT 14700

TAAATGCGTA CAACCAAAAA TTACAAGCAG CGCGTCAAAA GTTAACTGAA ATTAATCAAG 14760

TGTTGAATGG CAACCCAACT GTCCAAAATA TCAATGATAA AGTGACAGAG GCAAACCAAG 14820

CTAAGGATCA ATTAAATACA GCACGTCAAG GTTTAACATT AGATAGACAG CCAGCGTTAA 14880 

CAACATTACA TGGTGCATCT AACTTAAACC AAGCACAACA AAATAATTTC ACGCAACAAA 14940

TTAATGCTGC TCAAAATcAT GctGCGCTTG AAACAATTAA GTCTAACATT ACGGCTTTAA 15000 

ATACTGCGAT GACGAAATTA AAAGACAGTG TTGCGGATAA TAATACAATT AAATCAGATC 15060

AAAATTACAC TGACGCAACA CCAGCTAATA AACAAGCGTA TGATAATGCA GTTAATGCGG 15120

CTAAAGGTGT CATTGGAGAA ACGACTAATC CAACGATGGA TGTTAACACA GTGAACCAAA 15180

AAGCAGCATC TGTTAAATCG ACGAAAGATG CTTTAGATGG TCAACAAAAC TTACAACGTG 15240

CGAAAACAGA AGCAACAAAT GCGATTACGC ATGCAAGTGA TTTAAACCAA GCACAAAAGA 15300

ATGCATTAAC ACAACAAGTG AATAGTGcAC AAAACGTGCA AGCAGTAAAT GATATTAAAC 15360 

AAACGACTCA AAGCTTAAAT ACTGCTATGA CAGGTTTAAA ACGTGGCGTT GCTAATCATA 15420

ACCAAGTCGT ACAAAGTGAT AATTATGTCA ACGCAGATAC TAATAAGAAA AATGATTACA 15480

ACAATGCATA CAACCATGCG AATGACATTA TTAATGGTAA TGCACAACAT CCAGTTATAA 15540

CACCAAGTGA TGTTAACAAT GCTTTATCAA A T! T C AC AAG TAAAGAACAT GCATTGAATG 15600

GTGAAGCTAA GTTAAATGCT GCGAAACAAG AAGCGAATAC TGCATTAGGT CATTTAAACA 15660 

ATTTAAATAA TGCACAACGT CAAAACTTAC AATCGCAAAT TAATGGTGCG CATCAAATTG 15720

ATGCAGTTAA TACAATTAAG CAAAATGCAA CAAACTTGAA TAGTGCAATG GGTAACTAA 15780

GACAAGCTGT TGCAGATAAA GATCAAGTGA AACGTACAGA AGATTATGCG GATGCAGATA 15840 

CAGCTAAACA AAATGCATAT AACAGTGCAG TTTCAAGTGC CGAAACAATC ATTAATCAAA 15900 

CAACAAATCC AACGATGTCT GTTGATGATG TTAATCGTGC AACTTCAGCT GTTACTTCTA 15960 

ATAAAAATGC ATTAAATGGT TATGAAAAAT TAGCACAATC TAAAACAGAT GCTGCAAGAG 16020

CAATTGATGC ATTACCACAT TTAAATAATG CACAAAAAGC AGATGTTAAA TCTAAAATTA 16080

ATGCTGCATC AAATATTGCT GGCGTAAATA CTGTTAAACA ACAAGGTACA GATTTAAATA 16140

CAkCGATGGg TAACTTGCAA GGTGCAATCA ATGATGAACA AACGACGCTT AATAGTCAAA 16200

ACTATCAAGA TGCGACACCT AGTAAGAAAA CAGCATACAC AAATGCGGTA CAAGCTGCGA 16260

AAGATATTTT AAATAAATCA AATGGTCAAA ATAAAACGAA AGATCAAGTT ACTGAAGCGA 16320

TGAATCAAGT GAATTCTGCT AAAAATAACT TAGATGGTAC GCGTTTATTA GATCAAGCGA 16380
nCAAaCAGCA AAACAGCAGT TAAATAATAT GACGCATTTA ACAACTGCAC AAAAAACGAA 16440

TTTAACAAAC CAAATTAATA GTGGTACTAC TGTCGCTGGT GTTCAAACGG TTCAATCAAA 16500 

TGCCAATACA T TAG AT C AAG CCATGAATAC GTTAAGACAA AGTATTGCCA ACAAAGATGC 165 60 

GACTAAAGCA AG TG AAG ATT ACGTAGATGC TAATAATGAT AAGCAAACAG CATATAACAA 16620 

CGCAGTAGCT GCTGCTGAAA CGATTATTAA TGCTAATAGT AATCCAGAAA TGAATCCAAG 16680 

TACGATTACA CAAAAAGCAG AGCAAGTGAA TAGTTCTAAA ACGGCACTTA ACGGTGATGA 16740 

AAACTTAGCT GCTGCAAAAC AAAATGCGAA AACGTACTTA AACACATGA CAAGTATTAC 16800 

AGATGCTCAA AAGAACAATT TGATTAGTCA AATTACTAGT GCGACAAGAG TGAGTGGTGT 168 60 

TGATACTGTA AAACAAAATG CGCAACATCT AGACCAAGCT ATGGCTAGCT TACAGAATGG 16920 

TATTAACAAC GAATCTCAAG T G AAAT CAT C TGAGAAATAT CGTGATGCTG ATACAAATAA 1698 0 

ACAACAAGAG TATGATAATG CTATTACTGC AGCGAAAGCG ATTTTAAATA AATCGACAGG 17040 

TCCAAACACT GCGCAAAATG CAGTTGAAGC AGCATTACAA CGTGTTAATA ATGCGAAAGA 17100 

TGCATTGAAT GGTGATGCAA AATTAATTGC AGCTCAAAAC GCAGCGAAAC AACATTTAGG 17160 

TACTTTAACG CATATCACTA CAGCTCAACE TAATGATTTA AC AAAT C AAA TTTCACAAGC 17220 

TACAAACTTA GCTGGTGTTG AATCTGTTAA ACAAAATGCG AATAGTTTAG ATGGTGCTAT 17280 

GGGTAACTTA CAAACGGCTA TCAACGATAA GT CAGGAAC A TTAGCGAGCC AAAACTTCTT 1734 0 

GGATGCTGAT GAGCAAAAAC GTAATGCATA CAATCAAGCT GTATCAGCAG CCGAACCAT 174 00 

TTTAAATAAA CAAACTGGAC CGAATACAGC G AAAAC AG C A GTCGAACAAG CACTTAATAA 17 4 60 

TGTTAATAAT GCGAAACATG CATTAAATGG TACGCAAAAC TTAAACAATG CGAAACAAGC 17520 

AGCGATTACA GCAATCAAT G GCGCATCTGA TTTAAATCAA AAACAAAAAG ATGCATTAAA 17580 

AGCACAAGCT AATGGTGCTC AACGCGTATC TAATGCACAA GATGTACAGC ACAATGCGAC 17 640 

TGAACTGAAC ACGGCAATGG GC ACAT T AAA ACATGCCATC GCAGATAAGA CGAATACGTT 17 7 00 

AGCAAGCAGT AAATATGTTA ATGCCGATAG CACTAAACAA AATGCTTACA CAACTAAAGT 17 7 60 

TACCAATGCT GAAC AT AT T A TTAGCGGTAC GCCAAGGTT GTTACGACAC C T T C AG AAG T 17 820 

AACAGCTGCA GCTAATCAAG TAAACAGCGC GAAACAAGAA TTAAATGGTG ACGAAAGATT 17 880 

ACGTGAAGCA AAAC AAAAC G CCAATACTGC TATTGATGCA TTAACACAAT TAAATACACC 17 940 

TCAAAAAGCT AAATTAAAAG AACAAGTGGG ACAAGCCAAT AGATTAGAAG AC G T AC AAAC 18000 

TGTTCAAACA AATGGACAAG CATTGAACAA TGCAATGAAA GGCTTAAGAG ATAGTATTGC 18060 

TAACGAAACA ACAGTCAAAA CAAGTCAAAA C TAT AC AG AC GCAAGTCCGA ATAACCAATC 18120 

AACATATAAT AGCGCTGTGT CAAATGCGAA AGGTATCATT AATCAAACTA ACAATCCGAC 18180 

TATGGATACT AGTGCGAITA CCCAAGCTAC AAC AC AAG T G AATAATGCTA AAAATGGTTT 18240 
AAACGGTGCT GAAAACTTAA GAAATGCACA AAACACTGCT AAGCAAAACT TAAATACATT 18300

ATCACACTTA ACAAATAACC AAAAATCTGC CATC T CATC A CAAATTGATC GTGCAGGTCA 18360 

TGTGAGTGAG GTAACTGCTA CTAAAAATGC AG C AAC T G AG TT&ATACGC AAATGGGTAA 184 20 

CTTGGAACAA GCTATCCATG AT C AAAAC AC AGTTAAACAA AGTGTTAAAT TTACTGATGC 18 4 80 

AGATAAAGCT AAACGTGATG CGTATACAAA TGCGGTAAGC AGAGCTGAAG CAATTCTGAA 1854 0 

TAAAACGCAA GGTGCAAATA CGTCTAAACA AGATGTTGAA GCGGCTATTC AAAATGTTTC 18 600 

AAGTGCTAAA AATGCATTGA ATGGTGATCA AAACGTTACA AATGCGAAGA ATGCAGCTAA 18 660 

AAATGCATTA AATAACTTAA CGTCAATTAA TAATGCACAA AAACGTGACT T AAC AAC T AA 18720 

AATTGATCAA GCAACAACTG TAGCTGGTGT TGAAGCTGTA TCTAATACGA GTACACAATT 18780 

GAAtACAGCG ATGGCTAACT TGCASVATGG TATTAATGAT AAAACAAATA CACTAGCAAG 1884 0 

T G AAAAC TAT CATGATGCTG ATTCAGATAA GAAAACTGCT TATACTCAAG CCGTTACGAA 18 900 

CGCAGAAAAT ATTTTAAATA AAAATAGTGG ATCAAATTTA G AC AAAAC T G CCGTTGAAAA 18 960 

CGCGTTGTCA CAAGTTGCTA ATGCGAAAGG TGCCCTAAAT GGTAACCATA ATTTAGAGCA 19020

AGCTAAATCA AATGCAAACA CT ACT AT AAA CGGACTTCAA CATTTAACAA CTGCTCAAAA 19080 

AG AT AAAT T G AAACAACAAG TGCAACAAGC ACAAAATGTT GCAGGTGTAG ATACTGTTAA 1914 0 

ATCAAGTGCC AAC AC AT T AA ATGGTGCTAT GGGTACGTTA AGAAATAGCA TACAAGATAA 19200 

CACAGCTACG AAAAATGGCC AAAACTATCT TGATGCTACA GAACGTAACA AAACAAACTA 192 60 

TAACAATGCT GTTGATAGTG CTAATGGTGT CATTAATGCA ACAAGCAATC CAAATATGGA 19320 

TGCTAATGCA ATTAACCAAA TCGCTACACA AGTGACATCA ACGAAAAATG CAT T AGATGG 19380 

TACACATAAT TTAACGCAAG CGAAACAAAC A(£AACAAAT GCCATCGATG GTGCTACTAA 19440 

CTT AAAT AAA GCGCAAAAAG ATGCGTTAAA AGCACAAGTT ACAAGTGCGC AACGTGTTGC 19500 

AAATGTAACA AGTATCCAAC AAACTGCAAA TGAACTTAAT ACAGCTATGG GTCAATTACA 19560 

ACATGGTATT GATGATGAAA AT G C AAC AAA ACAAACTCAA AAATATCGTG ACGcTGAEA 19 620 

AAGTAAGAAA ACTGCTTATG ATCAAGCTGT AGCTGCTGCG AAAGCAATTT TAAATAAACA 19680 

AACAGGTTCA AATTCAGATA AAGCAGCAGT TGACCGTGCA TTACAACAAG TAACAAGTAC 19740 

GAAAGATGCA TTGAATGGTG ATGCAAAACT GGCAGAAGCG AAAGCGGCAG CTAAACAAAA 198 00 

CTTAGGCACT TTAAACCATA TTACGAATGC ACAACGTACT GACTTAGAAG GCCAAATCAA 198 60 

TCAAGCGACG ACTGTTGATG GCGTTAATAC TGTAAAAACA AATGCCAATA CATTAGACGG 19920 

CGCAATGAAT AGCTTACAAG GTTCAATCAA TGATAAAGAT GCGACATTAA GAAATCAAAA 19 980 

TTATCTTGAT GCGGATGAAT CAAAACGAAA TGCATATACG CAAGCTGTCA CAGCGGCTGA 20040
AGGCATTTTA AATAAACAAA CTGGTGGTAA CACATCTAAA GCAGACGTTG ATAATGCATT 20100

AAATGCAGTT ACAAGAGCGA AAGcGgCTTT AAATGGTGCT GACAACTTAA GAAATGCGAA 20160 

AACTTCAGCA ACAAATACGA TTGATGGTTT ACCTAACTTA ACACAATTAC AAAAAGACAA 20220

CTTGAAGCAT CAAGTTGAaC AAGCGCAAAA TGTAGCAGGT GTAAATGGTG TTAAAGATAA 20280 

AGGTAATACG TTAAATACTG CCATGGGTGC ATTACGTACA AGTATCCAAA ATGATAATAC 2 034 0 

G AC G AAAAC A AGTCAAAATT ATCTTGATGC ATCTGACAGC AACAAAAATA AT T AC AAT AC 204 00 

TGCTGTAAAT AATGCAAATG GTGTTATTAA TGCAACGAAC AATCCAAATA TGGATGCTAA 204 60 

TGCGATTAAT GGCATGGCAA ATCAAGTCAA TACAACAAAA GCAGCGTTAA ATGGTGCACA 20520 

AAACTTAGCT CAAGCTAAAA CAAATGCGAC G AAC AC AAT T AACAACGCAC ATGACTTAAA 2058 0 

CCAAAAACAA AAAGATGCAT TAAAAACACA AGTTAACAAT GCACAAGTG TATcTGATGC 2 0 640 

AAATAACGTT CAACACACTG CAACTGAATT GAACAGTGCG ATGACAGCAC TTAAAGCAGC 2 0700 

TATTGCTGAT AAAGAAAGAA CAAAAGCAAG CGGTAATTAT GTCAATGCTG ATCAAGAAAA 2 07 60 

ACGTCAAGCG TATGATTCAA AAGTGACTAA CGCTGAAAAT ATCATTAGTG GTACACCGAA 2 0820 

TGCGACATTA ACAGTCAATG ACGTAAATAG TGCGGCATCA CAAGTCAATG CGGCT AAAAC 208 8 0 

AGCATTAAAT GGTGATAACA ACTTACGTGT AGCGAAAGAG CATGCCAACA AT ACAAT TGA 2094 0 

CGGCTTAGCA CAATTGAATA AT G C AC AAAA AGCAAAATTA AAAGAACAAG TTCAAAGTGC 21000 

AACTACATTA GATGGTGTTC AAACTGTTm AAATAGTTCT CAAACGTTGA ATACAGCGAT 21060 

GAAAGGCTTA AGAGATAGTA TTGCGAATGA AGCAACAATT AAAGCAGGTC AAAAC T AC AC 2112 0 

TGACGCAAGT C C AAAT AAT C GTAACGAGTA CGACAGTGCA GTTACTGCAG CAAAAGCAAT 21180 

CATTAATCAA ACATCGAACC CAACGATGGA ACCAAATACT ATTACGCAAG TAAETCACA 2124 0 

AG T G AC AAC T AAAGAACAGG CATTAAATGG TGCGCGAAAC TTAGCTCAAG CTAAGACAAC 21300 

TGCGAAAAAC AACTTGAATA ACTTAACATC AAT T AAC AAT GCACAAAAAG ATGCGTTAAC 21360 

GCGTAgcATT GATGGTGCAA CAACAGTAGC TGGTGTAAAT CAAGAAACTG CAAAAGCAAC 214 20 

AG AAT T AAAT AACGCAATGC ATAGTTTACA AAATGGTATC AAT GAT GAGA CACAAACAAA 214 80 

AC AAAC T C AG AAATACCTAG ATGCAGAGCC AAG T AAG AAA TCAGCTTATG ATCAAGCAGT 2154 0 

AAATGCAGCG AAAGCAATTT TAACAAAAGC TAGTGGTCAA AATGTAGACA AAGCAGCAGT 21600 

TGAACAAGCA TTGCAAAATG TGAACAGTAC GAAGAGGCG TTGAACGGTG ATGCGAAATT 21660 

AAATGAAGCT AAAGCAGCTG CGAAACAAAC GTTAGGTACA T T AAC AC AC A TTAATAATGC 217 20 

ACAACGTACA GCGTTAGACA ATGAAAT T AC ACAAGCAACA AATGTTGAAG GTGTTAATAC 217 80 

AGTTAAAGCC AAAGCGCAAC AATTAGATGG TGCTATGGGT C AAT TAG AAA CATC AAT TCG 218 4 0 

T G AT AAAGAC ACGACGTTAC AAAGTCAAAA TTATCAAGAT GCTGATGATG C T AAAC G AAC 21900 
TGCTTATTCT CAAGCAGTAA ATGCAGCAGC AACTATTTTA AATAAAACAg CTGGCGGTAA 21960

TACACCTAAA GCAGATGTTG AAAGAGCAAT GCAAGCTGTT ACACAAGCAA ATACTGcATT 22020 

AAACGGTATT CAmAACTTAG ATCGTGCGAA ACArGCTGCT AACACAGCGA TTACAAATGC 220 8 0 

TTCGGACTTA AATACAAAAC mAAAAGAAGC ATTAAAAgCA CAAGTAACAA GTGCAGGACG 2214 0 

TGTATCTGCA GCAAATGGTG TTGAACATAC TGCGACTGAA TTAAATACTG CGATGACAGC 22200

TTTAAAGCGT GCCATTGCTG ATAAAGCTGA GACAAAAGCT AGEGTAACT ATGTCAATGC 222 60 

TGATGCGAAT AAACGTCAAG CATATGATGA AAAAGTTACA GCTGCCGAAA ATATCGTTAG 22320 

TGGTACACCA ACACCAACGT TAACACCAGC AGATGTTACA AATGCAGCAA CGCAAGTAAC 22380 

GAATGCTAAG AC GC AG T T AA ACGGTAATCA TAATTTAGAA GTAGCGAAAC AAAATGCTAA 22440 

CACTGCAATT GATGGTTTAA CTTCTTTAAA TGGTCCGCAA AAAGCAAAAC T T AAAGAAC A 22500 

AGTGGGTCAA GCGACGACGT TGCCAAATGT TCAAACTGTT CGTGATAATG CACAAACATT 22560 

AAACACTGCA ATGAAAGGTC TACGAGATAG CATTGCGAAT GAAGCAACGA TTAAAGCAGG 22620 

TCAAAACTAC ACAGATGCAA GTCA2AACAA AC AAAC T G AC TACAACAGTG CAGTCACTGC 22 680 

AGCAAAAGCA ATCATTGGTC AAAC AAC TAG TCCATCAATG AATGCGCAAG AAATTAATCA 2274 0 

AG C G AAAG AC CAAGTGACAG C T AAAC AAC A AGCGTTAAAC GGTCAAGAAA ACTTAAGAAC 22800 

TGCGCAAACA AATGCGAAgC AAC AT T T G AA CGGCTTAAGT GACTTAACTGACGCTCAAAA 228 60 

AGATGCAGTG AAACGTCAAA TCGAAGGTGC AACGCATGTT AATGAAGTAA CACAAGCACA 22 920 

AAATAATGCG GATGCaTTAA ATACAGCTAT GACGAACTTG AAAAATGGTA TTCAAGATCA 22980 

GAATACGATT AAGCAAGGTG TTAACTTCAC TGATGCCGAC GAAGCGAAAC GTAATGCATA 2304 0 

TACAAATGCA GTGACGCAAG C T G AAC AAA T T T T AAAT AAA GCACAAGGTC CAAATACTTC 2 3100 

AAAAGACGGT GTCGAAACTG CGTTAGAaAA TGTACAACGT GCTAAAAACG AATTGAACGG 23160 

TAATCAAAAT GTTGCGAACG CTAAGACAAC TGCGAAAAAT GCATTGAATA ACCTAACATC 23220 

AATTAATAAT GCACAAAAAG AAGCATTGAA ACACAAATT GAAGGTGCGA CAACAGTTGC 23280 

AGGTGTAAAT CAAGTGTCTA CAACGGCATC TGAATTAAAT AC AG C AAT G A GCAACTTACA 2334 0 

AAATGGTATT AATGATGAAG CAGCTACAAA AGCAGCGCTT AATGGTACTC AAAACCTTGA 23400 

AAAAGCTAAA CAACACGCAA ATACAGCAAT TGACGGTTTA AGCCATTTAA CAAATGOAA 234 60 

AAAAGAGGCA TTAAAACAAT TGGTACAACA ATCGACTACT GTTGCAGAAG CACAAGGTAA 2 352 0 

TGAGCAAAAA GC AAACAAT G TTGATGCAGC AATGGACAAA TTACGTCAAA GTATTGCAGA 2 358 0 

TAATGCGACA ACAAAACAAA ACCAAAATTA TACTGATGCA AG T C AGAAT A AAAAGGATGC 2364 0 

GTACAATAAT GCTGTCACAA CTGCACAAGG TATTATTGAT CAAACTACAA GTCCAACTTT 2 37 00 
AGATCCGACT GTTATCAATC AAGCTGCTGG ACAAGTAAGC ACAACTAAAA ATGCATTAAA 23760

TGGTAATGAA AACCTAGAGG CAGCGAAACA ACAAGCGTCA CAATCATTAG GTTCATTAGA 2 38 20 

TAACTTAAAT AATGCGCAAA AACAAACAGT TACTGATCAAATTAATGGCG CGCATACTGT 238 8 0 

TGATGAAGCA AATCAAATTA AGCAAAATGC GCAAAACTTA AATACAGCGA TGGGTAACTT 2 394 0 

GAAACAAGCG ATAGcTGACA AAGATGCTAC GAAAGCGACA GTTAACTTCA CTGATGCAGA 2 4 000 

TCAAGCAAAA CAACAAGCAT ATAACaCTGC TGTTACAAAT GCTGAAAATA TCATTTCAAA 24060

AGCTAATGGC GGCAATGCAA CACAAGCTGA AGTTGAACAA GCAATCAAAC AAGTTAATGC 2 4120 

TGCAAAACAA GCATTAAATG GTAATGCCAA CGTTCAACAT GCAAAAGACG AAGCAACAGC 24180 

ATTAATTAAT AGCTCTAATG ACCTTAACCA AG C AC AAAAA GACGCATTAA AACAACAAGT 24240 

TCAAAATGCA ACT ACT GT AG CTGGTGTAAA CAATGTTAAA CAAACAGCAC AAG AG T T AAA 2 4 300 

CAATGCTATG ACACAATTAA AACAAGGCAT T GCAGAT AAA GAACAAACAA AAGCTGATGG 2 4 360 

TAACTTTGTC AATGCAGATC CTGATAAGCA AAATGCATAT AATCAAGCAG TAGCGAAAGC 24 4 20 

TGAAGCATTA ATTAGTGctA CGCCTGATGT TGTCGTTACA CCTAGCSAA TTACTGCAGC 24 4 80 

GTTAAATAAA GTTACGCAAG CTAAAAATGA TTTAAATGGT AATACAAACT TAGCAACGGC 24 540 

GAAACAAAAT GTTCAACATG C TAT T GAT C A ATTGCCAAAC TTAAACCAAG CGCAACGTGA 24 600 

TGAATACAGC AAACAAATCA CGCAAGCAAC ACTTGTACCA AACGTCAATG CTATTCAACA 2 4 660 

AGCGGCGACA ACGCTTAATG ACGCGATGAC ACAATTGAAA CAAGGTATTG CGAATAAAGC 24720 

AC AAAT T AAA GGTAGCGAGA ACTATCACGA TGCTGATACT GACAAGCAAA CAGCATATGA 24780 

TAATGCAGTA ACAAAAGCAG AAGAATTGTT AAAACAAACA ACAAATCCAA CAATGGATCC 24840 

AAATACAATT CAACAAGCAT TAACTAAAQ? G AAT G AC AC A AAT CAAGC AC TTAACGGTAA 24 900 

TCAAAAATTA GCTGATGCCA AACAAGATGC TAAGACAACA CTTGGTACAC TAG AT CAT T T 24 960 

AAATGATGCT CAAAAACAAG CGCTAACAAC TCAAGTTGAA CAAGCACCAG ATATTGCAAC 2 5 020 

AGTTAATAAT G T T AAG C AAA ATGCTCAAAA TCTGAATAAT GCTATGACTA ACTAAACAA 25 08 0 

TGCATTACAA GATAAAACTG AGAC AT T AAA TAGCATTAAC TTTACTGATG C AG AT C AAG C 2 514 0 

TAAGAAAGAT GCT TAT ACTA ATGCGGTTTC ACATGCAGAA GGTATTTTAT C T AAAG C AAA 2 52 00 

TGGCAGCAAT GCAAGTCAAA CTGAAGTGGA AC AAG C G AT G CAACGTGTGA ACGAAGCGAA 2 52 60 

ACAAGCATTG AATGGTAATG AC AAT G T AC A ACGTGCAAAA GATGCAGCGA AACAAGTGAT 25320 

TACAAATGCA AATGATTTAA ATCAAGCAAT G AC AC AAT T G AAACAAGGTA TTGCAGATAA 2 5380 

AGACCAAACT AAAG C AAAT G GTAACTTTGT CAATGCTGAT AC T GAT AAG C AAAATGCTTA 2 5440 

CAACAATGCG GTAGCACATG C T G AAC AAAT AATTAffGGT AC AC C AAAT G CAAACGTGGA 2 5500 

TCCACAACAA GTGGCTCAAG CGTTACAACA AGTGAATCaA GCTAAGGGTG ATTTAAACGG 2 55 60 
TAACCATAAC TTACAAGTTG CTAAAGACAA TGCAAATACA GCCATTGATC AGTTACCAAA 25620

CTTAAATCAA CCACAAAAAA CAGCATTAAA AG AC C AAG T G TCGCATGCAG AACTTGTTAC 25 680 

AGGTGTTAAT GCTAT TAAGC AAAATGCTGA TGCGTTAAAT AATGcAATGG GTACATTGAA 25740 

ACAACAAATT CAAGCGAACA GTCAAGTACC AC AG T C AG T T GACTTTACAC AAGCGGATCA 2 58 00 

AGACAAACAA CAAGCATATA ACAATGCGGC TAACCAAGCG CAACAAATCG CAAATGGCAT 258 60 

ACCAACACCT GTATTGAGGC CTGATACAGT AACACAAGCA GTGACAACTA TGAATCAAGC 25920 

GAAAGATGCA TTAAACGGTG ATGAAAAATT AGCACAAGCG AAACAAGAAG CTTTAGCAAA 2 5 980 

TCTTGATACG TTACGCGATT TAAATCAACC ACAACGTGAT GCATTACGTA ACCAAATCAA 2 604 0 

TCAAGCACAA GCGTTAGCTA CAGTTGAACA AACTAAACAA AAECACAAA ATGTGAATAC 26100 

aGCaATGAGT AACTTGAAAC aAGGTATTGC aAACAAAGAT ACTGTCAAAG CAAGTGAGAA 2 6160 

CTATCATGAT GCTGATGCCG ATAAGCAAAC AGCATATACA AATGCAGTGT CTCAAGCGGA 26220 

AGGTATTATC AATCAAACGA CAAATCCAAC GCTTAACCCA GATGAAATAA CACGTGCATT 2 6280 

AACTCAAGTG ACTGATGCTA AAAATGGCTT AAACGGTGAA GCTAAATTGG CAACTGAAAA 2 634 0 

GCAAAATGCT AAAGATGCCG TAAGTGGGAT GACGCATTTA AACGATGCTC AAAAACAAGC 2 64 00 

ATTAAAAGGT CAAATCGATC AATCGCCTGA AATTGCTACA GTGAACCAAG TTAAACAAAC 264 60 

AGCAACGAGC C TAG AT C AAG CAATCEATCA AT TAT C AC AA GCTATTAATG ATAAAGCTCA 2 652 0 

AACATTAGCG GACGGTAATT ACTTAAATGC AGATCCTGAC AAACAAAATG CGTATAAACA 26580 

GGCAGTAGCA AAAGCTGAAG CATTATTGAA TAAACAAAGT GGTACTAATG AAGTACAAGC 2 664 0 

ACAAGTTGAA AGCATCACTA ATGAAGTGAA CGCAGCGAAA CAAGCATTAAATGGTAATGA 2 67 00 

CAATTTGGCA AATGCAAAAC AACAAGCAAA ACAACAATTG GCGAACTTAA C AC AC T T AAA 2 67 60 

TGATGCACAA AAAC AAT CAT TTGAAAGTCA AATTACACAA GCGCCACTTG TTACAGATGT 26820 

CACTACGATT AATCAAAAAG CACAAACGTT AGATCATGCG ATGGAATTAT TAAGAAATAG 2 6880 

TGTTGCGGAT AATCAAACGA CATTAGCGTC TGAAGAT TAT CATGATGCAA CTGCGCAAAG 2 694 0 

AC AAAAT G AC TATAACCAAG CTGTAACAGC TGCTAATAAT ATAATTAATC AAACTACATC 27000 

GCCTACGATG AATCCAGATG ATGTTAATGG TGCAACGACA CAAGTGAATA ATACGAAAGT 27060 

TGCATTAGAT GGTGATGAAA ACCTTGCAGC A El T AAAC AA CAAGCAAACA AC AG AC T T G A 27120 

TCAATTAGAT CATTTGAATA ATGCGCAAAA GCAACAGTTA CAATCACAAA TTACGCAATC 2 718 0 

ATCTGATATT GCTGCAGTTA ATGGTCACAA AC AAAC AG C A GAATCTTTAA ATACTGCGAT 2 7240 

GGGTAACTTA ATTAATGCGA TTGCAGATCA TCAAGCCGTT GAACAACGTG GTAACTTAT 27 300 

CAATGCTGAT ACTGATAAAC AAACTGCTTA TAATACAGCG GTAAATGAAG CAGCAGCAAT 27360 
GATTAACAAA CAAACTGGTC AAAATGCGAA CCAAACAGAA GTAGAACAAG CTATTACTAA 27420

AGTTCAAACA ACACTTCAAG CGTTAAATGG AGACCATAAT TTACAAGTTG CTAAAACAAA 274 80 

TGCGACGCAA GCAATTGATG CTTTAACAAG CTTAAATGAT CCTCAAAAAA CAGC AT T AAA 2754 0 

AGACCAAGTT ACAGCTGCAA CTTTAGTAAC TGCAGTTCAT CAAATTGAAC AAAATGCGAA 27 600 

TACGCTTAAC CAAGCAATGC ATGGTTTAAG ACAGAGCATT CAAGATAACG CAGCAACTAA 27660 

AGCAAATAGC AAATATATCA ACGAAGATCA AC C AG AGC AA C AAAAC T AT G ATCAAGCTGT 27 720 

TCAAGCCGCA AATAATATTA TCAATGAACA AACTGCAACA TTAGATAATA ATGCGATTAA 27780 

TCAAGCAGCG ACAACTGTGA AT AC AAC G AA AGCAGCATTA CATGGTGATG TGAAGTTACA 27 84 0 

AAATGATAAA GATCATGCTA AGCAAACGGT TAGTCAATTA GCACATCTAA ACAATGCACA 27900

AAAAC AT ATG GAAGATACGT TAATTGATAG TGAAACAACT A G AAC AG C AG TTAAGCAAGA 27 960 

TTTGACTGAA GCACAAGCAT TAGATCAACT TATGGATGCA T T AC AAC AAA GTATTGCTGA 2 8 020 

CAAAGATGCA ACACGTGCGA GCAGTGCATA TGTCAATGCA GAACCGAATA AAAAACAATC 2808 0 

CTATGATGAA GCAGTTCAAA ATGCTGAGTC TATCATTGCA GGATTAAATA ATCCAACTAT 2814 0 

CAATAAAGGT AATGTATCAA GTGCGACTCA AGCAGTAATA TCATCTAAAA ATGCATTAGA 28 200 

TGGTGTTGAA CGATTAGCTC AAGATAAGCA AACTGCTGGA AATTCTCTAA ATCATTTAGA 282 60 

TCAATTAACA CCAGCTCAAC AACAAGCGCT AG AAAAT C AA ATTAATATG CAACAACTCG 2 8 320 

TGATAAAGTG GCTGAAATCA TTGCACAAGC GCAAgC AT t A AATGAAGCGA TGAAAGCATT 2 8 380 

AAAAGAAAGT ATTAAGGATC AACCACAAAC TGAAGCAAGT AGTAAATTTA TTAACGAGGA 284 4 0 

TCAAGCGCAA AAAGATGCTT ATACGCAAGC AGTACAACAC GCGAAAGATT T GAT T AAC AA 28 50 0 

AACAACTGAT CCTACATTAG CTAAATCAAT CATTGATCAA GCGACACAGG CAGTGACAGA 28 5 60 

TGCTAAAAAC AATTTACATG GTGATCAAAA ACTAGCTCAA GATAAGCAAC GTGCAACAGA 2 8 620 

AAC G T T AAAT AACTTGTCTA ACTTGAATAC ACCACAACGT CAAGCACTTG AAAATCAAAT 2 8 680 

TAATAATGCA GCAACTCGTG GCGAAGTAGT ACAAAAATTA ACTGAAGCAC AAGCACTTAA 28 74 0 

CCAAGCAATG GAAGCTTTAC G T AAT AG CAT TCAAGATCAA CAGCAAACGG AAGCGGGTAG 28 8 00 

CAAGTTTATC AATGAAGATA AaCCaCmAAA AGrTGCTTAC CAAGCAGCAG TTCAAAATGC 28 8 60 

AAAAG AT T T A ATTAATCAAA CTAACAATCC AACGCTTGAT AAAGCACAAG TTGACAATT 2 8 920 

GACACAAGCT GTTAACCAAG CTAAAGATAA CCTACACGGT GATCAAAAAC TTGCAGACGA 2 8 980 

TAAACAACAT GCGGTTACTG ATTTAAATCA ATT AAAT GGT TTGAATAATC CGCAACGTCA 2 9040 

AGCACTTGAA AGCCAAATAA ACAACGCAGC AACTCGTGGC GAAGTAGCAC AAAAATTAGC 2 9100 

TGAAGCAAAA GCGCTTGATC AAGCAATGCA AGCATTACGT AATAGTATTC AAGATCAACA 2 9160 

ACAAACAGAA TCTGGTAGCA AGTTTATCAA T G AAG AT AAA CCGCAAAAAG ATGCTTACCA 2 9220 
AGCAGCAGTT CAAAATGCAA AAGATTTAAT TAACCAAACA GGTAATCCAA CACTCGACAA 29280

ATCACAAGTA GAACAATTGA CACAAGCAGT AACAACTGCA AAAGATAATC TACATGGTGA 2 934 0 

TCAAAAACTT GCTCGTGATC AACAACAAGC AGTAACAACT GTAAATGCAT TGCCAAACTT 2 94 00 

AAATCATGCA CAACAACAAG CATTAACTGA TGCTATAAAT GCAGCGCCTA CAAGAACAGA 294 60 

GGTTGCACAA CATGTTCAAA CTGCTACTGA ACTTGATCAC GCGATGGAAA C AT T GAAAAA 2 9520 

TAAAGTTGAT CAAGTGAATA CAGATAAGGC TCAACCAAAT TACACTGAAG CGTCAACTGA 2 9580 

TAAAAAAGAA GCAGTAGATC AAGCGTTACA AGCTGCAGAA AGCATTACAG ATCCAACTAA 2 9 640 

TGGTTCAAAT GCGAATAAAG ACGCTGTAGA CCAAGTATTA ACTAAGCTTC AAGAAAAAGA 2 97 00 

AAATGAGTTA AATGGTAATG AG AG AG T C GC TGAAGCTAAA ACACAAGCGA AACAAACTAT 2 97 60 

TGACCAATTA AC AC AT T T AA ATGCTGATCA AATTGCAACT GCTAAACAAA AC AT T GAT C A 2 9820 

AG C G AC G AAA CTTCAACCAA TTGCTGAATT AGTAGATCAA GCAACGCAAT TGAATCAATC 2 988 0 

TATGGATCAA TTACAACAAG CAGTTAATGA ACATGCTAAC GTEAGCAAA CTGTAGATTA 2 994 0 

CACACAAGCA GATTCAGATA AACAAAATGC TTATAAACAA GCTATTGCTG ATGCTGAAAA 30000 

TGTATTGAAA CAAAATGCGA ATAAGCAACA AGTGGATCAA GCACTTCAAA AT AT T T T AAA 30060 

TGCAAAACAA GCATTAAATG GTGATGAACG TGTAGCACTT GCTAAAACAA ATGGTAAACA 3012 0 

TGACATCGAC CAATTGAATG CAT T AAAC AA TGCTCAACAA GATGGATTTA AAGGTCGCAT 3018 0 

CGATCAATCA AAC GAT T T AA ATCAAATCCA ACAAATTGTA GATGAGGCTA AGGCACTTAA 302 4 0 

TCGTGCAATG GATCAATTGT CACAAGAAAT CACTGACAAT GAAGGACGCA CGAAAGGTAG 30300 

CACGAACTAT GTCAATGCAG ATACA:AAGT CAAACAAGTA TATGATGAAA CGGTTGATAA 30360 

AGCGAAACAA GCACTTGATA AATCGACTGG TCAAAACTTA AC T G C AAAAC AAGTTATCAA 30420 

ATTAAATGAT GCAGTCACTG CAGCTAAGAA AGCATTAAAT GGTGAAGAAA GACTTAATAA 304 8 0 

TCGTAAAGCT G AAGC AT T AC AAAGATTGGA TCAAT TAACA CATCTAAACAATGCTCAAAG 30540 

ACAATTAGCA ATCCAACAAA TTAATAATGC TGAAACGCTA AATAAAGCAT CTCGAGCAAT 30600 

TAATAGAGCA ACTAAATTAG ATAATGCAAT GGGTTCAGTA CAACAATATA TTGACGAACA 30 660 

GCACCTTGGT GTTATCAGCA GCACAAATTA CAT CAATGCA GATGACAATT TGAAAGCAAA 30720 

TTATGATAAT GCAATTGCGA ATGCAGCACA TGAGTTAGAT AAAGTGCAAG GTAATGCAAT 307 8 0 

TGCaAAAGCT GAAGCAGAGC AATTGAAACA AAATATTATC GATGCTCAAA AT G CAT T AAA 30840 

TGGAGACCAA AACCTTGCAA ATGCCAAAGA TAAAGCAAAT GCGTTTGTTA ATTCGTTAAA 30900 

TGGATTAAAT CAACAGCAAC AAGATCTTGC ACATAAAGCA ATTAACAATG CCGATACTGT 30 9 60 

ATCAGATGTA ACAGATATTG TTAATAATCA AATTGACTTA AATGATGCAA T G G AAAC AT T 31020 
GAAACATTTA GTTGACAATG AAATTCCAAA TGCAGAGCAA ACTGTCAATT ACCAAAACGC 31080

TGACGATAAT GCTAAA 31096



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

ATGACAGAAT GGGAGCGAGG ACTTAGAATG TTTCCTAAAT CAGGTTTATT AAATTTTGAG 60

TTAGCGATAG mAAATCGTTC ATTAAATGAT GATGAAAAAG CATTAAAATA TGTGCGTAAA 120

GCATTAAATG CAGACCCTAA AAATACAGAT TATATTAACT TAGAAAAAGA GTTGACTAAA 180 

TCAAATGAGT CGAAAAATAA ATAACTTTTA TGATGTACAA CAGTTATTGA AAAGTTACGG 24 0 

ATTTCTAATA TATTTTAAAA AT CC AGAAGA TATGTACGAA ATGATTCAAC AGGAGATTTC 300 

ATCATTGTAT CAATATGAAC TGTTATCTAA GGAAGAATAT TTGAAATGTA CGTTGATAAT 3 60 

TAATCAGAGA AGGAATGAAC AGAAATGACA AAAATTATT TAGCAGCTGA TGTAGGCGGG 4 20 

ACGACTTGTA AATTAGGTAT TTTCACACCT GAAT TAGAAC AATTACATAA ATGGTCTATT 4 80 

C AC AC T GAT A CATCTGATAG TACAGGATAT ACACTTTTGA AAGGAATTTA TGATTCGTTT 54 0 

GTTGAAAAAG TAAATGAAAA T AAT TAT AAT TTTTCAAATG TACTTGGCGT AGGTATTGGT 600 

GTACCAGGTC CTGTTGACTT TGAAAAAGGT ACAGTAAATG GAGCAGTAAA CTTATATTGG 660 

C C AG AAAAAG TTAATGTACG TGAGATTTTT GAACAATTCG TTGATTGTCC AGTGTATGTA 720 

GATAATGATG CTAACATAGC TGCTTTAGGG GaGAAACACA AAGGTGCTGG TGAAGGTGCC 7 80 

GATGATGTTG TTGCCATCAC ACTTGGTACA GGTCTAGGTG GAGGAATTAT TTCCAAATGG 84 0 

TGAAATCGTA CATGGTCATA ATGGCTCtGG CGCAGAAATA GGTCATTTTA GAgCAGACTT 90 0 

CgATCAACGA TTTaAATGTA ATTGTGGTCG TTCTGGATGT ATT G AAAC AG TTGCTTCaGC 9 60 

GACAGGCGTT GTTAACTTAG TTAACTTCtA CTATCCGAAG TTGJ5GTTTA GATCTTCTAT 102 0 

AT TAG AAT T G AT T AAAGAAA ATAAGGTtAC aGCAAAAGCT GTTTTTGATG CGGCAAAAGC 108 0 

TGGTGACCAA TTCTGTATTT TCATTACTGA AAAGGTTGCA AACTATATTG GATATTTATG 114 0 

TAGTATTATT AGTGTTACAA GTAATCCGAA ATATATCGTT CTAGGTGGAG GAATGTCTAC 1200 

TGCAGGACCT ATTTTAATTG AAAATATTAA AACAGAATAT CAT AAT T T AA CATTTGCACC 12 60 

TGCTCAATTT G AAAC T G AAA TTGTACAAGC GAAATTAGGT AATGATGCAG GTATTACAGG 132 0 

AGCAGCAGGA TTAATCAAGA CCTATGTATT AGATAAAGAG GGGGTAAAAT AATGGCTATT 1380 
GTTGATGTGG TTGTTATTCC AGTTGGVACG GAAGGTCCGA GTGTTAGTAA ATATATTGCA 1440

GATATTCAGA AAAAACTTCA AGAATATAAA GCAATGGGTA AAATTGATTT TCAATTAACA 1500 

CCAATGAATA CTCTAATTGA AGGTGAATTA AGCGATGTAT TAGAAGTTGT GCAAGTGATA 1560 

CATGAATTAC CTTTTGATAA AGGTTTAAGT AGAGTTTGTA CAAATATCCG STTGATGAC 1620 

CGACGAGACA AAT C TAG AAA AATGAATGAT AAACTAACAT C AG T AC AAAA AC AT T T AGAA 168 0 

AATAGTGGTG AAAACCTATG AGGATTTCAA GCTTAACTTT AGGCTTAGTT GATACTAATA 17 4 0 

CGTATTTCAT CGAAAATGAC AAAGCTGTTA TTCTGATTGA CCCTTCAGGT G AAAG T G AAA 1800 

AAATTATTAA AAAATTAAAC CAAATAAATA AACCGTTAAA AGCTATTTTA TTAACACATG 18 60 

CACACTTTGA T CAT AT C GG A GCAGTCGATG ATATAGTTGA TCGATTCGAT GTCCCGGTTT 192 0 

ATATGCATGA AGCAGAGTTT GATTTTCTAA AAGATCCCGT TAAAAAT GGG G C AG AT AAAT 198 0 

TTAAGCAATA TGGATTACCA ATTATTACAA GTAGGTAAC TCCTGAAAAG TTAAmCGAAG 2 04 0 

GTAGCACAGA AATAGAAGGA TTTAAGTTnT nAyrTGTaCA CACACCTGGA CATTCACCAG 2100 

GAAGTTTAAC ATATGTGTTC GATGAATTCG CAGTTGTTGG AGATACATTA TTTAATAATG 2160 

GAATCGGACG TACAGATTTA TATAAAGGTG ATTATGAAAC GCTAGTTGAT TCTATTCAA 2220 

ATAAAATATT TGAATTAGAA GGC 2243
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8009 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

TTGGnATCAT tyAcgGTAAA AAGAATAAaG CAAGATTtAT TTCATTAGTA CTAATTTGTG 60 

CAATGTTTGC AATTTGTTGG GTTGCATATA TTCAATGGGA GTCTACAATC GCTTCATTTA 120 

CACAATCTAT TAATATTTCa ATGGCACAAT ATAGTGTTTT ATGGACAATT AAC G G AAT AA 180 

TGATTTTAGT AGCACAACCA TTAATTAAAC CGATTCTCTA TCTGTTAAAA GGAAACTTAA 24 0 

AG AAG C AAA T GTTTGTCGGC ATCATCATTT TTATGTTGTC GTTCTTTGTC ACGAGTTTTG 300 

CCGAAAACTT TACAATATTT GTTGTCGGTA TGATTATTT AACTTTTGGA GAAATGTTTG 360 

TATGGCCAGC AGTTCCAACT ATAGCCAATC AGTTAGCGCC AG AT GG T AAG CAAGGACAGT 4 20 

ACCAAGGTTT TGTGAATTCA GCTGCTACAG TAGGAAAAGC ATTTGGTCCA TTTCTTGGTG 4 80 

GTGTATTAGT TGATGCGTTT AATATGCGCA TGATGTTTAT CGGTATGATG CTACTACTTG 54 0 
TATTTGCATT AATATTATTA ATGGTTTTCA AGGAGAATAA TACGCAACCT AAAAAAATAG 600

ATGCATAATG AGTAAATAGA ATTAACGTTA T AG AC T T G AA ATAAATGTCG TTATAACATA 660 

ATATTAATTT GTATAATTTA ATTTCGTTTG GAGCTTTTCT AC AG AAAG C T AGTGATGCTG 720 

AGAGCTAGTG TTAAGGACTA AATGTAAATC GTATTAATTT TAAATTGAAT GAATGACATC 780 

TCTTACTATT AAAAT GAG T G CACAATTTTT GTGAAATAGG GTGGTAACGC GGCAAATGTC 840 

GTCCCTATGT AAATAGAATA GTTAGAGGTG TCTTTTTTAT TGAATAGGAG GAAATGTGTT 900 

GAATTACAAC C AC AAT C AAA TTGAAAAGAA ATGGcAAGAC TATTGGACG AAAAT AAAAC 9 60 

ATTTAAAACA AATGATAACT TAGGTCAAAA GAAATTTTAT GC T T TAG AC A TGTTTCCATA 1020 

TCCATCAGGT GCTGGTTTAC ATGTTGGACA TCCTGAGGGc TATACAGCAA C AG AT AT CAT 1080 

TTCAAGATAT AAAAGAATGC AAGGATATAA TGTATTACAT CCGATGGGGT GGGATGCATT 114 0 

CGGATTACCA GCAGAGCAAT AT GC T T T AGA CACTGGCAAC GACCCACGTG AATTTACAAA 1200 

GAAAAAT AT C CAAACTTTTA AACGACAAAT T AAAGAAT T A GGGTTCAGTT ATGATTGGGA 12 60 

TCGTGAAGTT AATACAACAG ATCCAGAATA CTATAAATGG ACACAGTGGA TTTTCATACA 132 0 

GTTATATAAC AAAGGTTTAG CATACGTf GA TGAAGTTGCA GTTAACTGGT GTCCAGCATT 1380 

AGGCACTGTT TTATCTAACG AAGAAGTGAT TGATGGTGTC TCTGAACGTG GTGGACATCC 14 4 0 

AGTTTATCGT AAGCCGATGA AACAAT GGGT ACTTAAAATC AC AGAAT AT G CAGATCAATT 1500 

ATTAGCAGAT TTAGATGATT TAGATTGGCC TGAGTCTTTA AAAGATAT GC AGGCAATTG 1560 

GATTGGACGT TCTGAAGGGG CCAAAGTTTC ATTTGATGTA GATAATACGG AAGGAAAAGT 162 0 

AGAAGTATTT ACGACTAGAC C AG AT AC AAT CTATGGTGCA TCATTCTTAG TCTTAAGTCC 1680 

TGAACATGCA T TAG T T AAT T CAATTACAAC AGATGAATAT AAAGAAAAAG TAAAAGCTTA 17 4 0 

TCAAACAGAA GCTTCTAAAA AGTCAGATTT AGAACGTACA GATTTAGCAA AAGATAAATC 1800

AGGTGTATTT ACTGGTGCAT ATGCAACTAA TCCTTTATCT GGTGAAAAAG TACAAATTTG 18 60 

GATTGCTGAT TATGTATTAT CAACATATGG TACTGGAGCA ATTATGGCAG TACCAGCGCA 1920 

T GAT G AC AGA GATTATGAAT TTGCTAAAAA GTT"EATTTG CCAATCATTG AAGTCATCGA 1980 

AGGTGGAAAT GTTGAAGAAG CAGCATACAC TGGTGAAGGT AAACATATTA ATTCTGGTGA 204 0 

ACTTGATGGT TTAGAAAATG AAGCGGCAAT TACTAAAGCT ATTCAATTAT TAGAGCAAAA 2100 

AGGTGCTGGC GAAAAGAAAG T T AAT T AC AA ATTAAGAGAT TGGTTATTCA GTCGTCAGCG 2160 

TTATTGGGGC GAACCAATTC CTGTCATTCA TTGGGAAGAT GG AAC AAT G A CAACTGTTCC 2220 

TGAAGAAGAG CTACCATTGT TGTTACCTGA AAC AG AT G AA ATCAAGCCAT CAGGGACTGG 22 8 0 

TGAGTCTCCA CTAGCTAATA TTGATTCATT TGTAAATGTT GTAGATGAAA AAACAGGTAT 234 0 

GAAAGGACGT CGTGAAACAA ATACAATGCC ACAATGGGCA GGTAGTTGTT GGTATTATTT 2 4 00 
ACGTTACATC GATCCTAAAA ATGAAAATAT GTTAGCAGAT CCTGAAAAAT TAAAACATTG 2460

GTTACCTGTT GATTTATATA TCGGTGGAGT AGAACATGCG GTTCTTCACT TATTATATGC 2520 

AAGATTTTGG CATAAAGTCC TTTATGATTT GGCTATCGTA CTACTAAAG AACCTTTCCA 2580 

AAAATTATTT AACCAAGGTA TGATTTTAGG AGAAGGTAAT GAGAAGATGA GTAAATCTAA 2 64 0 

AGGAAATGTA ATCAATCCTG ATGATATAGT ACAGTCTCAT GGTGCAGATA CTTTGCGTCT 27 00 

TTACGAAATG TTTATGGGAC CTTTAGATGC TGCAATTGCA TGGAGTGAAA AAGGATTAGA 2760

TGGGTCTCGT CGATTCTTAG ATCGCGTATG GCGTTTAATG GTAAATGAAG ATGGGACATT 2820 

GAGTTCAAAA ATTGTAACTA CAAATAATAA ATCTTTAGAT AAAGTTTATA ACCAAACTGT 2880 

TAAAAAGGTA ACAGAAGACT TTGAAACATT AGGATTTAAT ACTGCTATTA GTCAATTAAT 2 94 0 

GGTATTTATT AATGAGTGTT ATS^AG T T G A TGAAGTTTAT AAACCTTACA TTGAAGGCTT 3 00 0 

CGTTAAAATG TTAGCACCTA TTGCACCACA TATCGGTGAA GAATTATGGT CAAAATTAGG 30 60 

ACATGAAGAG TCTATTACGT ACCAACCTTG GCCAACTTAT GACGAAGCAC TACTTGTAGA 3120 

TGATGAAGTA GAAATCGTTG TTCAAGTGAA TGGTAAATTG AGAGCTAAA TTAAAATTGC 3180 

TAAAGATACA TCAAAAGAAG AAATGCAAGA AATTGCCTTA TCTAATGACA ATGTTAAAGC 32 4 0 

GAGTATTGAA GGTAAAGACA T CAT G AAAG T CATCGCTGTT CCTCAAAAAT TAGTCAATAT 3300 

TGTAGCTAAA TAATGTTTTA AGGAGGACTT TGAAATGAAG TCAATTACTA CAGATGAATT 3360 

AAAAAATAAA CTTTTAGAAT CTAAACCAGT TCAAATTGTT GATGTTCGTA CTGATGAAGA 3420 

AACAGCAATG GGATATATTC CTAATGCAAA GTTAATTCCA ATGGATACCA TTCCGGATAA 34 8 0 

TTTAAATTCA TTTAATAAAA ATGAAATATA TTATATTGTA TGTGCTGGTG GAGTTCGAAG 354 0 

CGCTAAAGTT GTAGAATATT TAGAGGCAAA TGGCATTGAT GCCGTAAATG TCGAAGGCGG 3600 

CATGCACGCA TGGGGCGATG AAGGTTTGGA AATAAAAAGT ATTTAAAGTA GTGACATAAT 3 6 60 

TTAAAATAAT ATTACATTTG TAATGACACC AAGTAACGTT TCGGTTGCTT GGTGTTTTTT 3720 

GGTATGAATT ACTTTCTGTT ACAAAACAAT CTAAAGCGTT CTTGTTATGT TTTATAAGA 3780 

TTTTAATTAC AAAACGGAAA CTAAATTGTA ATAAAATAAA ACTTTATTTT ATAAAATGAT 38 4 0 

GATGATAAAA TTGAGTGAAC TTAAAATATT GTACAAAATA ATATAGCTAT AAATATAATA 3900 

TAGCTATAAA TATAATATGA GGGAGCGTAT ATTTTTAGCA TAATTCTTAA CAACACAGCA 3 960 

GAGAACAGAC AACCAGGAGG AAAATGAAAT GAATTTGTTA AAGAAAAATA AATATAGTAT 4 02 0 

TAGGAAGTAT AAAGTAGGCA TATTCTCTAC TTTAATCGGA ACAGTTTTAT TACTTTCAAA 4 08 0 

CCCAAATGGT GCACAAGCCT TAACTACGGA TAATAATGTA CAAAGCGATA CTAATCAAGC 414 0 

AACACCTGTA AATTCACAAG AT AAAG AT G T TGCTAAT5&.T AGAGGTTTAG CAAATAGTGC 4 200 
GCAGAATACA CCTAATCAAT CTGCAACAAC CAATCAAGCA ACGAATCAAG CATTGGTTAA 4260

TCATAATAAT GGTAGTATAG TAAATCAAGC TACGCCAACA TCAGTGCAAT CAAGTACGCC 4 320 

TTCAGCACAA AACAATAATC ATACAGATGG CAATACAACA GCAACTGAGA CAGTGTCAAA 4 38 0 
CGCTAATAAT AATGATGTAG TGTCGAATAA TACCGCATTA AATGTACCAA CTAAAACAAA 4 4 40 

TGAAAATGGT TCAGGAGGAC ATCTAACTTT AAAGGAAATT CAAGAAGATG TTCGTCATTC 4 5 00 

TTCAAATAAA CCAGAGCTAG TTGCAATTGC TGAACCAGCA TCTAATAGAC C G AAAAAG AG 4 5 60 

AAGTAGACGT GCGGCACCGG CAGATCCTAA TGCAACTCCA GCAGATCCAG CGGCTGCAGC 4 620 
GGTAGGAAAC GGTGGTGCAC CAGTTGCAAT TACAGCGCCA TATACGCCAA CAACTGATCC 4 680 

TAATGCCAAT AATGCAGGAC AAAATGCACC TAACGAAGTG CTGTCATTTG ATGACAATGG 47 40 

TATTAGACCA AGTACCAACC GTTCTGTGCC AACAGTAAAC GTTGTAATA ACTTGCCGGG 4 8 00 
CTTCACACTA ATCAATGGTG GCAAAGTAGG GGTGTTTAGT CATGCAATGG TAAGAACGAG 4 8 60 

CATGTTTGAT TCAGGAGATA ATAAGAACTA TCAAGCACAA GGAAATGTAA TTGCATTAGG 4 920 

TCGTATACAT GGAACTGATA CGAATGACCA TGGCGATTTT AATGGTATCG AGAAAGCATT 4 980 

AACAGTAAAT CCGAATTCTG AATTAATCTT TGAATTTAAT ACAATGACTA CTAAAAACGG 504 0 

TCAAGGCGCA ACAAATGTTA TTATCAAAAA TGCTGATACT AATGATACGA TTGCTGAAAA 5100 
GACTGTTGAA GGCGGTCCAA CTTTGCGTTT ATTTAAAGTA CCTGATAATG TGAGAAATCT 5160

CAAAATTCAA TTTGTACCTA AAAATGSCGC AATAACAGAT GCGCGTGGCA TTTATCAACT 5220
AAAAGATGGT TACAAATACT ATAGCTTTGT TGACTCTATC GGACTTCATT CTGGGTCACA 5280

TGTTTTTGTT GAAAGACGAA CAATGGATCC AACAGCAACA AATAATAAAG AGTTTACTGT 5340

AACAACATCA TTAAAGAATA ATGGTAATTC TGGTGCTTCT CTAGATACAA AGACTTTGT 5400
ATATCAAGTT CAATTACCTG AAGGTGTTGA ATATGTGAAC AATTCATTGA CTAAAGATTT 5460

TCCAAGTAAC AATTCAGGCG TTGATGTTAA TGATATGAAT GTTACATATG ATGCAGCAAA 5520

TCGTGTGATA ACAATTAAAA GTACTGGAGG AGGTACAGCA AACTCTCCGG CACGACTTAT 5580

GCCTGATAAA ATACTCGATT TAAGATATAA ATTACGTGTA AATAATGTGC CGACACCAAG 5640
AACAGTAACA TTTAACGAGA CATTAACGTA TAAAACATAT ACACAAGATT TCATTAATTC 5700

AGCTGCAGAA AGTCATACTG TAAGTACAAA TCCATATACT ATCGATATCA TCATGAATAA 5760

AGATGCATTA CAAGCCGAAG TTGACAGACG TATTAACAA GCTGATTATA CATTTGCGTC 5820
ATTAGATATC TTTAATGGTC TGAAACGACG CGCACAAACG ATTTTAGATG AAAATCGTAA 5880

CAATGTACCA TTAAATAAAA GAGTTTCTCA AGCATATATT GATTCATTAA CTAATCAAAT 5940

GCAACATACG TTAATTCGAA GTGTTGATGC TGAAAATGCA GTTAATAAAA AAGTTGACCA 6000 
AATGGAAGAT TTAGTTAATC AAAATGATGA ATTGACAGAT GAAGAAAAAC AAGCAGCAAT 6060
ACAAGTTATC GAGGAACATA AAAATGAAAT AATTGGTAAT ATTGGTGACC AAACGACTGA 6120

TGATGGCGTT ACTAGAATCA AAGATCAAGG TATACAGACC TTAAGTGGGG ATACTGCAAC 6180

ACCGGTTGTT AAACCAAATG CTAAAAAAGC AATACGTGAT AAAGCAACGA AACAAAGGGA 6240

AATTATCAAT GCAACACCAG ATGCTACTGA AGACGAGATT CAAGATGCAC TAAATCAATT 6300

AGCTACGGAT GAAACAGATG CTATTGATAA TGTTACGAAT GCTACTACAA ATGCTGACGT 6360

TGAAACAGCT AAAAATAATG GCATCAATAC TATTGGAGCA GTTGTTCCTC AAGTAACTCA 6420

TAAAAAAGCT GCAAGAGATG CAATTAACCA AGCAACAGCA ACGAAAAGAC AACAAATAAA 6480

TAGTAATAGA GAAGCAACTC AGGAAGAGAA AAATGCAGCA TTGAACGAAT TAACTCAAGC 6540

AACCAACCAT GCTTTAGAAC AAATCAATCA AGCAACAACA AATGCTAATG TTGATAACGC 6600

CAAAGGAGAT GGTCTAAATG CCATTAATCC AATTGCTCCT GTAACTGTTG TTAAGCAAGC 6660 

TGCAAGGGAT GCCGTATCAC ATGATGCACA ACAACATATC GCAGAGATCA ATGCTAATCC 6720

TGATGCGACT CAAGAAGAAA GACAAGCAGC AATTGACAAA GTGAATGCTG CTGTAACTGC 6780

AGCAAACACA AACATTTTAA AC(£TAATAC CAATGCTGAT GTTGAACAAG TAAAGACAAA 6840

TGCGATTCAA GGAATACAAG CAATTACACC AGCTACAAAA GTAAAAACAG ATGCAAAAAA 6900

TGCCATCGAT AAAAGTGCGG AAACGCAACA TAATACGATA TTTAATAATA ATGATGCGAC 6960

GCTCGAAGAA CAACAAGCAG CACAACAATT ACTTGATCAA GCTGTAGCE CAGCGAAGCA 7020

AAATATTAAT GCAGCAGATA CGAATCAAGA AGTTGCACAA GCAAAAGATC AGGGCACACA 7080

AAATATAGTA GTGATTCAAC CGGCAACACA AGTTAAAACG GATACTCGCA ATGTTGTAAA 7140

TGATAAAGCG CGAGAGGCGA TAACAAATAT CAATGCTACA ACTGGCGCGA CTCGAGAAGA 7200

GAAACAAGAA GCGATAAATC GTGTCAATAC ACTTAAAAAT AGAGCATTAA CTGATATTGG 7260

TGTGACGTCT ACTACTGCGA TGGTCAATAG TATTAGAGAC GATGCAGTCA ATCAAATCGG 7320

CGCAGTTCAA CCGCATGTAA CGAAGAAACA AACTGCTACA GGTGTATTAA ATGATTTAGC 7380 

AACTGCTAAA AAGCAAGAAA TTAATCAAAA CACAAATGCA ACAACTGAAG AAAAGCAAGT 7440

GGCTTTAAAT CAAGTGGATC AAGAGTTAGC AACGGCAATT AATmATATAA ATCAAGCTGA 7500

TACAAATGCG GAAGTAGATC AAGCGCAACA ATTAGGTACA AAAGCAATTA ATGCGATTCA 7560

GCCAAATATT GTTAAAAAAC CTGCAGCATT AGCACAAATC AATCAGCATT ATAATGTAA 7620

ATTAGCTGAA ATCAATGCTA CACCAGATGC AACGAATGAT GAGAAAAATG CTGCGATCAA 7680

TACTTTAAAT CAAGACAGAC AACAAGCTAT TGAAAGTATT AAACAAGCTA ACACAAATGC 7740

AGAAGTAGAC CAAGCTGCGA CAGTAGCAGA GAATAATATC GATGCTGTTC AAGTTGATGT 7800

AGTAAAAAAA CA^GCAGCGC GAGATAAAAT CACTGCTGAA GTGGcGAacG TATTGaAGCG 7860
GTTAAACAAA CACCTAATGC AACTGACGAA GAAAAGCAGG CTGCTGTTAA TCAAATCCAA 7920

TCAACTTTAA AGATTCAAGC AATTTAATCC AAATTTAATC CAAAACCCAA ACAAATGGAT 7980

TCAGGGTAGG ACACCACTTA CAAATCCAA 8009 
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

ACCCACCCCn TGGGGATAnT TTACCTGGTG GGGCCTTCGA TTGCCTTTAG GTGAAACCaG 60 

CATGTTCaTT AGGATCTGCT AATTTAATTG ATACAACCAA ATTTATTCCA TTAGGTTTAG 120

AGATGAATGC TAACCATATT CATTCTGCTA AAGATGGTCG TGTTACTGCG ACAGCTGAAA 180

TTATTCATCG AGGTAAGTCG ACACATGTAT GGGATATAAA AATTAAGAAT GACAAAGAAC 240

AATTAATTAC AGTTATGCGT GGTACAGTTG CTATTAAACC TTTAAAATAA AAGAACTGCT 300

AGCTGAAATG TTATGAGATA TTCATAACTA CGGCTAGCAG TTTTTTTATG CGCTATATTG 360

TTGTAGTTTT AGAAATGCTT GTTCAATGCG TTCGGCAGCT TTACGGCCAC CCATAACATT 420

TCTACCAAAT GGTCCTAATT CTAAGTCTGC AAAGCATCCT GCGACAAATA GATTTGGTAT 480

CCATTCTAAT TTTTCGGAAA TAACAGGGTA ATTACATTCG TTGATAGGTG CATCATAATT 540 

TTGTATTAAT TGCTTAATAA GTGGTTGTGA CATAAAATCT ETTCAAAAC CAGTTGCAAC 600 

CATAATCTGT TGATATGGAA CAGAATCATT TTCAGTGTTA ATTACACCAC CACTAATTTG 660 

AGTGATAGGT GTTTTATGCa CATTTATACG ACCATTTTTA ATATGTTTTT TAAGGCGTAA 720 

GTACAGTTCG TGAGGCATTG ATCCTTTATG ACGTTCGCGT TGTACAATGG CATTTCTTTC 780

AGGCATGCTT TTAGTACTTA AAAATGAAGA CATATTTTTC GGACCTAACC AACCAGGATC 840 

AGCATCAAAG TCATGTATTT CAATATCTTT ATTTAGCCAT AAATGAATCT TTTTATCGTT 900 

ATCATGATTT AACAATTTAA GTGCAAGATG TGCAGCAGTa ATGCCGCTAC CAACGATATG 960

ATCGGTCTTA TCATATACTA CTTGATCAAG TTCTTTCTCG AAGATATGAT TTACATTCTG 1020 

TTTGTCTTTT AAAATGTCAG GCATAAACGG AATATTTGTA CTGCCTATTG CAATAACGAC 1080 

GCAATCTGTA GTGATAATTT GTCCATCTTC TAACTTGATA TGCCATTTGT CTTCTTGTTT 1140 

ATCTAAAGTT TGAACTAAAC CTTGAACCAA GCAATCCTCT AATTGATAT GTTTAGAAGC 1200

ATGTGCAATA TGATCCATAA ACATTGTCAA TTCAGGTCGT TGATAAGGAC CATAAAAAGC 1260

ATTTGTATAT TGGTGCTGTT TAGCGAATTG TTTTAGATGG AACGGTTGTG GATGTACGTG 1320
ATGTACAATC GGTGATCTTA AATAAGGCAT TTCTATTCGA TTTGTATATG AGTTAAACCT 1380

TTGGCAAAAA GTTTCGTGTG GGTCAATGAT TGTTAATCGG TCTGTTGTTA ATCCGCTTGA 1440

TAATAGTTTT TGTGCGATTG CAGTTCCCTG TATGCCACCG CCGATAATTG TCCAATGCAT 1500 

AATAAAACCT CTCTCTTTTT AAAACGTAAT AGTTACGATT TATAATTATT ATTATCATAA 1560 

TACATAACGA CATGAAAGGC AATTAAATTA AAGAGATATA TGTAGATAGG GCGAATCTGT 1620

AGTCAAAGAA AAAATCATTG AAAAAGAGGT AACAATGTCA AAAGAwAACA GCAGTAAAAT 1680 

CATTCCTAAT TTGGAATCAT CTTACTGCTG TTTGTTGTTG ATTTATATTC ATGATTTTGT 1740

TATATAATCT ACAATTTTGT GTCTTTTAAG TCTTCCGAAA TTTCATCGAC TTTAGTTTT 1800

TTAGTATAAG GCGTTTTAAT ATTATATGCT GCTTTCATAA TCATATGACT TGAAAGAGGA 1860

CCTGTAATTA ATACAAAGAT AATCGCAACG ATTAATTGCA TATTTACAAA ACCTTGAGTA 1920

GCAATAAAAT ATAAAAACGT ACCAAATAGT AATGACATTG CACCTAATGT TGATGCTTTT 1980

CCGGCAGCAT GTGCACGTGA ATATACATCT TCAAGTCTCA ATAATCCTAT AGCTGCTAGG 2040

GCGCTAATTA AAGCACCGAT GATAACAAAG ATAAGTGCAA GACTAATCAG TATGATTTTG 2100 

ATCATGTTCA ATCACCTTAC CTTTGTCCAT AAATTTAGAG AATACTGCAG TACCTAAAAA 2160

AGCTAATATA CCAATCATCA TAATAACGAC AATCATGAT TTAATATTTA ATAAAATACT 2220 

GAATAATGCT ATAACTGCCA TTAATTGAAG ACCAATCGCA TCTAATGCGA CAACACGATC 2280

GGCAAGTGAT GGGCCTAGCA CAACGCGAAT GAGCATAGCT AACATAGAAA TGACAACTAT 2340

GATTAATGCA ATAACGATAA TAACATTATG ATTCATTATA TTTCGCCCAC CTCTCTTACA 2400

ATTTTCTCTA ATGATGTTTT AATACTTTCT ACTTCTTGCT CTTTAGTTGA AAAATCTATG 2460

GCATGAATAT AAATTTTTGT ACGATCGTCA CTTACACCAA GCACTACAGT ACCAGGTGTT 2520

AATGTAATTA AATTAGACAG CAAGACAATT TGCCAATCTT TTTTTAAATC TGTGTGATAA 2580

ACAAAGAATC CTGGTTCATT TTTAATCGAA GGTTTAATAA TAATTTTCAA AACATCAAAA 2640

TTAGCTTTAA TCAGTTCGAT TAAGAAAATA ATAACTAATT TAATAATACG ATATAGCGTG 2700

ATGACATAAA ATCTACCTGG TAACACTCTG TGTAAGAGGT AAACAAGAAC TAGGCCAAAG 2760

ATGAAACCTA ACACAAAGTT ATTTGTTGTG TAACTATTTG TCACAACAA CCAAAACACT 2820 

GCGATAATAA AGTTTAATAC TAATTGTACA GCCATGTTAT TTACCTCCTA ATACAGCTTT 2880

AACGTAGGTT GATGGATTGT AGAATGTTTC TGCACCAGCT TTTACCATTG GATATAAGTA 2940

ATCTGCTGAC AATCCATATA AAACAGTTAT CACAACTGCA ACGATTGCAA TCGTAGTTAA 3000

ATATTTGACG TCGACTTTGT TATTAAGATC ATATCCTTTT GGTTGACCGA AAAAGCCTTG 3060

TAGGAATATG CGAATGACAG AATATAATAC GACTAAACTT GATAATAAGA CGATGACACC 3120
ACTTAAATAA AATCCTCTTT CAAATGTTGA TTGGACAATA AAAAATTTTC CATAAAAGCC 3180

ACTGAGTGGG GGAATGCCAG CTAAACTTAA TGCTGCGATA AAGAATGACC AACCAAGTAC 3240

AGGATATCGT TTAATTAAGC CACCAAATTG TCTTAAATCA GCAGTGCCTG TAATTTTAAT 3300 

CATAATTCCG ATAAGCAAGA ATAATGCAAG TTTTACTAAC ATGTCGTGCA ATGTATAGTA 3360 

AATAGCCCCA ATCATACCTG ACTCTGTCAT CATTGCAACG CCGACTAAGA TSCACCTAC 3420

AGCAATCATG ACATTGTATA GGATGATTTT TTTAATGTTG GCATATGCAA CAGCACCGAC 3480

ACAACCAAAG ATGATCGTTA ATAGTGCTAA GAATAAAATG ACATAATGTG AAAAGCTTAC 3540

ATTATCACTA AAGAATAGGC TCAATGTTCT AGCGATTGCA TAAACACCAA CTTTTGTTAA 3600

CAAAGCACCA AAGAATGCAA TGATTGGAAT TGGTGGgCAT AGTATGCACT AGGTAACCAA 3660

ACAAACATTG GGAATACGCC AGCTTTTGTA GCAAAAACAA AGATAAATAG TATGAAAACG 3720

ATATTGACTA AGCCACTGTC ATGCGCTGAA AGGTTAGCTA ATTTATTGCT TATATCTGCT 3780

AGATTCAATG TTCCTACTAC TGAATATAAA ATCETACAC CCATTACGAA GAAGGATGAC 3840

GATACAACGT TAACAAGAAC ATATTTTATT GTTTCTTGTA GTTGAATTTT TGTAGAACCA 3900

ATTACTAATA AGAAATAAGA TGACATTAAA AATACTTCGA AAAATACGAA TAGGTTGAAA 3960

ATGTCACCAG TTGTGAATGC ACCAATGATA CCTATTAACA TAAATAGTAC TGAAAAATAA 4020

TAATAATATC TTTCACGTTC AATACCAATT GTTTGGTATG AATATAAAAT CACAATAGCT 4080

GTAATAATAA TACTAGTAAT TATTAGTAGG GCACTGAATA TGTCTAATAC AAAGACAATA 4140

CTGTATGGTG CTTTCCATGA ACCTAGCTCT ACGCGTATTG GTCCATGTTT AACAACATTT 4200 

GCTAAATTGA TAATTGCCGC GACCAAGGTT AATAATGTAC CGCCTAGTGC GACATAACGC 4260

TTTATAATAG GACGCTTTCC AATAAAGACA AGTAATATGG CTGTAATTAC TGGAATAACT 4320

AGCGTTAACA CAAGCATATT ACTTTCAATC ATCTTCTGGA ACTCCTTTCA TACTCTCAAC 4380

GTTATCTGTG CCTAATTCTT TATATGTTCT AAATGCTAAT STAAGAAAA AGGCTGTTGT 4440

CGCAAgGCGA TAACGATTGC TGTTAAAATA AGTGCTTGCG GGaTAGGaTC AACATAGCTT 4500

TTTACGTTCG CTTCATAAAT TGGAACAGTA CCATGTTTAA GTCCGCCCAT AGTTATTAAA 4560

AATAAATTTG CTGCATGTGT TAATAGTGTA GTTCCCATAA CAATTCGTAT CAGACTTTTA 4620

GACAAAACGA GATAGACACT AATTGCTGTG AGAATACCAC TAACAAAAAT CATAATAATT 4680

TCCACTATTC GTTCTCTCCA ATCGAAATAA TAATTGTCAT GACAGTACCA ACTACTGCAC 4740

ATAAAACACC GAAATCAAAG AATACTGCTG TTGTCATATG AACAGGTTCT AATATAAATA 4800

ACGGTATATC AAATGTGACA TGCETAAAGA AATTTTTGCC TAAAAACCAA CTTGCGATAG 4860

GCGTCGCAAT ACAAAAAACT AATCCGATAC CTATCAAGAT TTTAAAATCT AATGGGAAAA 4920

TTTTACGCAT TGTTTCTATA TCAAATGCAA TCGTAATGAT AACAAGTGAA CTTGCGAATA 4980
ATAATCCGCC GACGAAACCG CCACCAGGTG TATAATGTCC TGCTAAGAA AGTGAAAAAC 5040

CAAAGACCAT TACCATGAAA AAGATAATAA CTGCAGCAAA TTGCAAAATT AGATCATTTT 5100

GTTGTCTATT CATGATTTTT CACCTCGTTA CCTTGCGTTT GACGCTTTTT ACGTAATTTA 5160 

ATCATTGTAT ATACAGCTAA TCCTGCGATA CCAAGCACAG ATGACTCGAA TAAAGTATCC 5220

ATACCACGGA AATCAACAAG TATGACGTTT ACCATGTTTT TACCGTGAGC tAAATCATAA 5280

ACGTGCTCTT GATAAAACTT AGATATCGAT TCAAAATGTC TATTTCCGTA TGCAATTAAA 5340

CCGATAATAA TGACGGACAA ACCAACACCA CCAGCAATTA AAGCATTAGT AAGCTGGAAT 5400

GAGCGCTTTT CATTATAACG ATTTAAATTT GGTAAGTGGT AGAAGCATAA TAAGAACAAT 5460

GCTGTTGAAA TAGATTCAAC GACAAACTGT GTCAATGCTA AGTCGGGTGC TTTAAAGAAT 5520 

ATAAACAATA CAGACACAGC ATATCCAACT GCACTTAACA TAATGATGCT AAATAATCTT 5580

GATTTAGCGA AAAGAATTAA AAAGGCAGCA CTTAATAATA AAATTACGAT ACAAATTCG 5640

AAAATTCTAA TCGGACTAAC GTCTTTAAAA TTAATGTTGA AAGGTACTGA GAATATAGTG 5700

ACAAATGTTA ATAAAATTAA TGCACCAAAA ATGATAACTA AATTATTACG TGAATAATCG 5760

GTAACATAGC TATTCGTCAT CTTTTCAGAG TAGTTTGGAA TAACATTTGC ACTTCTGTTG 5820

TACCAATAAT TGAATGTTAG TTTACCAGGT TGTCGTTGCA ACAATTTCAC CCAATAACTA 5880

AATGTCACAA TTAGTAAGAT ACCTAAAATA TAAATCACTA ATGTTGATAA AAAGGCAGGC 5940

GTTAATCCAT GGAACATATG GAATTCAACA TCATCAATTA CCGTATGATT AATCGAAGag 6000 

TnAGCTGGTT CAATAATCGA ATTAGTTAAA ATGCCAGGA ATAAACCAAA TACAATTACT 6060

AATGTAGCTA AAATAGCTGG TGATAAAAGC ATTAATATTG ATACTTCGTG TGCTTTTTTA 6120

GGTAATTGTT CAGGTTTATA TTGTCCGAAA AATATATGCA TTATAAATTT AATTGAATAT 6180 

ACAAATGTGA AGACACTGCC CACTATACCA ATGATTGGGA ATAGGTAGCC TAATGTATCA 6240

ACACTGAATA AATTTGCTTG GCTTGCTGTA AATGTTGTTT CTAAAAATGA TTCTTTTGAT 6300 

AAGAAACCAT TGAACGGTGG TACACCAGCg CATACTTAAT GCTGTAATAA CAGTGATTGT 6360

AAATGAAATA GGCATAATTG TTAGTAAGCC ACCTAATTTC TTAACATCAC GTGTACCAGT 6420

AGAATGATCC ACTGCACCTG TAATCATAAA TAGGGCACCT TTAAATGTTG CATGGTTGAT 6480

TAAATGGAAT ATTGCAGCCG TAAATGCAGC AGCATATATT TTGCTATCAT CGCCTTGATA 6540

GTGATAACTA ATGGCACCGA TTCCAAGCAT CGCCATAATC ATACCTAATT GGGATACTGT 6600

TGAAAATGCC AGTATACCTT TCAAGTCTTG TTGTTTTGTT GCGTTAGCG AAgCCCAGAA 6660 

TAATGTAATT AAACCAACGA GTGTGACAGT CCATACCCAA CCTTGCGATG CTGCGAAGAT 6720 

TGGTGTCATT CGAGCGATTA AATATAACCC TGCTTTAACC ATTGTTGCTG AATGAAGATA 6780
AGCACTGACT GGTGTAGGTG CTTCCATTGC ATCTGGTAGC CAAATATAAA ATGGAAACTG 6840

AGCAGATTTT GTAAAAGCAC CAATCATGAT TAAAATCATC GCAAAAATGA AGAATGGGCT 6900 

ATTTTGAATT TCAGAAGCAT GTTGAATCAT GTACTGAATG CTAAATGATT GTGTTGGTAT 6960 

AGCGAGTAAG ATGATACCAC CTAATAATGA TAGACCACCA AATACTGTGA TTATGAGCGA 7020

TTTTTGAGCA CCATATATAG ATGCTTGTCG TTCGCGCCAG AATGAAATAA GTAAAAAACT 7080

AGAAAATGAC GTTAGCTCCC AGAATAAATA TAGAATAATA ACATTATCTG AAAGTACGAC 7140

ACCTAACATT GCACCCATAA ATAGTAATAA ATAACAATAA AAATTCCCTA GTTGTTCTGA 7200

CTTACTTAAG TAGCCGATTG AATATAATAC TACTAAACTG CCGATTCCTG AATAAGCAA 7260

ACTAAAGAGT AAACCTAAGC CATCAAGATA TAAATCAAAG TTCATACCAA AATGAGGCAT 7320

CCAATTTAAG GTTTTCATTA CAGTATTACC TGACATCGTC GTTTTAATTA ATGTAAGCAT 7380

ATAAATAAAT ATGACGATAG GGACAGGTAA TACGAACCAT CCTAAATGTA TACGTTTAAA 7440

AAATCTATAC AGGATAGGAA TAATGAGTGC GAATATTAAC GGTAATATCA CCGCAATATG 7500

TAACAAACTC ACTATGTTGT CCTCCTTTAA AAAATATTTA TGTTATTCAT TATACATGAA 7560

TGATATAGTT CTGAAAAACG TACACACTCC TTGTTGTGCT TTATTTTCAG AaGTATTTAA 7620

ATAAGAAGAA ACACGTCATT TTTTATTTAA AATTTTCTTT GTATTGAAGT GAATAATCTT 7680

CTTTTAAGCG TGCTAAACTA GCTAAAGACA TTTCAGCATG TTTTGTTTGC TGAGCTTTAA 7740

GTTTAGTTTC TAAATCTGTA ATTGCTTGTT GAAGTGAATC TTCATAGCGC AATACATCAA 7800

CATTGAAGTC GCGTAATTGT GAACGTTTCG TATAGCGTTT TTCAAAATGG CTTAATGCTT 7860

TGCGGTCATG GAAAAATACA CCTTCAGTTT CAGTAGGGTT ATGTAAATCA CCTTGTTTCG 7920

GGTGTTTGAT AACTTGTTCA ACTTTAACAA GGACATCGTC TCCATTTTCT TCAACAATCG 7980 

TGACACCATA GCTACCTGTT TTGTGTGAAA ATCGATATAG CTTCATGCTA TTTTCCTCCC 8040

TTAAAAGTAT GTTAATATAT ATGTATCATA ACATGAATGG AGAATATAAA TGGCTAACTA 8100 

TCCACAGTTA AACAAAGAAG TACAACAAGG TGAAATCAAA GTGGTTATGC ACACAAATAA 8160

AGGTGACATG ACATTCAAAT TATTTCCAAA TATTGCACCA AAAACAGTTG AAAATTTTGT 8220 

GACACATGCA AAAAATGGTT ATTATGATGG AATCACATTC SCCGTGTCA TTAATGACTT 8280

CATGATTCAA GGTGGCGATC CAACAGCTAC TGGTATGGGT GGCGAAAGTA TTTATGGCGG 8340

TGCTTTTGAA GATGAATTTT CATTAAATGC ATTTAACTTA TATGGCGCAT TATCAATGGC 8400

TAACTCAGGA CCTAATACTA ATGGTTCACA ATTTTTCATT GTTCAAATGA AAGAAGTACC 8460

TCAAAATATG TTAAGTCAAC TTGCAGATGG TGGCTGGCCT CAACCAATCG TTGATGCATA 8520

TGGCGAAAAG GGTGGTACAC CATGGTTAGA TCAAAAACAT ACAGTATTCG GTCAAATCAT 8580

TGATGGTGAA aCTACATTAG AAGATATTGC AAATACAAAA GTGGGACCAC AAGATAAACC 8640






ACTTCATGAT GTTGTAATTG AATCTATTGA TGTTGAAGAA TAATATCTAA ACATAATTAA 8700

CTACCAACAT TTTAAACTCG GATAAAGCTA ATTTATGAAT GGATTAGTAT ATATTCCAAC 8760

gAAAATAAAT AAACTAATAT GATGAGCAAT CTCAATATAT TTATCaAGAA AGCACAGTTT 8820

TTAAATAGAT GTGTATTTTA AAGATAATAG TTGAGGTTGC TTTTTATGT TTTACAGAGA 8880

ATTGCTATTC AAATAGTAAA TAAATTGAAA ACAAAGTAGC TGGATATCAT ATTGATTTAG 8940

ATAGGAATTT GTTGCTAATT TTATTTGTAA ATCCAAGTTT GTAGAATTCT TATTCATTTA 9000 

TAAAATAATA TTCGTATGAT TTGATTTTTT AATTAGTCCA CCATTTCGAT TTGTGCTATG 9060

ATAATAGTGT TAAGTAAACG AAATAAGGGG TTATTAAGTT GAATAACTAC AAAATTGGCC 9120

AACATATCAA GGTGCGTGTA cTGGTATTCA ACCATACGGT GCGTTTGTTG AGACCCCTAA 9180

TCATACTGAA GGACTGATTC ATATATCAGA AATTATGGAT GACTACGTTC ATAATTTGAA 9240

GAAATTTCTA TCAGAAGGCC AAATTGTTAA AGCTAAAATT TTGTCTATAG ATGATGAAGG 9300

AAAGCTTAAT CTATCATTAA AGGATAATGA TTACTTCAAA AATTATGAGC GTAAGAAGGA 9360 

AAAACAATCA GTATTAGATG AAATCAGAGA AACAGAAAAA TATGGGTTTC AAACACTTAA 9420 

AGAACGCTTA CCAATCTGGA TAAAACAGTC AAAGCGAGCA ATTCGAAACG ACTAAfiGAA 9480

CAGATAAATC GTACCGAAAA TCATACAAAG GGTCTGAAAT GAAAGTTTCT TAGACTATAA 9540

AAGAGATTAG TATCTATTAA ATTTTATTAG ATACTAATCT CTTTTTGTCT ACGATAACGT 9600

AATATGaTTG ATTCTATTTA CACGTACAAA TGGTTTAAGG TGACATATCC ATTATCTTTG 9660

TTAGATAGAA TCGTTGATTT GCaATATTGT ATGTGGATTT GTTTTTTTTA TTTATTTTAG 9720

AAATGAGAAC TACAACTTAA AGTATTAAAC GAATTGCAAC TATATAAACA GATAATTGGA 9780

GAATGAAAAA ATTACATGTT ATAGTCAACT CAATAATTTT AAGGAGGAAT TAAGTAATGA 9840

AAAGTAAATA CGAACCATTG TTTGATAAAG TAGAATTiEC AAATGGAGTA GAGTTGAGAA 9900

ATCGATTTGT GTTAGCCCCT TTAACACATA TTTCTTCAAA TGATGATGGT ACTATTTCAG 9960

ATGTAGAACT TCCTTATATT GAAAAGCGTT CACAAGATGT TGGTATTACA ATTAATGCTG 10020

CGAGTAATGT GAGTGATGTC GGAAAAGCAT TTCCAGGACA GCCATCAATC GCGCATGACA 10080

GTAATATTGA AGGACTAAAA CGATTAGCTA CAGCAATGAA GAAAAACGGT GCCAAAGCAC 10140

TCGTACAAAT ACATCATGGC GGTGCACAAG CATTGCCTGA ATTAACACCT GATGGAGACG 10200 

TCGTAGCACC AAGTCCAATT TCTTTAAAAA GTTTTGGTCA GAAACAAGAA CATAGTGCTA 10260

GAGAAATGAC GAATGAAGAG ATTGAACAAG CAATCAAGGA TTTTGGTGAA GCAACGCGAC 10320

GTGCAATTGA AGCAGGGTTT GATGGTGTTG AAATACATGG CGCGAATCAT TACTTAATTC 10380

ATCAATTTGT ATCACCATAC TATAATAGAA GAAATGATGT ATGGGCAAAT CAATATAAAT 10440
TCCCGGTCGC TGTGATTGAA GAAGTACTTA AAGCGAAAGA AGCGffiTGGC AATAAAGACT 10500

TTATAGTTGG ATACAGATTA TCTCCAGAGG AAGCGGAGTC TCCAGGAATC ACAATGGAAA 10560

TTACAGAGGA ACTCGTTAAT AAAATTAGCC ATATGCCAAT CGACTATATT CATGTTTCAA 10620 

TGATGGATAC GCATGCAACG ACACGTGAAG GTAAATACGC TGGACAAGAA AGACTGCCTT 10680 

TAATTCACAA ATGGATAAAT GGTCGTATGC CACTTATCGG TATTGGTTCA ATTTTCACAG 10740 

CTGACGAAGC TTTAGATGCA GTTGAAAATG TTGGTGTTGA CTTAGTAGCC ATTGGTAGAG 108 00 

AGCTACTACT GGATTATCAA TTTGTTGAAA AAATTAAAGA TGGACGGGAA GATGAAATTA 10860 
TTAATTACTT TGATCCAGAG AGAGAAGATA ATCATCACTT AACTCCTAAT TTATGGCATC 10920

AATTTAATGA AGGGTTTTAT CCATTACCAC GTA 10953

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8155 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

TTTGATAnAA AACTGAATnA ATTAAATGTA TCGATTCAAC CTAATGAAGT GAATTTACAA 60

GTTAAAGTAG AGCCTTTTAG CAnAAAGGTT AAAGTAAATG TTAAACAGAA AGGTAGTTTA 120

GCAGATGATA AAGAGTTAAG TTCGATTGAT TTAGAAGATA AAGAAATTGA AATCTTCGGT 180

AGTCGAGATG ACTTACAAAA TATAAGCGAA GTTGATGCAG AAGTAGATTT AGATGGTATT 240

TCAGAATCAA CTGAAAAGAC TGTAAAAATC AATTTwCCAG AACATGTCAC TAAAGCACAA 300

CCAAGTGAAA CGmAGGCTTA TATAAATGTA AAATAAATAG CTAAATTAAA GGAGAGTAAA 360

CAATGGGAAA ATATTTTGGT ACAGACGGAg TAAGAGGTGT CGCAAACCAA GAACTAACAC 420 

CTGAATTGGC ATTTAAATTA GGAAGATACG GTGGCTATGT TCTAGCaCAT AATAAAGGTG 480

AAAAACACCC ACGTGTACTT GTAGGTCGCG JTACTAGAGT TTCAGGTGAA ATGTTAGAAT 540

CAGCATTAAT AGCTGGTTTG ATTTCAATTG GTGCAGAAGT GATGCGATTA GGTATTATTT 600

CAACACCAGG TGTTGCATAT TTAACACGCG ATATGGGTGC AGAGTTAGGT GTAATGATTT 660

CAGCCTCTCA TAATCCAGTT GCAGATAATG GTATTAAATT CTTTGGATCA GATGGTTTA 720

AACTATCAGA TGAACAAGAA AATGAAATTG AAGCATTATT GGATCAAGAA AACCCAGAAT 780 

TACCAAGACC AGTTGGCAAT GATATTGTAC ATTATTCAGA TTACTTTGAA GGGGCACAAA 840

AATATTTGAG CTATTTAAAA TCAACAGTAG ATGTTAACTT TGAAGGTTTG AAAATTGCTT 900 

TAGATGGTGC AAATGGTTCA ACATCATCAC TAGCGCCATT CTTATTTGGT GACTTAGAAG 960
CAGATACTGA AACAATTGGA TGTAGTCCTG ATGGATATAA TATCAATGAG AAATGTGGCT 1020

CTACACATCC TGAAAAATTA GCTGAAAAAG TAGTTGAAAC TGAAAGTGAT TTTGGGTTAG 1080

CATTTGACGG CGATGGAGAC AGAATCATAG CAGTAGATfi. GAATGGTCAA ATCGTTGACG 1140

GTGACCAAAT TATGTTTATT ATTGGTCAAG AAATGCATAA AAATCAAGAA TTGAATAATG 1200 

ACATGATTGT TTCTACTGTT ATGAGTAATT TAGGTTTTTA CAAAGCGCTT GAACAAGAAG 1260

GAATTAAATC TAATAAAACT AAAGTTGGCG ACAGATATGT AGTAGAAGAA ATGCGTCGCG 1320

GTAATTATAA CTTAGGTGGA GAACAATCTG GACATATCGT TATGATGGAT TACAATACAA 1380

CTGGTGATGG TTTATTAACT GGTATTCAAT TAGCTTCTGT AATAAAAATG ACTGGTAAAT 1440

CACTAAGTGA ATTAGCTGGA CAAATGAAAA AATATCCACA ATCATTAATT AACGTACGCG 1500

TAACAGATAA ATATCGTGTT GAAGAAAATG TTGACGTTAA AGAAGTTATG ACTAAAGTAG 1560 

AAGTAGAAAT GAATGGAGAA GGTCGAATTT TAGTAAGACC TTCTGGAACA aACCATTAGT 1620

TCGTGTCATG GTTGAAGCAG CAACTGATGA AGATGCTGAA aGATTTGCAC AACAAATAGC 1680

TGATGTGGTT CAAGATAAAA TGGGATTAGA TAAATAAATA CTGTATACA AATGAGCCGA 1740

TGCGTATGcA nTcgtTTTTT GTGTTTGTAG AAATAATTTA TAGTACAAAC GTAAAATGAT 1800 

ATAAACAAAA TAAAAACAAA GTAATCAATA TGTAATATAA AATACACTGG TACTCAATAT 1860

ATAATGATGA TAAAATTAAT TTTAATTAGA TAGAGTTGCT TTGTGTTTTT AACGCAGATG 1920

CTACTACTTA TCTTAACAGT TGATTAAGTG AAATCATTTA ACAGCGAGAA TAATCAACCA 1980

GGAGGATGAC TTAATGAATT TATTCAGACA ACAAAAATTT AGTATCAGAA AATTTAATGT 2040

CGGTATTTTT TCAGCTTTAA TTGCCACTGT TACTTTTATA TCTACTAACC CGACAACAGC 2100 

GTCTGCAGCA GAGCAAAATC AGCCTGCJCA AAATCAACCA GCACAACCAG CTGATGCCAA 2160 

TACACAGCCT AACGCAAATG CTGGTGCTCA AGCTAATCCT ACAGCACAGC CAGCTGCACC 2220 

TGCCAACCAA GGACAACCAG CAGTACAACC AGCAAACCAA GGTGGACAGG CTAATCCAGC 2280

AGGAGGAGCA GCACAACCAA ATACACAACC AGCTGGACAA GGTGATCAAG CTSTCCGAA 2340

TAACGCTGCA CAAGCACAAC CTGGAAATCA AGCAACACCG GCAAACCAAG CAGGTCAAGG 2400

AAATAACCAA GCAACACCTA ATAATAATGC AACACCGGCA AATCAAACAC AGCCAGCGAA 2460

TGCTCCAGCA GCAGCGCAAC CAGCAGCACC TGTAGCAGCA AACGCACAAA CTCAAGATCC 2520 

AAATGCTAGC AATACTGGTG AAGGCAGTAT TAATACGACA TTAACATTTG ATGATCCTGC 2580

CATATCAACA GATGAGAATA GACAGGATCC AACTGTAACT GTTACAGATA AAGTAAATGG 2640

TTATTCATTA ATTAACAACG GTAAGATTGG TTTCGTTAAC TCAGAATTAA GACGAAGCGA 2700 

TATGTTTGAT AAGAATAACC CTCAAAACTA TCAAETAAA GGAAACGTGG CTGCATTAGG 2760
TCGTGTGAAT GCAAATGATT CTACAGATCA TGGTAACTTT AACGGTATTT CAAAAACTGT 2820

AAATGTAAAA CCAGATTCAG AATTAATTAT TAACTTTACT ACTATGCAAA CGAATAGTAA 2880

GCAAGGTGCA ACAAATTTAG TTATTAAAGA TGCTAAGAAA AATACTGAAT TAGCAACTGT 2940

AAATGTTGCT AAGACTGGTA CTGCACATTT ATTTAAAGTA CCAACTGATG CTGATCGTTT 3000 

AGATTTACAA TTTATTCCTG ACAATACAGC AGTTGCTGAT GCTTCAAGAA TTACAACAAA 3060

TAAAGATGGT TATAAATACT ATTCATTCAT TGATAATGTA GGTCTATTCT CAGGATCACA 3120 

TTTATATGTC AAAAATAGAG ATTTAGCACC GAAAGCAACT AACAATAAAG AATATACTAT 3180

TAATACTGAA ATCGGTAACA ATGGTAATTT TGGTGCTTCA TTAAAAGCAG ATCAATTTAA 3240

ATATGAAGTA ACATTACCAC AAGGTGTAAC TTACGTTAAT AATTCATTAA CTACAACATT 3300

CCCTAATGGT AATGAAGACA GTACAGTATT GAAAAATATG ACGTTAATT ATGATCAAAA 3360

TGCAAATAAA GTTACATTTA CAAGCCAAGG TGTGACAACG GCACGTGGTA CACACACTAA 3420

AGAAGTTTTA TTCCCAGATA AATCTTTAAA ATTATCATAT AAAGTTAATG TTGCGAATAT 3480 

CGATACACCT AAAAATATTG ATTTTAATGA AAAATTAACA TATCGTACTG CTTCAGATGT 3540

TGTAATTAAT AATGCGCAAC CAGAAGTaCA CTAACTGCAG ATCCATTTTC AGTAGCGGTT 3600 

GAAATGAACA AAGATGCGTT GCAACAACAA GTAAACTCAC AAGTTGATAA TAGTCATTAC 3660 

ACAACAGCAT CAATTGCAGA ATACAATAAA CTTAAACAAC AAGCAGATAC TATTTTAAAT 3720 

GAAGATGCGA ATCATGTTAA AACTSCAAAT CGTGCATCTC AAGCGGATAT TGATGGTTTA 3780

GTAACTAAAT TACAAGCTGC ATTAATTGAT AATCAAGCAG CAATTGCTGA ATTAGATACT 3840 

AAAGCTCAAG AAAAGGTTAC AGCAGCACAA CAAAGTAAAA AAGTTACGCA AGATGAAGTT 3900 

GCAGCACTTG TAACTAAAAT TAACAATGAT AAAAATAATG CAATCGCAGA AATTAATAAA 3960

CAAACTACAG CACAAGGTGT CACAACTGAA AAAGATAATG GTATCGCAGT GTTAGAACAA 4020

GATGTGATTA CACCAACAGT TAAACCTCAA GCGAAACAAG ATATTATCCA AGCAGTTACA 4080

ACTCGTAAAC AACAAATTAA AAAGTCAAAT GCATCATTAC AAGATGAAAA AGATGTAGCA 4140 

AATGATAAAA TTGGTAAAAT TGAAACAAAG GCAATTAAAG ATATTGATGC AGCAACAACA 4200

AATGCACAAG TAGAAGCCAT TAAAACAAAA GCAATCAATG ATATTAATCA AACTACACCT 4260

GCTACAACAG CTAAAGCAGC AGCTCTTGAA GAATTTGACG AAGTTGTTCA AGCACAAATT 4320

GATCAAGCAC CTTTAAATCC TGATACAACA ffiT G AAG AAG TAGCGGAAgC TATTGAACGT 4380

ATTAATGCAG CTAAAGTTTC TGGTGTTAAA GCAATTGAAG CGACAACGAC TGCACAAGAT 4440 

TTAGAAAGAG TTAAAAACGA AGAAATCTCA AAAATTGAAA ATATTACTGA CTCTACGCAA 4500

ACAAAAATGG ATGCCTATAA TGAAGTTAAA CAAGCTGCAA CAGCTAGAAA AGCTCA&AT 4560

GCTACAGTTT CAAATGCAAC AAATGAAGAA GTAGCAGAAG CTGATGCAGC AGTAGATGCA 4620
GCTCAAAAGC AAGGTTTACA TGACATCCAA GTTGTTAAAT CAAAACAGGA AGTTGCTGAT 4680

ACAAAATCAA AAGTATTAGA TAAAATCAAT GCAATTCAAA CACAAGCAAA AGTTAAACCT 4740

GCAGCTGATA CGGAAGTAGA AAACGCATAT AATACACGTA AACAAGAAAT TCAAAATAGC 4800

AATGCTTCAA CTACAGAAGA AAAACAAGCT GCATATACAG AATTAGATAC TAAAAAGCAA 4860

GAAGCAAGAA CAAATCTTGA TGCTGCAAAT ACAAACAGTG ATGTAACAAC AGCTAAAGAC 4920

AATAGTATTG CTGCAATTAA TCAAGTTCAA GCTGCCACm CTAAGAAATC GGATGCAAAA 4980

GCGGAAATCG CTCAAAAAGC AAGTGAACGT AAAACAGCAA TTGAAGCAAT GAATGATTCG 5040

ACTACTGAAG AACAACAAGC AGCGAAAGAC AAAGTGGATC AAGCAGTAGT TACTGCAAAC 5100

GCTGATATAG ATAATGCTGC AGCAAACAAT GATGTGGATA ATGCAAAAAC TACAAATGAA 5160 

GCTACAATCG CAGCCATTAC ACCTGATGCA AATGTTAAAC CAGCAGCAAA ACAAGCAATT 5220

GCAGATAAAG TACAAGCTCA AGAAACAGCA ATTGATGGAA ATAACGGCTC AACAACTGAA 5280

GAAAAAGCAG CTGCTAAACA ACAAGTTCAA ACTGAAAAAA CAACAGCTGA TGCCGCAATA 5340

GATGCAGCAC ATACAAATGC GGAAGTTGAA GCGGCTAAAA AAGCAGCAAT TGCTAAAATT 5400

GAAGCGATTC AGCCAGCAAC AACAACTAAA GATAATGCGA AAGAAGCAAT TGCTACGAAA 5460

GCGAATGAAC GTAAAACAGC AATCGCTCAA ACGCAAGACA TTACTGCTGA AGAAATTGCA 5520

GCGGCTAATG CGGACGTAGA TAATGCTGTG ACACAAGCAA ATAGCACAT TGAAGCTGCT 5580

AATAGTCAAA ATGATGTAGA CCAAGCGAAA ACGACAGGTG AAAATAGTAT TGATCAAGTA 5640

ACACCAACAG TTAATAAAAA AGCAACTGCA CGTAATGAAA TCACAGCAAT TTTAAATAAC 5700

AAATTGCAAG AGATTCAAGC tACGCCAGAT GCAACAGATG AAGAAAAACA AGCAGCTGAT 5760

GCTGAAGCAA ATACTGAAAA TGGTAAAGCA AATCAAGCCA TTTCAGCAGC AACTACTAAC 5820

GCACAAGTTG ATGAAGCTAA AGCAAATGCA GAAGCAGCGA TTAATGCGGT AACACCAAAA 5880

GTTGTGAAGA AACAAGCGGC TAAAGATGAA ATTGATCAAT TACAAGCAAC GCAAACAAAT 5940

GTTATCAATA ATGATCAGAA CGCTACAA:A GAAGAAAAAG AAGCAGCTAT TCAACAATTA 6000

GCAACAGCAG TTACAGACGC GAAAAATAAT ATTACAGCTG CAACTGATGA TAATGGTGTA 6060

GATCAGGCGA AAGACGCTGG AAAGAATTCA ATTCAAAGCA CGCAACCAGC AACAGCGGTT 6120 

AAATCAAATG CTAAAAATGA TGTTGATCAA GCTGTGACAA CTCAAAATCA AGSATTGAT 6180

AATACAACTG GTGCTACAAC TGAAGAGAAA AATGCAGCAA AAGATTTAGT TTTAAAAGCT 6240

AAAGAAAAAG CGTATCAAGA TATCTTAAAT GCACAAACAA CTAATGATGT TACGCAAATT 6300 

AAAGATCAAG CAGTTGCTGA TATTCAAGGT ATTACTGCAG ATACAACAAT TAAAGATGTT 6360

GCGAAAGATG AATTAGCAAC AAAAGCAAAC GAACAAAAAG CGCTTATTGC ACAAACTGCA 6420
GATGCGACTA CTGAAGAAAA AGAACAAGCA AATCAACAAG TAGACGCACA ATTAACACAA 6480

GGTAATCAAA ATATTGAAAA TGCACAGTCA ATCGATGATG TAAACACTGC AAAAGATAAT 6540

GCAATTCAAG CAATTGACCC AATTCAAGCA TCAA(AGATG TTAAAACGAA TGCAAGAGCG 6600 

GAATTGCTAA CTGAAATGCA AAATAAAATA ACTGAAATAC TTAATAATAA TGAGACTACT 6660

AATGAAGAAA AAGGTAACGA TATTGGACCA GTTAGAGCAG CATATGAAGA AGGTTTAAAT 6720 

AATATTAATG CAGCAACTAC TACAGGTGAT GTAACTACTG CTAAAGATAC AGCAGTACAA 6780

AAAGTTCAAC AACTTCATGC AAATCCTGTT AAGAAACCAG CAGGTAAAAA AGAATTAGAT 6840

CAAGCTGCAG CTGATAAGAA AACACAAATA GAACAAACAC CAAATGCATC ACAACAAGAA 6900

ATTAATGATG CAAAACAAGA AGTTGATACT GAATTAAATC AAGCGAAAAC AAATGTCGAT 6960

CAATCATCAA CAAATGAATA TGTTGATAAT GCAGTTAAAG AAGGAAAAGC TAAAATTAAT 7020

GCAGTTAAAA CATTTAGTGA GTACAAAAAA GATGCTTTAG CTAAAATTGA AGATGCATAT 7080 

AATGCTAAAG TAAACGAAGC GGATAACTCT AACGCATCGA CTTCAAGTGA AATTGCTGAA 7140

GCGAAACAAA AACTTGCTGA ATTAAAACAA ACTGCGGATC AAATGTTAA TCAAGCTACT 7200

TCTAAAGATG ACATTGAAGT TCAAATTCAT AATGACTTAG ATAATATTAA CGATTACACA 7260

ATTCCAACAG GTAAAAAAGA ATCAGCTACA ACAGATTTAT ATGCTTATGC AGATCAGAAG 7320

AAAAATAATA TTTCAGCTGA CACTAATGCA ACACAAGATG AAAAGCAACA AGCAATTAAG 7380

CAAGTTGACC AAAATGTTCA AACTGCATTA GAAAGCATTA ATAATGGTGT GGATAATGGT 7440 

GACGTTGATG ATGCATTAAC ACAAGGTAAA GCAGCAATTG ATGCTATTCA AGTAGATGCT 7500

ACTGTTAAAC CTAAAGCGAA CCAAGCTATT GAAGTTAAAG CAGAAGATAC GAAAGAATCT 7560

ATTGATCAAA GTGACCAGTT AACT5CTGAA GAAAAAACTG AAGCATTAGC AATGATTAAA 7620

CAAATTACAG ATCAAGCTAA ACAAGGTATT ACTGATGCAA CAACAACTGC TGAAGTTGAA 7680

AAAGCGAAAg cTCaAGGACT TGAAGCATTT GATAACATTC AAATCGACTC AACAGAAAAA 7740

CAAAAAGCTA TCGAAGAATT AGAAACTGCA CTAGACCAGA TTGAAGCAGG TGTAAATGTC 7800

AACGCTGATG CTACAACTGA AGAAAAAGAA GCGTTTACGA ATGCTTTAGA AGACATTTTA 7860

TCAAAAGCAA CTGaAGATAT TTCTGATCAA ACTACAAATG CAGAAATCGC TACTGTCAAA 7920

AATAGTGCGC TTGAACAACT TAAAGCACAA CGTATTAATC CTGAAGTTAA GAAAAATGCT 7980 

TTGGAAGCAA TCAGAGAAGT GGTTAACAAG CAAATAGGAA tAATTAAAAA TGCAGATGCA 8040

GATGCATCGG CGGAAAGAnA TTGCACGTAC GGGATTTAGG TAGATATTTT GGACCGATTT 8100 

GCTGGATAAA TTTAGGGTnA AACCCCAACC AATGCCGAAG TTGCCTGAAT TACCA 8155 
(2) INFORMATION FOR SEQ ID NO: 64: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1630 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

TTTGGGCCCA TTGGGACCGG TATGTtTTiG AGATATCTGA AATTGTAAAA CGTGTTGGAT 60 

CTGTTTTATT TGCAGCACCC ATACTGGAAA TCACTTTAAT CCCTCGGTCA AGACACTCTT 120 

TCATTAAGTG TACTTTGTAC ATTATTGTAT CACTTGCATC TACAAAATAA TCTATATCGT 180 

AGTTATCGAA AATTTCTTCA TATGTCTCTT CTGTATAAAA CATATGTAAG GGCEGACTT 240

TACAATCTGG ATTAATTAAT TTAATACGTT CTTCCATCAA AGAAACTTTA CTTTGTCCTA 300 

CCGTTGTAGT TAAAGCGTGT AATTGTCTGT TTACATTTGT AATATCAACA TCATCTTTAT 360

CTATTAATAT AATATGACCA ATATTCGTTC TTGCTAATGC TTCAGCAGCA AATGAACCAA 420

CACCTCCAAC GCCAAGTATG ACAACAGTTT GTTGCTTCAA TAAATCTAAA CCTTGTTGTC 480

CAATCGCTAG TTCATTTCTT GAAAATTGAT GTTTCATTAT TTTACCTCTT TCACTGATTT 540 

ATACATAAGT ACATAGTAAC TTAAAATTTT ATATTTAGCA TTATCACTTT GATTATTTTC 600 

CCAAAATTCA ACGAGGAAAC ATTTATTAAA CGCTAAAAA CCCAACTAAT TCTTTATTAA 660

AAACTTAAAG AAACGCATAA AAATACGCAA GACAAAGTCT TGCGTATCGA TAGAGTCCGT 720 

ATTGCCGTAG TTATAATAGC TTGATCATTC GGCCTGTTAT ATACAGGTGG GTGCCCTGTT 780

TCTTGTTTTG TACGTCCTTC ATATAAGGCG TGTACGCTGC AAGAAAACCC ATTGGGCTCC 840

CTTGATCAAA GAGTGTTAGG CCCAAATTAA AAAGCAAACT TACGAACAAC TCAGATGACT 900 

ATCTTATGAT GTTATATTAC CACATAATTA AAATTAATGA AATTATAACA AACCAAAGTT 960

TATTGATTTT TTAAAATTTA GTGACGAATT CGCAAAGAAA GTTCTTCTAA TTGTTTATCA 1020

GAAACTTCAC TAGGCGCATT CGTTAATAAA CATGTAGCAG ATGCTGTTTT AGGGAATGCG 1080 

ATTGTATCTC TCAAGTTTGT TCTATTAGTC AATAACATGA CTAATCGGTC tAATCCTAAT 1140

GCAATACCGC CATGTGGTGG TGCACCATAT TTAAATGCAT CTAGTaAGAA GCCGAACTGT 1200

TCCTgTGCTT GTTCTTTAGT AAATCCAAGA ACTTCGAACA TTTTTCTTG TAACTCACCA 1260 

TCATGAATTC TGATTGAACC GCCACCTAAT TCATAACCAT TTAATACTAT GTCATAAGCA 1320

TTTGCCTCAG CTTCtTCTGG CGCAGTGCCA AGCTTAGCAA TATCAGCTTC TTTTGGAGAT 1380 

GTAAATGGAT GATGTGCTGC AACGTAACGT TTCGCATCTT CATCATATTC TAATAATGGC 1440

CAATCTGTCA CCCATAAGAA GTTTAATTTT GTTTCATCGA TTAAACCTAA TTCTTTAGCT 1500 

AATTTGACAC GTAATGCACC TAAACTTTGT GCAACGACAT TTGGTttGTC TGCAACAAAC 1560
ATTACTAAGT CACCAGCTTC AGCACCAGTT AATGTAAGTA ATGTTTCAAC ATTTTCTGTT 1620

cAAAGAAACG 1630



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

CAATTGGACA TCTTGTATGA AAAGGACAAC CTTGCGGCGG ATTACTTGGC GAAGGTAATT 60 

CTCCTTTTAA TATAATTCTA TTGTTATTAT GTTTATCAAT TTGTGGTATT GATGAAATCA 120

ACGCTTTTGT ATATGGATGT TTGGGATTTT CATAAATTTC TTTATCAGS GCGATTTCAA 180

CTATATGACC TAAATACATA ACTCCAATGA CATCACTTAT ATGTTTTACT ACACTTAAAT 240

CATGTGCGAT AAATAAATAG CTTAAGTTAA ATTGTTCTTG TAAATCTTTT AATAAATTCA 300 

GTACTTGAGA TTGAACAGAT ACATCTAATG CACTTACAGG CTCATCAGCA ACAATTAAAC 360

TCGGACGCAA AGCCAATGCT CTTGCAATTC CCACTCTTTG TCTCTGTCCA CCTGAAAATT 420

CATGTGCATA TTtATAATAT GCATCTTCAC TTAGGCCAAC ACATTTTAAT AAATATAGTA 480

CTTCTTTTTT TATTTCTTCT TTTGGCAATT TTTTATAATT TAAAATAGGT TCTGAAATGA 540

TATCTCCAAC CATTTGCATC GGATTCAATG ATGCATACGG ATCTTGAAAT ATCATCTGAT 600

ATTGTTGTCG TGATTTTCTG AGTTTTTTAC CTTGTAATCT TGTTATATCT TCACCATTAA 660 

CAATTATTGA GCCTGAAGTT GCATCTTCAA GCCTGATAAT CACTTTACCT AACGTTGACT 720 

TACCACAACC CG 732 



(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5838 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

AATATATTCA TATGTTTCAT CAACAATATT AGCTGCTTTT TGAATTAAAG CAATTTCGTC 60 

AGCATCTTTG ACGTCTCTAA TTTTATCTAC AGTATTAGAA ATGCTTATTA ATGATATACG 120 

GCTTTTATTT AATTCAAGGT ATGTATCATA ACTTACATGA TGCCCCTCAA AACCTACATT 180 

TTCAAAATTT TCTTGGTGTA GCAATTCTTT AATCTCACCA ATAATAGTAG ATTTACGATT 240 

AATAATTTCA TAATTTGGCG CCTGCTTAGT TGCTTGATCA ATATATCTAA AGTCTGTTAT 300 

CAAATATTGT TTATCTTTAG ATATGATAAG TGCTCCACTG GTACCAGTAA AACCTGATAA 360 

ATATCTTCTA TTGTAATCCG AAAGAATGaT AATCGATCT AAATGTTTTT GTTCTAAAAT 420 

ACGATGCACT TGTGTTATTC TGCTCATCTT ATTACCTCCT AGTGAAcGgT tTGTGCaTTT 480 

CAACTTTATA CATTAAAATA ATATCATAAT AAGGATAAAA AATAATAGAT ATTGATTTTA 540 

GGGAGATAGT AATGAAAAAA TTGGTTTCAA TTGTTGGCGC AACATTATTG TTAGCTGGAT 600 

GTGGATCACA AAATTTAGCA CCATTAGAAG AnAAAACAAC AGATTTAAGA GAAGATAATC 660 

ATCAACTCAA ACTAGATATT CAAGAACTTA ATCAACAAAT TAGTGATTCT AAATCTAAAA 720 

TTAAAGGGCT TGAAAAGGAT AAAGAAAACA GTAAAAAAAC TGCATCTAAT AATACGAAAA 780 

TTAAATTGAT GAATGT1ACA TCAACATACT ACGACAAAGT TGCTAAAGCT TTGAAATCCT 840 

ATAACGATAT T GAG AAA GAT GTAAGTAAAA ACAAAGGCGA TAAGAATGTT CAATCGAAAT 900 

TAAATCAAAT TTCTAATGAT ATTCAAAGTG CTCACACTTC ATACAAAGAT GCTATCGATG 9 60 

GTTTATCACT TAGTGATGAT GATAAAAAAA CGTCTAAAAA T2KCGATAAA TTAAACTCTG 1020 

ATTTGAATCA TGCATTTGAT GAT AT T AAAA ATGGCTATCA AAATAAAGAT AAAAAACAAC 108 0 

TTACAAAAGG ACAACAAGCG TTGTCAAAAT TAAACTTAAA TGCAAAATCA TGATAGGAGT 1140 

CTTTTAATGC GTAATATAAT ATTTTATCTT GTACTTATTA TTGCTGCGAT TGGATTAGTA 1200 

ATGAATCTAG ATGCCTTTAT TTTTTCAATC GTCAGAATGT TAATCAGCTT TGcgTAaTAG 12 60 

CTGGTATTAT TTATCTGATT TATTATTTCT TCATCTTAAC TGAAGACCAA CGCAAATATC 132 0 

GCAAAGCAAT GCgTrAaGTA TAAAAGAAAT CAAAGAAGAA AATAGATAAA AAAACGGAAG 1380 

CACTTGTAGG TAAAATAGTC TACGTGCTTC CATTTTTTAT TCTAAAAACT ACTTTCTAAA 14 4 0 

CATCCATTCA TCTGAACGAT ATTTTTCAGT TAATTCTTCC ACTTCTGCCA ATTGAGCTTC 15 00 

TGtTAATTCA AGTGGCTTTA ATTCTATATT TAAACCTTTC TTAAAACCTT TCTCGAAAGC 15 60 

TTCTTCCATT TGACTAATAG TAATGTGTTC ATCTGAAATA TCATTGATGGCAACTGCTTT 162 0 

TTCAACGAAT GCCTCTTTCA TTTTTAATTT TAATCTTTCA TTTTTATAAA T r AAC AT AT C 168 0 

AAACAGTTCA TCAATATCAA TATCTTGTAA AATCGAACCG TGTTGGAGGA TTACGCCCTT 174 0 

TTGTCTCGTT TGAGCACTCC CAGCAATCTT ACGGCCTTCA ACAACTAGCT CATACCAACT 1800 

TGGTGCATCA AAAC AC AC T G AACTTCGAGG TTGTTTTAAT TTTTGACGCT CTTCAGGCGT 18 60 

TTTAGGTACC GCAAAATAAG TATCAAATCC TAAGTTTTTA AATCCTTCTA ATAATCCTTG 1920 

TGAAATCACT CTGTACGCTT CTGTAACTGT AGAAGGCATA TTCGGATGCG ATTCAGGCAC 198 0 

AATCACACTG TAAGTTAACT CTTTATCATG 'AGCACCCCA CGGCCACCAG TTTGACGCCT 2040 

TACGAGACCA AAACCTTTCT CTTTAACCTT ATCAATATCA ATTTCTTTTT GTAGCCTTTG 2100 

GAAATACCCT ATTGATAATG TTGCAGGATT CCATGTGTAA AAACGTATAA CTGGATCAAT 2160 

TTCACCTCTA GAGACAAAAT TTAATAACGC TTCATCCATT GCCATATTAT AATATGGTC 2220 

TTTACTTCCT GTATTAATAA AATTCCAAGT TTCAGTCATA TTCAAGAACT CCTATCAGCA 22 8 0 

AAATGTATAA TATTTGATTC GCTAATTAAT CAATTTAACT AAATGAATAA TAATTGCAAT 234 0 

TCTTTAGTGA AATATTTTGA TAATTTGACC TAACAGTCTT ATAATTATAT TATCGTTTAA 24 00 

TTAGGGAGGA TGCAAGATGA GTGCTAGTTT GTACATCGCA ATAATTTTAG TTATAGCAAT 24 60 

TATTGCTTAT ATGATTGTTC AACAAATTCT TAACAAGCGA GCTGTTAAAG AAT TAG AT C A 2 5 20 

AAATGAATTC CAT AAT GGGA TTAGAAAAGC TCAAGTCATC GAT G T TAG AG AGAAAGTTGA 25 80 

C T AT G AC T AC GGTCACATTA ATGGGTCTCG CAATATTCff ATGACAATGT TCAGGCAACG 2 64 0 

ATTCCAAGGA TTAAGAAAAG ATCAACCGGT ATACTTATGT GATGCCAATG GGATTGCTAG 2 7 00 

CTATAGAGCC GCTCGTATTT TGAAAAAGAA TGGATATACA GATATCTATA TGTTAAAAGG 27 60 

CGGCTATAAA AAATGGACTG GAAAAATAAA GTCTAAAAAA TAGTTTTTGT AAATTTAATA 2820 

TACGATTTAA TAAAATCTGA GTGTTAATTG ATCATCAATA AC AATAC T C A GATTTTAATT 28 80 

TTTTAACAAA GTCTGTTACT ATATTTCTCT AGCTTCACTG ATCATTAAAC TTAGTTTCAG 2 94 0 

CATAATAAAG AAAGTTCAGC TCATTTTCAA TACGATTCAA TTACCGCAAT CTAAAAAATG 30 00 

AAAAGACAAT TTCTATGAAA GAATAATACC AAACCCTAAG AGT TAT TACT TCGGTTTAGT 3060 

TTTCTTGTTT AAATAGAAAT TGTCTTTTTC AATTGATTTT G AAAC CAT T A TCCTTAAATC 3120 

TTCATACAAA GTTAGAATAA TAATTCTCGG AATATGTGTT TAATACTTTA TTTTTCCTGT 3180 

TTAAGATTTT C AAAC T T T AA TATTGGTTTA CGAGCAGCTG TAGCTCGTC TAATCGATCA 32 4 0 

ATCACAGTTG TATGTGGTGC TTCTAGCacT TTATCAGGAT CATTTTTAGC TTCTTCAGCA 3300 

ATACTAATTA ATGTATCGAT AAAAT AAT C A AGTGTTTCTT TAGACTCTGT CTCAGTCGGT 33 60 

TCAATCATCA TACCTTCTTC AACATTTAAT GGGAAGTATA TTGTTGGTGG AT G T AC AC C G 3420 

AAATCTAATA ATCGCTTAGC CATGTCTAAA GTACGTACAC CAAATTCTTT TTGACGCACA 34 80 

CCACTTAACA CAAACTCGTG T T T AC AAT AT TGTTTATAAG GTATTTCAAA GTGTTTAGAT 354 0 

AAACGTGCTT T AAT AT AAT T CGCATTAAGA ACCGCTGCTT C AG AAAC C T C TTTAAGTCCA 3 600 

GTTGCTCCCA TAG T T C G AAT AT AC G T ATAA GCTCTTAAGT AAATACCAAA GTTACCATAA 3660 

AATGGTTTTA CACGTCCGAT AGAATTTTTA ATGTCATTAT CAT ATT T AAA TTTGTCGCCA 37 2 0 

TCTTTAATAA CCATTGGCTT TGGTAAGTAA CTTGCTAGTT CTTTTACTAC ACCGACTGGA 37 8 0 

CCTGAACCAG GACCGCCACC ACCATGTGGA CCAGTAAATG TTTTATGCAA GTTAAATGA 384 0 

ACAGCATCAA ATCCCATATC TCCTGGGCGA ACTTTGTCCA TAATAGCGTT TAAATTCGCA 3900 

CCATCATAAT ATAATAGACC ACCAGCATTA TGGACGATTT CACGGATTTC CATAATATTT 3960 

TTTTCGAAAA TACCTAAAGT GTTTGGATTA GTTAACATAA TAGCTGCTGT ATTTTCATTT 4 020 

ACAACACGTT TCAAGTCATC AATATCAACT TCGCCACGTT CGTTTGATTT TACAGTAACT 4 08 0 

GATTTAAATC CTGCAAATGa AGCTGAGGCT GGaTTCGTAC CATGCGCAGA ATCTGGcACA 414 0 

ATGACTTCAT CACGATGACC TTCACCATTA TTCTCATGGT AAGCTTTAAA TATCATCAAT 4 2 00 

GCAGTCCATT CACCATGTGC GCCAGCAGCT GGTTGAATG TCACCTCATC CATACCAGTA 4 2 60 

ATTTCTTTTA ATTCTTCTTG CAAACTATAA ATAATTTCTA ATGAACCTTG AACTTGATCT 4320 

TCATCTTGTA ATGGATGTGA TTCACTAAAT CCTGGTATTC TAGCAACCTT TTCATTAATT 4 380 

TTAGGGTTAT ACTTCATCGT ACATGAACCC AATGGATAAA ATCCGTTGTC TACACCGAAA 4 4 40 

TTTTTATTTG AAAGTTCAGT ATAATGACGT ACTAAGTCTA GTTCAGCAAC TTCAGGAAAC 4 500 

TCCGCTTTGT TTTTACGAAT AAATTTATCA TCTAACAATG ACTCAACAGA ATTTGTTTTA 4 5 60 

ATATCACTTT TTGGTAATGA ATATGCATAT CTGCCTTCAC GAGATCTTTC AAAAATTAAT 4 62 0 

GGACTTGATT TACTAGTCAT TTAACTCACC AGCCTTTTCT ACAAATGTAT CGATTTCATC 4 68 0 

TTTTGTTCTT AATTCAGTTA CAGCTATTAA CATGTGATTT TTAAAGTCGT CTGAAACAAC 474 0 

ACCTAAATCA AAACCACCGA TAATATTGTA CTTCACTAAT TCCTCGTTAA CTTGTTGAAT 4 800 

TGGTTTGTCA AATTTGACTA CAAACTCATT GmnAAGnTGT ACATCTAAT ACTTCAAAAC 4 8 60 

CTTTTTTAAT AAATTGTTGT TTAGCATAGT TAGCATGTTC TATATTTTGA ACTGCAATAT 4 920 

CATAGATACC TTGTTTACCA AGTGCTGACA TTGCAATTGA TGaCGcTAAA GCATTTAATG 4 98 0 

CTTGGTTAGA ACAAATATTA GATGTCGCTT TATCGCGTCG AATATGTTGT TCACGTGCTT 504 0 

GTAATGTTAA TACAAAGCCA CGATTACCTT CATCATCTTG TGTTTGACCG ACTAATCTAC 5100 

CTGGCACTTT ACGCATTAAC TTTTTCGTCG TTGCAAAATA TCCACAATGT GGCCCACCGA 5160 

ATTGAGCAGG AATTCCGAAT GGCTGAGTAT CACCTACAAC AATATCTGCA CCAAATGAAC 5220 

CTGGAGGTGT AAGTAATCCC AATGCTAATG GATTTGCATA TACGATAAAT AATGCTTTTT 52 8 0 

TATCTTCAAT AAAGCTATGA ATCTTTTCAA GATCTTCAAT TGAACCGTAA AAGTTTGGAT 5 34 0 

ATTGTACTGC AACAGCTGCT GTTTCATCAT CCACTGCTGC TTCTAATTTT TTCAAATCTG 5 4 00 

TAACAGTGCC ATCTAAATCG ATTTCCACTA CTTCGAATTC CTTACGCGTCTTAGCATAAG 54 60 

TATGAAGTAC TTGTAATGCT TGATAATGTA AACCTTTTGA GACTACAATT TTATTTTTCT 5 52 0 

TTGTTTGACT AAATGCTAAG ATACATGCTT CAGCAAAGCT AGTCATCCCA TCATACATAG 5 58 0 

AAGAATTTGC TACATCCATA TCTGTTAATT CACAAATTAA AGTTTGGAAC TCAAAAATGG 5 64 0 

CTTGTAATTC ACCTTGAGAA ATTTCCGGTT GATATGGCGT ATATGCTGTG TAAAATTCTG 5700 

ATCTTGAAAT CATAGCATCC ACAACTGATG GCGCGTAATG ATCATAAACA CCAGCACCCA 5760 

rAAATGATGT ATGCGTTTCT TTAGTGATAT tCTTGCTkGC AATGGGGATT TAAACnTCTA 5820 

AGTAACGGTG TnTCCGCG 5838 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18355 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

ATnATAATTG GCTTTGCTAA TAATTACTTC CCTGAATTAC aAGTATTAGC AAACGAAATA 60 

AAATCTGATA TGGCTAGTTC ATTAAAACAA TGATATTTTT ATTTAAATTT TTaAAGCTTT 120 

GTACGAAATT GTACAAAGCT TTTTTGGTGC GTATTGTATG GGCAACAACT TGSGATGAA 180 

AATCCGTTAC AGGATTGGTA ATAGGAAATG TTAGCGAAAG ACAAGGGTAT CCATTGTAGA 240 

TTAACAAAAG GACGTTTCCA CAAGTGTGGG TTATTCTCAC TAAAGCAATA CGCAGAGACA 300 

ACTTACGTAA AATTTTGAAC TGACTAGAAC GGAACTTCTA CTCAATTATT GATAAAAATT 360 

TTCAAAAAGA CTTGAATGTG CTGAGAATAC GAAGTTTATG GAAGGATTAT CAAAATATAA 420 

ATGTGCATTC ATTTACAACC TTTATTGACA ATGATTCTCA ACTAATATAG TATATAATCA 480 

AATCGTAATA GTTACGATTT GTTTTCTGCA ACTTTTTTGA AGTTTTAGTT GAGGTGAAAA 54 0 

CAATAAAAGC ATCTAAGTGA ATGTAGTTAA CGGA&ACTG CATTCGCTTG TAGAGCCACA 600 

AGAAGCAACT TTAAATAAGG TTTACGGTTG CATTTTGATA CAACAACCGA TTACTAAGTC 660 

ATGCTTTCCA CTTTGCGGGT TAGCATGACT TACCTAATAG ATAGAGCTAT TAGGTTCAGC 720 

TTCTAAAAAA TTACAGTTTT AGAGGAATAC AGTTGcTTGc tTCGCAACAA CTGCATAAGA 7 80 

GCCATGGTTT TCGCTTTTGC GAAT T AGCAT GACTTACCTA C TAG AT AG AG CTATTAGGTT 84 0 

CATCTTCTAA AAAAT T AC AG GTTTAGAGGA ATACAGTTGT TTGcTTCGCA ACAACTGCAT 90 0 

AAGAGCCTCT AGTAATTAAA ATTACAGAGG CTCTAAAAAT ACATCTAAAG GAGTGTCGTA 960 

TGAATCGGCA GGTTATAGAA TTTTCTAAGT ATAATCCTTC GGGGAATATG ACGATACTTG 102 0 

TTCATTCAAA ACATGATGCT AG T GAAT AT G CATCTATCGC CAATCAGTTG ATGGCCGCAA 108 0 

CACATGTATG CTGTGAACAG GTAGGCTTTA TAGrATCAAC ACAAAATGAT GATGGTAATG 114 0 

ATTTTCACTT AGTTATGAGC GGTAATGAAT TTTGCGGTAA TGGACGATG TCATATATAC 1200 

ATCATTTGCA GGAAAGTCAT TTGCTTAAAG ACCAACAGTT TAAGGTGAAG GTGTCTGGCT 12 60 

GTTCGGATTT AGTGCAATGC GCAATTCATG ATTGCCAATA CTATGAAGTT CAAATGCCAC 1320 

AAGCCCATCG TGTTGTGCCA ACAACAATTA ATATGGGTAA TCATTCATGG AAAGCAATAG 1380 

AAATTATTTA TGAAACATAT G T AC AT TAT G TGATTCCAGT TAAACAAGTA ACAACAGAAA 14 4 0 

TTCAACATTT GGTTGAAGCG TTTGTGCGTG AgcAACAATG GAG T C AC AAA TATAAAACAG 15 00 

TAGGTATGAT GCTTTTTGAT GAACAACGTC AATTTTTACA GCCATTAATC TATATACCAG 15 60 

AAATTCAAAG TTTAATTTGG GAA8&TAGCT GTGGTTCTGG TACAgcATCA ATTGGGGTTT 1620 

TTAATAATTA TCAACGTAAT GACGCATGCA AAGATTTTAC AGTACATCAG CCAGGGGGCA 1680 

GTATTTTAGT GACATCAAAG CGATGTCATC AATTGGGATA TCAAACTTCA AT T AAAGGAC 17 4 0 

AGGTTACAAC TGTAGCTACA GGaAAAGCAT ATATAGAATA AGGAGCCTACAATGAATAAC 1800 

TTTAATAATG AAATCAAATT GAT ATT AC AA CAATATTTAG AAAAGTTTGA AGCGCATTAC 18 60 

GAGCGTGTAT TACAAGACGA TCAATATATC GAAGCATTAG AAAC AT T GAT GGATGACTAT 192 0 

AGTGAATTTA TTTTAAATCC TATTTATGAA CAACAATTTA ATGCTTGGCG TGACGTTGAA 1980 

GAAAAAGCAC AATTaATAAA ATCACTGCAA TATATTACAG CGCAGTGTGT TAAACAAGTG 204 0 

GAAGTCATTA GAGCGAGACG T C TAT TAG AC GGACAGGCGT CTACCACAGG TTACTTTGAC 2100 

AATATAGAAC ATTGTATTGA TGAAGAGTTT GGACAATGTA GTATAGCTAG C AAT G AC AAA 2160 

TTATTGTTAG TTGGTTCAGG TGCATATCCA iTGACGTTAA TTCAAGTAGC AAAAGAAACA 222 0 

GGTGCTTCAG TTATCGGTAT TGATATTGAT CCACAAGCCG TTGACCTAGG GCGCAGAATC 22 80 

GTTAACGTCT T AGC AC C AAA T G AAGAT AT A ACAATTACGG ATCAAAAGGT ATCTGAACTT 234 0 

AAAG AT AT C A AAGAT G T G AC GCATATCATA TTCAGCTCGA CAATTCCTTT AAAGTASGC 2 4 00 

ATTTTAGAAG AATTATATGA TTTAACAAAT GAAAATGTCG TAGTTGCAAT GCGCTTTGGT 24 60 

GATGGCATCA AAGCAATATT TAATTATCCG T C AC AAG AAA CAGCGGAAGA TAAGTGGCAA 2520 

TGTGTGAATA AACATATGAG ACCACAGCAA ATTTTTGATA TAGCACTTTA TAAAAAAGCA 25 80 

GCTATAAAGG TAGGTATTAC GGATGTCTAA ATTATTAATG ATAGGCACTG GTCCgGTCGC 2 64 0 

AATCCAATTA GCGAATATTT GCTATTTAAA ATCAGATTAT GAGATTGATA TGGTTGGACG 2 7 00 

TGCCTCAACA T C AG AAAAAT CAAAACGCTT ATATCAAGCG TATAAAAAAG AGAAACAATT 27 60 

T G AAG T C AAA ATACAAAACG AGGCGCATCA ACATCTGG& GGTAAGTTTG AAATTAATCG 28 2 0 

TTTGTATAAA GATGTTAAAA ACGTTAAGGG TGAATACGAA ACGGTTGTCA TGGCATGCAC 28 80 

AGCAGATGCT TATTATGACA CACTACAGCA ATTGTCGTTA GAAACTTTGC AAAGTGTCAA 2 94 0 

ACATGTCATT TTAATATCAC CGACATTTGG TTCGCAAATG ATTGTCGAAC AATTTATGTC 3000 

TAAATTTAAT AAAGATATCG AAGTGATTTC ATTCTCAACT TATCTTGGCG ATACACGTAT 30 60 

TGTTGATAAA GAAGCGCCTA ATCATGTGTT GACAACAGGT GTAAAAAAGA AATTGTACAT 3120 

GGGATCGACA CATTCAAACT CAACAATGTG TCAACGAATC TCTGCTTTAG CTGAGCAATT 3180 

GAAGATTCAA TTAGAAGTCG TTGAATCGCC ACTGCATGCT GAAACGCGAA ACAGTTCGCT 324 0 

TTATGTGCAC CCACCACTAT TTATGAATGA CTTTTCATTG AAAGCCATTT TCGAAGGAAC 3300 

AGATGTACCG GTTTATGTGT ATAAGTTATT TCCTGAAGGA CCGATAACGA TGACACTAAT 3360 

CCGTGAAATG CGTTTAATGT GGAAGGAAAT GATGGTTATT TTACABCAT TTAGAGTGCC 34 20 

GTCAGTCAAC CTGCTTCAAT TTATGGTGAA GGAAAAT TAT CCAGTACGTC CTGAAACTTT 34 80 

GGATGAAGGT GATATTGAGC AT T T CGAAAT CTTGCCAGAT ATCTTACAAG AATATCTGCT 3540 

TTATGTAAGA TATACCGCAA TCCTCATTGA TCCATTTTCA CAGCCAGACG AAAAC G G AC A 3600 

TTACTTTGAT TTTTCAGCTG TACCATTTAA GCAAGTCTAT AAAAATGAAC AGGATGTTGT 36 60 

TCAAATTCCA AGAATGCCAA GTGAAGATTA TTACAGAACG GCGATGATTC AGCATATTGG 37 2 0 

GAAAATGCTA GGTATCAAAA CGCCAATGAT TGATCAGTTC CTAACTCGCT ATGAAGCAAG 37 8 0 

TTGCCAGGCG TACAAGGATA TGCATCAJGA TCAACACTTA TCTTCTCAAT T T AAT AC AAA 38 4 0 

TCTATTTGAA G G AG AT AAAG CACTCGTCAC AAAATTTTTG GAAATCAATA GAACGCTTTC 3900 

ATAATAAGGG TTTGAAGTTT TATAATAGAA AAAAATTATT GAATTATGTT TGACATTTAC 3960 

ATAAAAATAA GCAAAT AAT T GAGAAAAATA AT CAT T AC G A TTTGATTAAG TAAGCAACT 4020 

TATCAATTTA G AAAG AG G AA AAGCAAATGA GAAAACTAAC TAAAATGAGT GCAATGTTAC 4 08 0 

TTGCATCAGG GCTAATTTTA ACTGGTTGTG GCGGTAATAA AGGTTTAGAG GAGAAAAAAG 4140 

AAAAC AAG C A ATTAACGTAT ACGACGGTTA AAGATATCGG TGATATGAAT CCGCATGTTT 4200 

ACGGTGGATC AATGTCTGCT G AAAG TAT G A TATACGAGCC GCTTGTACGT AAC AC GAAAG 4 2 60 

ATGGTATTAA GCCTTTACTA GCTAAAAAGT GGGATGTGTC TGAAGATGGG AAGACATACA 4 320 

CGTTCCATTT GAG AG AT G AC GTTAAATTCC ATGATGGTAC GCCATTTGca TGctGACGCA 4380 

GTTAAGAAAA ATATTGACGC AgTTCAAGAA AACA&AAAT TGCATTCTTG GTTAAAGATT 4 44 0 

TCGACATTAA TTGACAATGT T AAAG T T AAA GATAAGTACA CGGTTGAATT GAAT T T G AAA 4 500 

GAAGCATATC AACCTGCATT GGCTGAATTA GCGATGCCTC GTCCATATGT ATTTGTGTCT 4 5 60 

C C AAAAG AC T TTaAAAACGG TACAAcAAAA GATGGCGTTA AAAAGTTCGA TGGTACTGGT 4 62 0 

CCATTTAAAT TAGGTGAACA CAAAAAAGAT GAGTCTGCAG AC T T T AAC AA AAATGATCAA 4 680 

TACTGGGGCG AAAAGTCTAA AC TTAACAAA GTACAAGCAA AAGTAATGCC TGCTGGTGAA 474 0 

ACAGCATTCC TATCAATGAA AAAAGGTGAA ACGAACTTTG CCTTCACAGA TGATAGAGGT 4 800 

ACAGATAGCT TAGACA2AGA CTCTTTAAAA C AAT T GAAAG ATACAGGTGA CTATCAAGTT 4 8 60 

AAGCGTAGTC AACCTATGAA TACGAAAATG TTAGTTGTCA ATTCTGGTAA AAAAGATAAC 4 920 

GCTGTGAGTG ACAAAACAGT CAGACAAGCG ATTGGTCATA TGGTAAACAG AGATAAAATT 4980 

GCCAAAGAAA TTTTAGATGG TCAAGAAAAA CCAGCAACTC ASTATTTGC GAAAAATGTA 5040 

ACAGACATTA ATTTCGATAT GCCAACACGT AAGTATGACC TTAAAAAAGC AGAATCATTA 5100 

TTAGATGAAG CTGGTTGGAA GAAAGGTAAA GACAGCGATG TTCGTCAAAA AG AT G G T AAA 5160 

AACCTTGAAA TGGCAATGTA C TAT G AC AAA GGTTCTTCAA GTCAAAAAGA ACAAGCAGAA 52 2 0 

TACTTACAAG CAGAATTTAA GAAAATGGGT ATTAAGTTAA ACATCAATGG CGAAACATCA 5280 

GATAAAATTG CTGAACGTCG TACTTCTGGT GATTATGACT TAATGTTCAA CCAAACTTGG 5340 

GGATTATTGT ACGATCCACA AAGTACTATT GCAGCATTTA AAGAGAAAAA TGGTTAT GAA 5 4 00 

AGTGCAACAT CAGGCATTGA GAACAAAGAT AAAATATACA ACAGCATTGA TGACGCATTT 54 60 

AAAATCCAAA ACGGTAAAGA GCGTTCAGAC GCTTATAAAA AC AT T T T GAA ACAAATTGAT 5520 

GATGAAGGTA TCTTTATCCC TATTTCACAC GGTAGTATGA CAGTTGTTGC ACCaAAAGAT 5580 

T T AG AAAAAG TATCATTCAC ACAATCACAG TATGAATTAC CATTCAATGAAATGCAGTAT 564 0 

AAATAAAGGA GCAATTAGAT GTTCAAATTT ATCTTAAAAC GTATTGCGCT CATGTTTCCA 5700 

TTGATGATTG TAGTAAGTTT TATGACATTT CTATTGACGT AT AT T AC AAA T GAAAAT C C A 57 60 

GCTGTGACAA TTTTACATGC ACAAGGGACG CCAAATGTAA CACCAGAGTT GATTGCAGAA 5820 

AC GAATG AG A AGTACGGTTT CAATGATCCA TTATTAATTC AATATAAAAA TTGGTTACTT 5880 

GAAGCGATGC AATTTAATTT TGGTACAAGC T AC ATT AC AG GTGACCCAGT TGCTGAACGT 5 94 0 

ATTGGTCCAG CAT T TAT GAA TACAT TGAAA TTAACAATAA TTTCAAGTGT TATGGTGATG 6000 

ATTACATCAA TTATTTTAGG TGTAGTTAGT (EATTAAAAA GAGGAAAGTT CACTGATCGT 60 60 

GCGATACGTT CAGTGGCTTT CTTTCTAACT GCATTACCAT CATATTGGAT AGCTTCAATA 6120 

CTTATTATTT ACGTTTCAGT GAAGTTAAAC ATATTGCCGA CTTCTGGATT AACAGGTCCA 6180 

GAAAGTTACA TATTGCCAGT GATCGTTATT ACGATTGCCT ATGCTGGTAT TTACTTAGA 62 4 0 

AATGTTAGAC GCTCGATGGT GGAACAATTA AATGAAGATT ATGTACTTTA TTTAAGAGCA 6300 

AGCGGTGTGA AATCTATCAC ATTAATGTTG CATGTGTTGC GTAATGCTTT ACAAGTTGCG 6360 

GTATCAATCT TTTGTATGTC TATACCAATG ATAATGGGTG GACTAGTTGT TATCGAGTAT 64 20 

ATCTTTGCAT GGCCTGGACT AGGTCAATTA AGTTTAAAAG CAATACTTGA ACACGATTTT 64 8 0 

CCAGTCATTC AAGCATATGT ATTAATTGTA GCGGTATTAT TTATTGTATT TAATACATTA 65 4 0 

GCAGATATCA TTAATGCGCT ATTAAATCCA AGATTAAGGG aGGGCGCACG ATGATAATTT 6600 

TAAAmCGATT ATTmCArGwT AAAGGTGCAG TAATTGCTT AGGCATTATT GTATTATATG 6 660 

TCTTTTTAGG ATTAGCAGCA CCACTTGTGA CATTTTATGA TCCTAACCAT ATCGATACAG 67 2 0 

CAAACAAATT TGCTGGCATG AGTTTTCAAC ATCTACTAGG TACTGACCAT TTAGGTAGAG 6780 

ATATTTTAAC TAGGTTAATT TATGCGATTA GACCAAGTTT GTTATATGTC TTTGTTGCGC 6840 

TATTTGTTTC TGTACTTATT GGATCTATTT TAGGATTCTT ATCAGGATAT TTCCAAGGGT 6900 

TTGTTGACGC CTTAATCATG CGTGCGTGTG ATGTTATGTT GGCATTCCCA AGTTATGTTG 6 9 60 

TAACGTTAGC ATTAATTGCA TTGTTTGGAA TGGGTGCCGA AAATATTATC ATGGCATTTA 7020 

TTTTGACGCG TTGGGCATGG TTCTGTCGTG TTATACGTAC AAGTGTTATG CAGTACACTG 7 080 

CTTCTGACCA TGTAAGATTT GCTAAAACAA TCGGTATGAA TGATATGAAA ATTATTCACA 714 0 

AACATATTAT GCCATTAACA TTAGCAGATA TTGCTATCAT CTCTAGTAGC TCGATGTGTT 72 00 

CAATGATCTT GCAAATATCT GGCTTTTCAT TTTTAGGATT AGGTGTAAA GCGCCTACTG 7 2 60 

CAGAGTGGGG CATGATGCTT AACGAaGCTA GAAAAG T GAT GTTTACACAT CCTGAAATGA 7 32 0 

TGTTTGCGCC AGGTATTGCC ATAGTGATTA TAGTGATGGC ATTTAACTTC TTATCCGATG 738 0 

CTTTACAAAT TGCTATTGAT CCCCGCATCT CTTCTAAAGA TAAACTTCGT TCTGTGAAAA 74 4 0 

AAGGAGTGGT GCAATCATGA CATTGTTAAC AGTTAAACAT TTGACGATTA CAGATACCTG 7500 

GACAGATCAA CCACTCGTGA GTGATGTGAA TTTTACATTA ACTAAGGGTG AAaCTTTAGG 7 5 60 

CGTTATTGGA GAAAGTGGTA GTGGTAAATC AATCACTTGT AAATCGATTA TTGGTTTGAA 7 62 0 

TCCCGAACGA CTCGGGGTGA CAGGTGAffi.T TATCTTTGAT GGTACAtCAA TGTTGTCATT 7 68 0 

ATCTGAATCG CAATTGAAAA AGTACCGTGG TAAAGACATT GCGATGGTCA TGCAACAAGG 77 4 0 

TAGTCGTGCC TTTGACCCAT CAACTACTGT CGGTAAACAA ATGTTTGAGA CTATGAAAGT 7 8 00 

ACATACGTCA ATGTCTACAC AAGAAATTGA AAAG AC AT T G ATTGAATATA TGSTTATTT 7 8 60 

AAGTTTGAAA GATCCTAAAC GTATATTAAA ATCATACCCT TACATGTTAT CAGGAGGAAT 7 920 

GTTACAGCGA TTGATGATTG CTTTAGCGTT AgcTTTgAAA CCAAAGTTAA TCATTGCTGA 7 98 0 

TGAGCCGACA ACGGCTTTAG ATACAATTAC ACAATATGAT GTACTGGAAG CATTTATAGA 8 04 0 

TATTAAAAAA CACTTTGACT GTGCGATGAT TTTCATTTCA CAT GAT T T AA CGGTTATTAA 8100 

CAAGATTGCA GACCGTGTTG TTGTGATGAA AAATGGTCAG CTTATTGAAC AAGGGACACG 8160 

TGAATCAGTC TTGCATCATC C AG AAC AT G T TTATACGArt ATT kT AT TAT CAACGAAGAA 8220 

GAAGATTAAT GATCATTTTA AACATGTGAT GAGGETGAT G T AC AT GAT T 'AAAATTAAAG 828 0 

ATGTTGAAAA GTCATATCAA AGCGCACATG TTTTTAAGCG TCGTCGAACA CCTATCGTGA 8 34 0 

AAGGTGTGTC ATTTGAGTGT CCAATCGGTG CGACGATTGC GATTATCGGA GAAAGTGGTA 8 4 00 

GCGGTAAATC GACGTTGAGT CktATGATAT TAGGTATTGA GAAACCGGAT AAAGGTTGTG 84 60 

TAACCTTAAA TGATCAACCG ATGCATAAGA AGAAAGTGAG ACGTCATCAA ATTGGTGCTG 8 52 0 

TATTTCAAGA TTATACGTCA T CAT T ACAT C CATTTCAGAC TGTTAGAGAA ATCTTATTTG 8580 

AAGTGATGTG TCAATGTGAT GGACAACCTA AAGAAGTTAT GGAAGTCCAA GCAATTACAT 8640 

TGTTGGAAGA AGTCGGTCTA TCTAAGGCAT ACATGGATAA ATATCCTAAT ATGTTATCAG 8700 

GTGGAGAGGC GCAACGTGTT GCGATTGCGC GTGCAATATG TATTAACCCT AAATATATTT 87 60 

TGTTTGATGA AGCCATTAGT TCACTCGACA TGTCAATTCA AACACAAATA T TAG AT T T AT 8 820 

TGATTCATTT ACGTGAAACG CGTCAGTTGA GTTATATTTT TSICACACAT GATATTCAAG 8880 

CTGCCACGTA TTTATGTGAT CAATTAATTA TTTTTAAAAA CGGAAAAATA GAAGAACAAA 8 94 0 

TTCCGACAAG CGCATTGCAT AAAAGTGACA ATGCTTATAC AAGAGAATTA ATAGAAAAAC 9000 

AACTATCATT CTAAGGAGTG AGATAATGAA AGGTGCAATG GCTTGGCCCT TTTTGAGATT 90 60 

ATATATATTA ACATTGATGT TCTTTAGTGC CAATGCAATC TTAAACGTGT TTATACCTTT 912 0 

ACGAGGGCAT GATTTAGGCG CAACGAATAC GGTTATCGGT ATCGTTATGG GGGCATACAT 9180 

GTTAACAGCA ATGGTATTTC GACCATGGGC AGGACAAATT ATTGCTCGTG TCGGTCCCAT 92 4 0 

TAAAGTATTA AGAATTATTT TGATTATCAA TGCCATAGCT TTAATTATTT ATGGTTTTAC 9 30 0 

TGGCTTAGAA GGTTATTTCG TAGCACGTGT TATGCAAGGT GTGTGTACGG CATTCTTTTC 9360 

TATGTCTTTA CAGCTAGGTA TTATTGATGC ATTACCAGAG GAACATCGTT CTGAAGGTGT 94 2 0 

AT CAT T G T AC TCGCTATTTT CAACGATTCC AAACTTAATC GGACCATTAGTTGCCGTAGG 94 8 0 

TATTTGGAAT GCAAATAATA TTTCAC TATT TGCAATTGTC ATTATCTTTA TCGCATTAAC 954 0 

AACAACATTC TTTGsTATCG CGTGACCTTT GCTGAACAGG AACCCGATAC GTCAGATAAG 9600 

ATTGAAAAAA TGCCGTTTAA CGCTGTAACT GTTTTTGCGC AATTTTTCAA AAATAAAGAG 9 660 

TTGTTGAACA GTGGTATTAT CATGATTGTT GCATCGATTG TATTTGGTGC AG T TAG T AC A 9720 

TTTGTACCGT TATACACAGT GAGTTTAGGA TTCGCGAATG CGGGAATCTT TTTGACAATA 97 8 0 

CAGGCCATCG CAGTTGTTGC GGCAAGATTT TACTTAAGGA AATACATTCC GTCAGATGGT 98 4 0 

ATGTGGCATC CTAAATATAT GGTATCTGTA OAT CAT TAT TAGTAATCGC GTCATTTGTA 9 90 0 

GTGGCATTTG GTCCGCAAGT AGGTGCAATT ATTTTCTATG GTAGTGCGAT ATTAATAGGA 9960 

ATGACGCAAG CAATGGTGTA CCCAACATTA ACATCATACT TAAGCTTCGT CTTACCAAAA 10020 

GTAGGTCGTA ATATGTTGTT AGGTTTATTT ATTGCCTGTG CAGACTTAGG TATATCETA 10080 

GGTGGCGCAT TGATGGGACC TATTTCCGAT TTAGTAGGAT TTAAATGGAT GTATCTAATT 1014 0 

TGTGGTATGT TAGTCATTGT AATAATGATT ATGAGTTTCT TGAAAAAGCC AACACCACGT 102 00 

CCAGCGAGTA GTCTTTAATG AAGTGAATTA AAGCATATTA AGTTAATGAA TATTTAAATT 102 60 

TTAAAAGGTA TATTGaGCAT GGCGATTCAT GTGCTTCATG CTAGGACATG AAACATTCTA 10320 

TATGGCTCGT TTTTAGAACG ACAtATATCT AAATAAAGCA CGCTTArAAG TGAGTTTTGA 1038 0 

TACTGACTTT TAAGTGTGCT TTTTGATGTG TATGGATAAT GGATAATACT CTGAAAAATT 10440 

TTACATGAAA ATATGCAAAA CGAGTATAAC TGCTAATTQ. TAGAAATAGC TCACCATAAA 10500 

ATTACGGTAT GAT T T T AAAT ATAAGTAAGT CGCACTACCT GCTAGTATCA ATGCTGGAAT 105 60 

GAATTCCCAC CAT G TAT T AA TGTATGGATA G T AG AAC AG A GTTTCAAGGA TAATGGACAA 10 620 

TACTATTGTA ATCTTTAAAG GTATTAATCT GCTTAATTCT TGAATTAAAA TATGACGGAA 10680 

AATAAGTTGA CAAATCAAAG TATTTAATAT AATGGTTAAC GAAAATATAG CTATTAAACT 10740 

GATGGAaCCA TACCCTTTAA TGAGCGGGTA AATGTCAAAG ACAGTAAAGG AATCTACATT 10800 

TAGTGCGAAA ATATTGAAAT GATTTAAAAG T AAAAAG AG T ACGACACTTA GTGTAAATGA 108 60 

TATAAGAATA TGCCATTTAT ATTTAGCACT AGCAACGATT TGCGAACGTA TCATTGGAAT 10920 

AAACGCATCT TCATGCATCA GACGAAAAAT AGCTAGTGAA ATAATAACTG CGAGTAAATA 10 980 

GCTAATGTTC ATTGAAATAG GAAAAGAGAA ACCCCACGGA GCTTGTTGAG TGAATACAGC 11040 

TACTAACCCA AAAGTTAAAA AG AC GAT AAT GATCGGCAAG ATGTTACCA AAAATATGTA 11100 

AAGGAAAATA AATCCAATAT CACGTTTGAA AAAACGCGAT TGTTCGGTAG CGTATTCTTC 11160 

TTCTATGTAA TGTTTATTTG TATTTGACAT AGTATACCTC TTAAATAGTT GTATTATATA 11220 

GATACTTTAG CACATATTAC TTTGTATTGT ATGTTTTATA CATTAAAATT TAAAATGAAA 11280 

AACATATCAT AAAATTGTTT T AT AAAAT G A AGCGCTTCCA TTGTGTTTTG TTTTGTAAGG 11340 

TGTATCATAA ATATTGAATT GAAATTTTGG GGGGAGGTAT TGTAATGACG TTTCTTACAG 114 00 

TCATGCAATT TATAGTTAAC ATTATCGTTG TAGGATTCAT GCTTACGGTT ATTGTTATCG 114 60 

GGCTTATTTG GTTAATTAAA GATAAAAAC AAT C AC AAC A TAGTGTATTA AGGAATTATC 11520 

CTTTACTAGC ACGTATTAGA TATATTTCAG AAAAAATGGG ACCGGAATTA CGTCAGTATT 1158 0 

TATTTTCTGG GGATAATGAA GGGAAACCTT TTTCACGTAA TGAT TATAAA AATATCGTTT 11640 

TGGCTGGAAA ATATAACTCT CGTATGACCA GCTTCGGTAC TACTAAAGAT TATAAGACG 11700 

GCTTTTACAT ACAGAACACA ATGTTTCCGA TGCAACGTAA TGAGATTTCA GTAGATAATA 117 60 

C AAC AT T G T T AT C AAC AT T C ATT T AT AAAA TCGCGAATGA GCGTTTATTT AGTCGTGAAG 11820 

AATATCGTGT GCCGACAAAG ATTGATCCGT ATTACTTAAG T GAT GAC CAT GC AAT AAAAT 1188 0 

TAGGTGAACA TTTAAAACAT CCATTTATTT TAAAACGTAT C G T AGG AC AA TCTGGTATGA 1194 0 

GTTATGGCGC TTTAGGAAAA AATGCCATTA CAGCTTTATC TAAAGGTCTA GCTAAAGCGG 12000 

GCACTTGGAT GAATACAGGT GAAGGTGGCT TATCAGAATA TCATTTAAAA GGTAATGGGG 120 60 

AT AT CAT T T T CCAAATTGGT CCCGGTTTAT TTGCETTCG TGATAAAGAA GGTAATTTTA 12120 

GTGAAGGTTT ATTTAAAGAG GTTGCACAGT TATCTAACGT ACGCGCATTT GAGCTGAAGT 12180 

TGGCACAAGG TGCTAAGACT CGTGGTGGTC ATATGGAAGC TGAAAAGGTA AAC G AAG AAA 122 4 0 

TTGCTAAAAT CCGAAATGTT GAACCTTATA AAACAATCAA TTCACCTAAC CGTTACGAAT 12300 

TTATTCATAA TGCTGAAGAT TTGATTCGTT TCGTCGATCA GTTGCAGCAA TTAGGTCAAA 12360 

AACCAGTAGG ATTCAAAATT GTAGTAAGCA AAGTTTCAGA AATTGAAACA CTTGTACGTA 12 4 20 

CGATGGTGGA ACTAGATAAG TATCCAAGCT TTATTACGAT TGATGGTGGT GAAGGTGGTA 12 4 80 

CTGGTGCAAC ATTCCAAGAA TTACAAGATG GTGTTGGCTT ACCGCTATTT ACAGCTCTAC 12540 

CTATTGTGTC TGGCATGTTA GAAAAATATG GTATTCGAGA TAAAGTGAAA TTGGCGGCAT 12 600 

CTGGTAAGTT AGTGACACCA GATAAAATTG CGATTGCACT AGGTTTAGGT GCAGATTTTG 12 660 

TAAATATCGC ACGTGGGATG ATGATTAGTG TCGGTTGTAT AffiGAGTCAA CAATGTCACA 12720 

TGAATACGTG TCCTGTAGGT GTTGCAACGA CAGATGCGAA GAAAGAAAAA GCATTGATTG 127 8 0 

TTGGAGAAAA GCAATATCGT GTCACAAACT ATGTAACAAG TTTGCATGAA GGCTTATTCA 128 4 0 

ATATTGCAGC AGCTGTTGGC GTATCCAGTC CTACAGAAAT TACTGCTGAT CATATTGTAT 12 900 

ATCGAAAAGT CGATGGTGAG TTACAAACGA TACATGATTA TAAATTAAAA CTCATTAGTT 12 9 60 

AACTTAATTA TTTCGGGAAA TTGAAAGCAG CGGATTTTAG CGTTACTGCA AATAATTTTA 13020 

TATTAGTAGT GGATGCTGGT CACACAAGAA CTTCAAATAT TAAAGCCCTC AGAATATGAA 1308 0 

TTAAGGTTTG TAACCTTAGT CTTSTCTGAG GGCATTTTTA AGTTATAAAC TATTTGTCGT 13140 

CCATTTTATC TTTTTCTTTT AAACCTCTGT GCTTTAATTG CTTTTCAAGT TTTTCAAAAC 13200 

TAATATCTTT ATTTTCTTTA GTCGAAACAC CAAGACGTTT ATTTAATTTT TTCATGTCAA 132 60 

CTTCTGTGTA ATCTATGTCT AAGTGyTCAA TTGCTTTTTT ATCTTTATAGTCTACTTTGT 13320 

ATTTTACGCC TTTAAGGTCT TTGAAAATAC TTTCAGATTT GGCGAATAAC TTTTTGGCTT 1338 0 

CGTCTTTATC CATACCTAGA TCGTCATATT TAATTGTGTT GATTGTAGAC TGTTTTAAAA 134 4 0 

CTTTATCATC TTTATATGTG AT AGAAG T T A GTACATGTTT AC C AC T AAC A TCACCwTCAT 13500 

ATGTTTTGGT TTGTTCTTTA CCACAAGCTG ATAATGCAAT GATACAAACT AATGCTACTA 13560 

CAATTAATGA ACATAATTTT TTCAAAGTCA GTCGCCTTCT TTCGATATTT GT AT TAT AAA 13 620 

GAAATTATAA CATTTACTAA AAAATGATGT TATTCAAAAA TTTAAATTTT GTCATTTTTT 13680 

TTGAAGATAT GAGTTTTTTT AAGCGGATTC (TCACAAAAT TTTAAAAATA TTTAAGCCTk 137 4 0 

AAAATGATAA AGCGkTAGGG AACGTTTTTC TGAAAGTTAG TGATACAATA GTTTTAAGTT 138 00 

GAAATACAGG AGGATGAATA AC AT GAAT C A GTCAGTCAAA TTACTTAAAC ATTTAACAGA 138 60 

TGTAAACGGC ATTGCTGGTT ATGAAATGCA AG T T AAAGAA GCAATGCGTa ACTATAAGA 13920 

GCCTGTCAGT GATCAAATTA TTGAAGATAA CTTGGGTGGC ATTTTTGGAA AGAAAAATGC 13980 

TGAGAATGGT CAATACTCAA TTATGATTTC TGGTCATATG GATGAAGTTG GTTTTATGGT 14 04 0 

AACAAAGATT GATAAACATG GTTTTATTTC ATTTACGCCA kTgGTGGATG GTGGAATCAA 14100 

GTCATGCTAT CTCAAAAAGT AACGATTACA ACAGATTCGG GCAAAGAAAT TAGAGGTATC 14160 

ATCGGTTCTA AACCGCCACA TGTCTTAACG CCTGAAGAAC GTAAAAAGCC AATGGAAATC 14220 

AAAAAT AT G T TTATAGATAT TGGTGTTAGT AGCAAGGAAG AAGCTGAAGA AGCTGGCGTT 14280 

GAAGTAGGCA ATATGGTTAC GCCATATAGT GAATTTGAH TGCTTGCAAA TGATAAATAT 14 34 0 

TTAACTGCGA ArCATTTGAT AATCGCTATG GCTGTGCATT AGCTATTGAG GTATTAAAAC 144 00 

GTTTAAAAGA TGAAAATATT GGCATTAACT TATACAGTGG TGCCACAGTG CAAGAAGAAG 144 60 

TTGGTTTGCG TGGTGCGAAA GTGGCAGCGA ATACGATTAA ACCAGACTTG GCGATAgcTG 14520 

TcGATGTAGG TATTGCTTAT GAT AC C C C AG GTATGTCAGG TCAAACGAGC GATAGTAAAC 14580 

TAGGCGGTGG TCCAGTTGTC ATTATGATGG ATGCTACAAG TATTGCTCAC CAAGGTTTGC 14 640 

GAAAgcATaT TAAAGATGTA GCTAAGGAAC ATAACATCGA AGTACAATGG GATACGACAC 14 700 

CAGGTGGAGG TACAGATGCG GGAAGTATTC ATGTCGCAAA TGAAGGTATT CCAACGATGA 14 7 60 

CAATCGGTGT TACGCTGCGA TACATGCATT CTAATGTTTC AGTGCTCAAT GTAGATGATT 14 820 

ATGAAAATTC TATCCGTCTT GTTACTGAAA TTGTCCGTTC ATTGAATGAT GAAAGTTATA 14 880 

AAAATATCAT GTGGTAATCA AATCCATAAA TAATAAAGAA TCCTTTAAT ATGGTAGGTT 14 940 

GTTAAACAAT TGTCTAATTT TAATTCTTAG TCATTAGACA GTATCCATGT TAATAGGATT 15000 

TTTTGTTTTT AATTTAAATG CTGAAAATCA ATTATGCCTA AATTTTGATA T T AC AAGAAA 150 60 

ATGATTTTTT CTTAAATGTA ATTGCACTAA AAACCAAAAA AACGGGAATA ATATACCTGA 15120 

TATATTACAT GAGGAGCGGT GCAAATGTTG TTAGAAATTA AAGATTTAGT GTATAAAGCG 1518 0 

AGCGATAGAA TCATACTAGA TCATATCAGT CTAAAAGTAG ATAAAGGCGA GAGTATTGCC 152 4 0 

ATTATAGGTC CATCAGGTAG TGGTAAAAGT ACATTTCAAA AGCAAATATG TAATTTGTTT 15300 

AGTCCAACTA GTGGAGAACT TTATTTTALA GGTAAACCCT ATAATGATTA TGACCCGGAA 15360 

GAATTGCGTC AACGAATCAG TTATTTGATG CAGCAAAGTG ACTTGTTTGG TGAAACGATT 15420 

GAAGATAACA TGATATTCCC ATCACTTGCA CGTAATGATA AATTTGATAG AAAACGTGCA 154 80 

AAGCAAT TAA TTAAAGATGT CGGTTTGGGA CATTATCAAT TAAGTTCGGA AGffGAAAAT 1554 0 

ATGTCGGGTG GTGAGCGGCA AAGAATTGCT ATAGCGCGCC AACTGATGTA TACACCGGAT 15600 

ATTCTTTTAT TAG AT G AAT C GACCAGTGCA TTAGACGTTA ATAATAAAGA AAAGATAGAA 15 660 

AATATCATTT TTAAATTAGC AGATCAAGGC GTGGCAATTA TGTGGATTAC CCACAGCGAT 1572 0 

GACCAAAGTA T GC G AC AC T T TCAAAAGCGT ATAACAATTG TTGATGGTCA AATTTCTAAT 15780 

ACAGAGGAGT TGAATCAACA TGAGTAATAC GGCACTTGGA TTGACGGCAT TGCTTTTAGT 158 4 0 

CATTCCGATT ATCATTTCAT ATAAAGAAGG TTTACATATT ATTAAAGATT TAATTGTTGC 15 900 

GACATTACGA GCAGTTGTGC AATTAATCAT TTTGCGATTT TTGCTGCATT ATATTTTTAA 15960 

AATAAACGAT AAATGGCTGC TTATTTTATG TGTATTGGTC ATTATTATTA ATGCATCATG 16020 

GAATACAATT AGTCGAGCAT C AC C AG T GAT GCATCATGTG TTTTGGATAT CATTTCTAGC 1608 0 

TATCTTCATT GGAACGGCAT TACCGCTTGC AGGTACTATT GCGACAGGGG CCATTCAATT 1614 0 

TACCGCAAAT GAAGTTATAC CTATCGGCGG CATGCTTGCA AATAATGGCT TGATTGCAAT 16200 

TAATTTAGCT TACCAGAATT TAGATCGTGC ATTCGTACAA GATGGTACTA ATATTGAATC 162 60 

TAAATTATCA CTTGCAGCTA CACCTAAATT GGCTTCTAAA GGTGCAATAC GTGAAAGTAT 16320 

TCGTTTAGCT ATAGTGGCAA CTATTGATTC GGTTAAAACA TATGGGCTTG TGTCGATTCC 16380 

TGGTATGATG ACAGGCTTAA TTATTGGTGG CGTACCACCT TTACAAGCGA TTAAATTTCA 16440 

ATTGTTAGTC GTGTTTATTC ATACAACTGC GACCATTATG TCTGCTTTGA TTGCGACATA 16500 

TTTAAGCTAT GGTCAATTTT TCAATGCAAG ACATCAATTA GfiGCACGAA ATACTGATGT 16560 

TAAGAGTGAA TCATGATAGA TTTTACTGCA TCAGATTTAG GCATTAGTTT TAATTGGAAA 16620 

TGAAGTGACG CGCACATATA GTATCGCTAT TCATTAGCGC AGCGAAAATA TTCATAAAGG 16680 

CACGCATACT TTGTAGTCAG TTATCTGTTC TGACATATAA AGCGTGCGTG CTTTTTTGGA 16740 

GTTATTGTTG AAACTGAAGT AAT TAT AC AT AAT TAT T AAA TGACATACTT GTGTTAATTT 168 00 

TTCAAATACT GAAAAACAAT T T C a AT AAT T TTCCaATTAA GCACAGAAAA T T AAAGC AAA 168 60 

AT AT TAT AT A AT AG AAC GG T TAT AT AT aAA nATTngTgCA CACATTTTTT AATAAATCGT 16920 

TATTCTAAGG GAAATGAATA TCGGVAATTT TGTTTGAAAG GAGTTTTAAA TTGTCAATCA 1698 0 

TGCGACTATT TACATTCATT TTAAGTATTT TTATCGTAGG AATGGTTGAA ATGATGGTTG 17 04 0 

CAGGAATTAT GAACTTGATG AGTCAGGACT TACATGTATC AGAAGCTGTC GTTGGTCAAT 17100 

TAGTGACAAT GTACGCTTTA ACATTTGCGA TATGTGGACC TATTCTGGTTAAATTAACGA 17160 

ACCGTTTTTC ATCAAGGCCT GTATTATTAT GGACATTACT TATATTTATC ATTGGTAATG 17220 

GCATTATTGC TGTAGCGCCA AATTTTTCaA TATTAGTAGT TGGTAGAATT ATCTCATCTG 17280 

CAGCAGCAGC AC T AAT TAT C GTAAAAGTAT TAGCTATTAC AGCGATGTTA TCAGCACCTA 17 34 0 

AAAATCGTGG TAAAATGATT GGACTTGTCT ATACAGGGTT TAGTGGTGCT AATGTTTTTG 17 4 00 

GTGTACCAAT TGGAACGGTT ATCGGCGATT TAGTAGGTTG GCGCTATACA TTTCTATTCT 174 60 

TAATTATTGT GAGTATTATT GTTGGCTTCT T GAT GAT GAT CTATTTACCG AAGGAT CAGG 17520 

AAATACAACG AGGCCCTGTG AAT CAT GAGA (ACCATCTCA TGAAAATCAT GTTACTTCGA 17 58 0 

AAATATTAAG ACCTGCTGAA GTAGCAAAAT ATTTAATTAT TACGTTTTTA GTATTGATTG 17 64 0 

CAAACTCAGT GACATTCGTC TTTATAAATC CACTTATTTT ATCTAATGGT CATGATATGT 17 700 

CATTCGTTTC ATTAGCACTT CTAGTAAATG GAATCGCTGG CGTTATTGGA ACATCATAG 17760 

GTGGTATATT CTCCGATAAA ATTACAAGTA AGCGTTGGTT AATGATTTCT GTTTCTATTT 17820 

TTATCGTCAT GATGTTACTT ATGAATTTAA TCTTACCTGG TTCAGGTCTA TTGTTAGCAG 17 880 

GACTATTTAT TTGGAATATC ATGCAATGGA GTACTAATCC AGCAGTGCAA AGCGGTGTGA 17 94 0 

TTCAACATGT TGAA3GCGAC ACAAGCCAAG TAATGAGTTG GAACATGTCT AGTTTAAACG 18 000 

CTGGTATTGG TGTTGGAGGC ATTATTGGAG GCTTGGTCAT G AC AC AT G T T TCTGTTCAAG 18060 

CTATCACATA TACGAGTGCC ATCATTGGCG CATTAGGATT AATCGTTGTT T T C AC AT T G A 18120 

AAAATAATCA TTATGCTAAA ACATTTAAAT CATCATAAT CTCATATGAm AAGCACGCCT 18180 

GCTATCAAAT TCAGGTGTGC TTTTTTAGAT GCGATAACGT TATTGATATG TGCGATAATA 1824 0 

GCGACGTTCA TTATGATACA TCGGCCAAGG CATTTTACCG CTTTTAGCAA AATTAGCTAA 18300 

ATCATTTTGC ATTTGTCGAC TTAAAAATTT AAGGTGaGCA GTTGTTGGaT ATgAT 18355 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1192 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

CGCAAAGAAG TACAAAAAAT GTTTTTACAA GAAGGTATTA AAACACCTCA ACCAATTATG 60 

ACTGCTTATA ATCATAGTGA AAACGgTGTT TAGTAGTTTA TAATACATGG AGGTCATATT 120 

TAATGGCGTC AAAATATGGA ATAAATGATA TAGTAGAAAT GAAAAAACAA CATGCGTGTG 180 

GAACAAACCG TTTTAAGATT ATTAGAATGG GTGCAGACAT AAGAATTAAA TGTGAAAATT 240

GTCAAAGAAG TATTATGATT CCACGTCAAA CGTTTGATAA AAAACTTAAA AAAATCATCG 300 

AATCTCATGA TGATACACAA AGATAGGAGA ATGATTAATG GCTTTAACAG CAGGTATCGT 360 

TGGATTGCCA AACGTTGGTA AATCAACATT ATTTAATGCA ATACAAAAG CAGGTGCTTT 420 

AGCAGCGAAC TATCCATTCG CTACGATTGA TCCTAATGTA GGGATAGTAG AAGTGCCAGA 480

TGCTAGATTA CTTAAATTAG AAGAAATGGT TCAACCTAAA AAGACATTGC CGACTACATT 540

TGAATTTACA GATATCGCTG GTATTGTGAA AGGTGCTTCA AAGGGAGAAG GGTTAGGTAA 600 

TAAATTCTTA TCACATATTA GAGAAGTAGA TGCGATTTGT CAGGTCGTTC GTGCATTTGA 660 

TGATGATAAC GTAACTCATG TTGCTGGTCG AGTAGACCCT ATTGATGATA TTGAAGTTAT 720

TAATATGGAA TTAGTACTAG CGGACTTAGA ATCTGTTGAG AAACGTTTGC CTAGAATTGA 780

AAAATTAGCA CGTCAAAAAG ATAAG\CTGC TGAAATGGAA GTACGTATTT TAACAACTAT 840

TAAAGAAGCT TTAGAAAATG GTAAACCCGC TCGTAGTATT GACTTTAATG AAGAAGATCA 900

AAAATGGGTG AATCAAGCGC AATTACTGAC TTCTAAAAAA ATGCTTTATA TCGCTAATGT 960

TGGTGAAGAT GAAATTGGTG ATGATGATAA TGATAAAGTA AAAGCGATTC GTGAATATGC 1020

AGCGCAAGAA GACTCTGAAG TGATTGTTAT TAGTGCAAAA ATTGAAGAAG AAATTGCTAC 1080

ATTAGATGAT GAAGATAAAG AAATGTTCTT AGAAGaTTTA GGTATCGaAG AACCAGGATT 1140

AGATCgrTTA ATTAGGAmCA ctTATGAATT ATTAGGnTTA TCCACCATAA TT 1192



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7494 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

AATATAGCTG CAATAGCATC TCGTTTCATT TGTATAATCA ATTCCGGTTT AAATATCAGT 60 

GTGAACGTAA GCACGACACA GATTAAAAAT AACACTGCCG GAATGAGTCG TTTCAATCGT 120

CGCTtCCAAA ACTCTAGCAA ATCGATTTTT TGCGTCCGAT AATACTCACT TATCAACAAA 180

CTTGTTATTA AATAACCTGA AATAACGAAG AATGTATCTA CTCCTAAAAA GCCCCCACTT 240 

AACCATTGTG CATTCAAGTG ATAAATAATG ATTCCTATAA CTGCGAATGC CCTCAATCCA 300 

TCTAATCCAG GTAAGTATCG CGGGGAATAC ATTTTTTCTA AACGTTTAAA GTCTTTTGTA 360 

TCCATGTTAA TAAACGCCCC ATTTATTTTT CTCTATTTTG TAGTATATCA CAAT'ATTTT 420

GAAAATAAAA TATTGCACTG aTTTTCATTA ATTGATTTAA CCCTTAATTA AGATAGTTTT 480

AAATTTTTTA TTAAGTAGAA AACAATTATT ACAGTTGATT TCATTACTGC AAACCACATA 540

TAAATTTGTC GATTTTACTA CATAACATAG ATTATCATAG ATTCTTGAAT TTTTAGCAAA 600

ATAACTGTTA TTTTCATTAT ATTTTTACAA AAAAAGGTTC GTTTTATATT TTATGCATCT 660

TACTGTAACA GAATCATTAA GATATGCTAT TCGAATATAC TTTTTCAAAA TTTATATAAT 720

GAATAAATTA ACATGTATTG AAAAAAAAGC GAAATGCAGC CTATCCTCTA ATGTAAACCA 780

AACGATATAT CTCGTCAGAC TTTATATTTA AACGCTA'GT GTCACTTTTA AAATGAATAT 840

TACTAAGATT GTCATATCAA TTATTATTGC ATCGAATTAA TCTTTTAAAT TTCTGTAATA 900 

AAACCTATTA TCTACATCTA ATTTAGGAAA ACTACTTGGT AAATCTCCTC GCTTAATTTC 960

ACGGAAGTCA TTATTAGAAT AAAAATACTG TGCACTAATA AATTTATCAA TTGTTCCTAA 1020

ATAAATACCA TCGATATTTT GTTCTTTACA TGTCATTATA ACTTTATCTA AAAGTTTTTT 1080 

ACCTATTTTT AAATTCCTAT AACCTTTATC AACAAACATT TTTTTAAGTG CAGACATATT 1140
ATTATCTAGT CTAATCAAAC CTATAGTACC AACAATATTT TGaTGATTGT TTATTGCAAG 1200

CCAAAATgCC CTCCATTATT CAAATAGTTA TGTTCGATGT TCTCCAAATC AGGTTGATCA 1260

TCTCTATCAA TTTTTATATa AATTCATTTT TTTGAATCGA TAAAATAAAC TCGATTAGCT 1320 

CTTCCTTATA AGACCTATTA TATTCAATTA TGTTTATAGC CATTTTTATC TCCTTTTTCA 1380 

TTTAATTTAA TTATAAAATG TGCGTTTAGT TTGTATCTAG TGTJ5TCAGT ACAGCCTCAA 1440

ATGAAGTTTC ATTCCACTTG GCACTTAATA AAGACAAGTA TTTTAGCAGT AATACAATAA 1500 

AGTCCAATAA ATTTCCCTAA CTTCAATATC CACTTTTTAA AAAATGTATT TTTAATTAAT 1560

AAAAAAACTC TCCCCAATTT CTATGGGAAG AGCTATATAT TTAATGTCTA AACATTACTT 1620 

TTATTTATTA TGAAGGAATT AGAATCCCCA AGCACCTAAA CCTTGTGCTT TGTATGCTTT 1680 

AACAGCTGCG TTGATTTGTT GGTCAACAGT GTTTGTTGGA CCCCAACCTG GCATAGTTTG 1740

GAATAAACCT GAAGCACCTG ATGGGTTGTA AGCATTTACT TGACCATTTG ATTCACGAGC 1800

GATGATTGCA GCCCATGTAG AAGCTG\AAC ACCAGTACGT TGAGCCATGA TTTGAGCTGC 1860

TGATGAACCA GTAGCACCTG CAGTATTACC ATTGCTTAAT CTCACTGAAC TTGAAGTAGT 1920 

TGAAGTGCTG TAGTTATGGT AAGTTGGAGC TGAAACAGCT TCAACGTtTG AGTTACTTGA 1980 

TTGTGCATTG TAGCTTACTG ATTGTACATT TGAACCTTGG TTGTATGAAG fflGTGTAGTC 2040
TGCACCTGCA ACGTTTGAGA AACCAGCAGT TTGACCATTA GCTGCTTCAT AGCTCCATGA 2100

CCATGTAGTA CCATTTGAAG TGAAGTTATA TTGGAAACCA TCTTTTACAA AGTGGATGTC 2160 

ATATGCACCA TCTTTGATTG GAGCTGCATT TAATTGATCT TGGTGATTAT GCGCTAAGTC 2220 

AACTAAGTGT GCTTGATCAA CGTTTACTTC AGCAGCGTGT GCTTGATGTC CTGTACCTGC 2280 

TGCGTAACCT GTTACACCTA ATGCCACTGC TAATGATGAT GCCATAATTG TCTTTTTCAT 2340

AGTAAAAAAT CCTCCAGTAA TAATTGTnAG TTTATGTTTT TAGTAATTAT AtTTTGaATT 2400

TGAATGTCGT AGTgCAAGTT TAAATTGTCT TT'ATTTCTT TCaACGGTAC TCACTATATC 2460

ACAaAAAACC AGCCAGTAAA TTACACTTTC TTTACAAAAC ATTACAATAT CAAGTGTTAT 2520

TTGtAATGTT GAAATATGGC TGTTTTATAC TGTAATGTGA AATATGTGCC CTTTAGAATC 2580 

CAATCAACCC TTGAAATAGT CTTTAACACA TAAGATTTTT ACTATATTTA GCTCAACTA 2640

TACAGCTTTC GTAATATTAC AGATTGTATT TTTGTTACAT AGCTGTAATA TATCTGACAT 2700

GTAACTTCTC TTATTtTCGT ACATTAAACG CAATTAwAAA GCAATCAACA AATATGTTTC 2760

TACACATGTA TTGATTGCTA TTATTGTTGT ATATTCAAAG TTTTAAAACA CACATCTTTT 2820

GTGAATTGTC TTATCTTTTA TTAGCGCAAA TAAACTGCAG CTCAATTATA TTGTTCAACT 2880 

TCATTCTCGC AATTCACAAT AACATTAAAT AATTTTTGGT CTCATATTTT CAAAAAACAT 2940

ACTGTTATTA TCCCATGAAT TTAAAAATAT CATTAGTATA TAAACGAAAC ACTTTACGAT 3000
AAATGATATC TGCAAGCCAA GCTGTTACAA ATGGTACAAC AAAGAACGCT ACTACAATTA 3060

GTAAGACACT CAACCAAGCA GAATCAACCT CCATAAATTT AAATGCATTA ATCGGTCCTA 3120 

CCATTCCTAT AAAACCAAAT CCAGCTGACT CTTTCGTTCC ATGAATACCT ACTAATGCTG 3180 

ATACCAAACC TGATACAATG GCTGTCGTTA ATATTGGTAA CATAAGAATT GGATATTTCA 3240

CCATATTAGG TATCATCATT TTAACGCCTC CAAAGAAGAC GGATAACGGC ACCCCTAAAC 3300 

GATTCACTTT ACTTGTACCA ATTATCAATA CTGCTTCAGT CGCGGAGATA CCAATTGACG 3360 

CTGATCCAGC TGCTAAACCT GTAATACCTA TCGCAAAGGC AATGGCCACA GTTGATAGTG 3420

GCGAAATAAT AATAAGACTA ASTACCATTG AAATCAAAAT ACTCATGACA ATCGGTTGTA 3480

ATTCTGTAAA ACCATTAACC ATATTACCGA TGGCTGTTGT AATCATTTTC GTATACGGCA 3540

ATATTAAAAC ACCAATTGCA CCTGAAATAC CGCCAACAAC TGTTGGGAAT ACAATCAATG 3600

CCATACTACC TACGCGATGT TGAATAAGTA AAATGAATAA CACTGCAffiC GCTGCTGTAA 3660 

TCATTGTATT AATTAAATCA CCAATACCCG TAATCATCCA AGCACCATTT TTAAACTGCG 3720 

CTGCACCGCT TCCTACATAT GCTGCACTTG CCACAACAGC AATTGCTAAT GGCGATAGGT 3780

CAAATTTCAT GGCAACCAAT GCACCAATCA AAGCAGGTAC TGTAAATTGA ATTGCAACGA 3840

CAACGCCTAA TAACGTTTTA AAAATCGGAT GATAATCCAT AAAGTATTTA AAAATTTCTC 3900 

CAAGTATCGC ATTAGGAACT AAACCCGCAA CAATACCTAT GGCGACACCT GATAAAACTC 3960

TAAATATAAA ATCTTTGGGT GTAATTGTTT TAATTGATGT CATAATATCA TCCTTCCATT 4020

TATGTATATA CATCTGTATG CAAATAATAA AGAGCCTTAA GTTATAAGCT GCCACTAGCT 4080

TAAATTCTAA GATGTGCATG CCGATGTTGT TATATTTAGG CTAGCAGTAT CATCTATAAC 4140

TCAAGACTAT GAAAAATAGT ATATCACAAA ATTCTGAATT TTTAGATAAA TAAATTGGCA 4200 

ATTTTTCAAA CATATTGTTA CAATACACTT TTATTTTATC TTCATTTTTA AAATCATTA 4260

ATACAATAGA AGAAAGACAT TCAAATGCTT ACCAAAAAGG TACATTATTT GTTAGGAGCG 4320

TATCAGCaCT TACATATCAT CAACACAATT GACAATATAA TAGAAGATAC TGATAATAAG 4380

TGTTAAAACA ACAGATGTTA GGTAGTGAAC AAATGATGGA AAGTAAATCC ATAGATCCAA 4440

GAATCGTTAG AACCAAACAA TTGCTTGTCG ATGCTTTTCT TAAAATTTCT AGAGAAAAGA 4500

AATTATCTCA AATTACAGTT AAAGATATCA CTGATaTTgC TACGCTAAAT CGTGCAaCAT 4560

TTTACGCTCA TTTCGCTGAT AAAGAAGACC TCCTAGACTA CACATTATCT GTAACCATTT 4620

TAAAAGACTT GAATGATAAT TTGAGCATTT CTAATCEAT TAATGAAAAG GTTCTGCGTA 4680

ATATTTTCAT TTCAATTGCG AGTTATATCA AAGATGCTGC AAAGTCTTGC GAATTAAATA 4740 

GTGAAGCATT TTGCAACAAA GCACATCAAC GTATTAATAA TGAATTAGAA GATATTTTTG 4800
CGATTATGTT AGAAAACAGC TATCCGGAGC ATCAACGAGA TATCATTGTA AATAGTGCGA 4860

GTTTTTTAGC AGCTGGTATC TCAGGCTTAG CATTACATTG GTTTAACACG AGTCAAGAGA 4920

CAGCCGATGT GTTTATCGAT CGCAACCTTC CATTTTTAAT TCATCATATA GCACATTTTT 4980

AATAAAACTT GGTATTTAGT CATGCATCTT GAAATCACTA TGTGACTTAG GTTCATACTT 5040

GTACACACAA TAAAATTTAA CGTATTACGA TTGATTAGCC GTGTCTAGGA CATAAATCAA 5100

CGTCCTATAC TCTACAATGT CATATTAGCA GTCGTTAACT GAATGAAAAT AAGCTTGTCA 5160

TTAAAACATA TAGATTTTAG TGACAAGCAT TTTTGTTTTT GCGTACTTAA ACAACACTTC 5220

AGGCAATATG TTGTTTAGGC AACAAATGAT ATGTGCGTGT TTATGGCAA ACGTACGACA 5280

TAGTAGTATA GTATGTCTAA ACAACATATG TTGCATAGTT GATATGCGTT GTTTAAATAC 5340

TAAGATAGGA GGGATTGACG TGAGCGAGAC AGATGAACCT CAGGGGTTTG AACGCACGCA 5400

TAATATATTA AATATTAATC AGAGTAGTCT GGGTGTAGTG ACATACATTA CAAATAAATT 5460

AAAGTCGACG TTGAAGCAAC ACATAATAAT TGCTCGTGGT AAAAAGCGAA TCGACTATCG 5520

ACTGTCGTAT AACTTTTACA TACGTATTAT GATAATGTAG AAATCAAGAA AATCGACTGT 5580 

GAATATACCT ATGCTATGCC CATTGCAATT TTAATAAGAC ACACGATGTC ATTCGACAAT 5640

GCTCATTTCT TTGCTCAGTT ACGTCATCCT GTCTTATAAA ACAACATTGC AGACATGTAT 5700 

ATCAAACGAC ACTTCAATAA CATCACTTTG CCcATCGTAC TACTAGTAAA ATCGTGTCTC 5760

AAATCCCTTA TTTTAATTCC AAAAAtCTGC TGGTCAAAAG ACCGAGAAAC TAAAAACATT 5820

ACTTAATGTG TTGATAAATT ACCATATAAA AATAATCTCA AAATATATCA fiACTTGATT 5880

CTAAGGAGGA TATGACAATA TGAAAATTTT AGATAGAATT AATGAACTTG CAAATAAAGA 5940

AAAAGTACAA CCACTTACTG TAGCTGAAAA ACAAGAACAA CATGCATTGC GTCAAGAcTA 6000

CTTAAGcATG ATCCGAGGAC AAGTATTAAC AACATTTTCC ACAATAAAAG TGGTTGATCC 6060

AATCGGTcAG GATGTCACAC CAGATAAAGT TTATGATCTT CGCCAACAAT ACGGTTATAT 6120

TCaAAATTAA tATTTGCTCA CGAGGTATTG CACTTAAGGT GCCAACTGAC CTCATAAACA 6180

AAGCCCATAC TGATTGAAGA CACTAATGTG tCsaCCATGG TGCACATTAC GCTTCATCTC 6240

TGTATGGGCT TTTTATTTAT TCTTTTGAGA ATTTCATTTT AGCAGACCAA AAAATTAAAA 6300

TGGTGTCCAT CAAGATCTTT GAACATTGCT CCATAAAACC CTTGGCTTAC TGTTGGTTCT 6360 

TGAACGACTG TGCCACCCGC TTCTTTCACT TTATTCACCA ACTGGTCAAC TTCTTCATTT 6420

GTGTTCACAC CTAGAGAAAT CATCACTTCA TTTGGTTCAG TATTAAGGCT TTGCTGACT 6480

ACATTTTGAA AATGCTTGTn TTCTATTAAA ATTACGGkTG tTTGACCTAT tTGAATGCCG 6540

ACCATTTTAT CTAACATTTG TGGGTTTCTA TTTATTTTAA ATCCTAACGC TTTATAAAAC 6600 

TGTGCGCTCT TTTCTAAATC TTGCACATGC AAATTAAACC ACATTGATTG AATCATGATT 6660
GCACCCCATT CATTACTTAT TATAGTTTTG GACTTTAAGC CAATCACTTA ATGATAATCT 6720

TGTTGGATTT ATTTCAGCCA TTAATTCAAA GTCTACTTCA TAACCTTTTT CTTCCAACCA 6780

TTGCTTTTCT GCAACACCAC TAACAAATTC TCCTTCTATA ACAGTAGATT TACCTGTCAC 6840 

TTCACTAAAA ATTGTTGCTG CTTCACTTAA TGTAACTTCA TCGGAACCAA TCTCTATTGA 6900

TTGATGCGTA AAGCTTTGTG GATGTGCAAA AATATACGAT GCAATTTTAG CTATATCAAT 6960

AGAAGAAATC ATTGTGAATT TTATATTCGG ATTAATAAAT TCTGGTAATG TAATACGTTC 7020

ATCTTCGACT TTAGCAATGC GTAAAAAATT ATCCATAAAG AATGATGGTT TGATAACTGT 7080

TGCATTTATA TTAGATTCCA TTAATCTATT TTCTATTTTT GCTAGTACTT CAAAGTGTGG 7140

GCCAGTTCGA TTTCGATTAA CCCCTCCCGC AGTACTATAC ACAATATGTT GAATATTTTC 7200 

TTGCTCAGCT ATTTCAATTA TCTTCATACC TTGTCTTAAT TCTTCGCTAA CATCATCTTT 7260

AACGATTGGC TGAATACTGT ATAAGCCATA CTTACCTTTC ATCGCTGATT GCAAACTAAC 7320

ATTATCACTC AGATCACCTT CArCGATTGA TAAATGCGGA TGTCCTATGT CTGAAAGTTT 7380

ACGATTATnC TTATTTCTAG TTAATGCACT TACATACCAT CCATCCTCTA ACAACTGTTT 7440

TACAACTGCA TTACCTTGCT TCCCTGTTGC GCCTATTACn AAAATATCCT TCAT 7494
(2) INFORMATION FOR SEQ ID NO: 70:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11802 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70:

AATTTATTTC GCCGTCCCAC CCCAACTTGC ATTGTCTGTA GAAATTGGGA ATCCAATTTC 60 

TCTTTGTTGG GGCCCcGCCC CAACTCGCAT TGCCTGTAGA ATTTCTTTTC GAAATTCTCT 120 

GTGTTGGGGC CCCTGACTAG AATTGAAAAA AGCTTATTAC AAGCGCATTT TCGTTCAGTC 180

AATTACTGCC AATATAACTT CGTAGATCAT AGAACATTGA TTTATTTCCC AGCCTATTCT 240

TTTCATAAAA AAAAGACGGA TTAATTATCC GTCTTTTTCT TATATCTTTT TATAATATTT 300

AGCAAAGGTA ATAATGATAT TAATAATGTA CAAAAAATAT AAATCAAATC GACATCCTTA 360

TAAAACATCA GAACCACTAA AAACASVAAA GCACAAAATA AAATTAAATT TAAAATAAAC 420

GACCACTTTT CAAAAAAATC TCtTTTCaTa TTTCCACCCC TAATTTTAAT AAGCATTATT 480

TTATATTCTC TTTTAAGTTT ATTATTCAAA AGGAAAACAG AAATATCTTT CaATATTATT 540

ATAAACATTT CAACTACTTT TAAAAACCAA CAAAAAAATA CTTATTTTAA EAGATGAGC 600
ATAAGTGAAC ATAGTTCTTT AGTTATAATA ATTAATTCAA CCAAAAGTCG ATTTGTTTTT 660

GCAATTGGTT TTCATTTCCT CTTAAAGATA TTTTCATTAA ATCTGTCAAA TCAATAGACG 720 

CTATATTTTT CAACTTATCT CTATATTTAT TTTTAGTACG TCTTTCTAAA TTTCCCCATT 780

CCTCTTCTTC GTGAGTTAAT AAATGAAGCA TTGCTCGTTC TTGTATATTT TCAATCATTT 840 

TTAAATTCGG TTTTAAAATA TGCAAATCAT CAAAACAATC TTTCCAACAA TCAACCATAT 900 

CTCGTTTTAA TTCAATTTCC ACACGCCATA GAAATGTTGA ATCAATTTCA ACATCTGCAT 960 

TATCTTTACG TTCTTGTTTT TATTATAAAT CCCAATAAAC CTATCACTAT TACGCACACC 1020

AAAATATTTT GTTTCTGGTT TTACATTACG TCCATAAAAT ATAGTTTTCT TTACCGACTT 1080

ATCTGACAAT GCATAATAGT CATTTAAATC AAATTCAAAA TCAAAAGCCA AATCTAATCT 1140

CGTAAAACTA ACATCGTCCA AATAACTGAT GATATTTTGT TTTAACCAAA GCACTTCAT 1200 

ATGCGAAAGC TTATTAGGAT TAAATTCAAC GCGCATAtAC GTCTATTCCA AAGAGTTGCT 1260

TTTATTTTGT CATATTCAAT ATAAACTTTT TCTTTAAGAG CTTTAGCTTT AAAGTTTGTT 1320 

TGTAAAATAT CCCAAAGCCG AATTTCAGGA TTAGTACTCA TAAAATGTGA AAGTCTCTCT 1380 

GCGTTAGACA TGCTAAGATT CCCAACAATC GTTATAGCGT CAAAAGACAA TTTTGGAATA 1440

GCTAGTGACA TCCTATGTCG ATTTAACCGG CTATTACCGG ATATTAGAGT ATCCAGTTTT 1500

ACAAATGGAT GAAACGAAAT TCAAAACACT AAAAAATATG TTCCACTAAC AGCAAAAAAA 1560

TACCATTATG TTCCTACTAA AAAACyAAAA ATACTGGAGA ACAAATGTCA GGATATAACT 1620

TAGGATACTA TGTAATAAAA ATTTACAATA AAAAAACAGG AAAACAAATT TCAAGTAAAA 1680 

GmATACCCAT ACAAAGAGGA TAAAATAAAA AACCTCGAAC TGaAATGATG ATCTTTTCAG 1740

CTCGAGGTTT AAATATTGGT GCCTTATTTA TATAGATTCG TTATATTATA TTCTCTATTT 1800

TCATTAACmT AATCCTTAAA GAGTTTTAAA TTAATACCTG CTAGATGATT CAAAAATGTT 1860 

TCATCAACTT TTAAATAATT CAATAATTTT TGTGGTGTCA GTAAATnTCT ATCAAAATAC 1920

AACTTTAATA AACTATTCAT TTTGACAGGA CGTGACATTT CAATCACGTC GTCTAAAGAT 1980

AATACTTTCT CGCTTTAnAC ATAnACAAAA ACTTACCCGA TTAAAATCAA GTAAGTTTTA 2040

CATATATCCG TCACTTTTCA ACTTTAAAAT CAACAACTTG CGGGATGCGC ATCAAATTAT 2100

TATTTGATAA AAAATCAATA AGTAATTGTG CGCCTTCAAC TTGAATATCT TTTACAACTG 2160 

GCGCGTCGAT ATACATATCA TACTGACCAC CGCCTACTGC ACGATAATA TTTACACAAA 2220

TTGTATATGT CTGCTTTAAA TCAACTGCGT GACCTTGAAT CATCATATTG CTCACACGTT 2280

GTCCCTTTGG TCTTCCAACA TGAATGGTAT AACTTACGCC ACCATATATA TCATAATTAA 2340

AGTGTTGTGG TTTGGGTTCA AGGAAGTCTG CGCTCACACT AACTTCATCA TTTTTCACGT 2400 

CAAAATATTC TGCTGATCGT TCAATGGCTT CTTTAAGTTT GGCACCACTT ACAGCTAAAA 2460
CTTTAAATGT ATTTGGAAAT GGGTAATTGT TAATAACATC TCGCATCGTC ACGACTTGCT 2520

TGAAACCACT AGCAGAATCA AACAAAGCTG TACAGGCAAC ATCTGCGTCA CTTTTTTCTA 2580 

ATAAAGCGTA ATTCATAAAA TTTGTAAAAG GATGCGGTGC CACACGTGCC TCAAATGCAT 2640 

GATTAATCGT CATATCATAT GGCAATGTAG TAATTTCGTA ATCTAACCAG TCCTCTAACT 2700

GCTTTCGTAA ATGTTGGTCA TCTTCATCAA TAGTAAATGT GGAATCATCT ATAACAGGAA 2760

GTAATTCACA TGATTCAACG GATAGATTTT CATATTCATC AGTACTCAAG ACTATCTGC 2820 

CTACAGTTGT ACCTCTCGTA CCAGGTTGAA TCACAGCCGT TTGCTTAAAC CTTTCAGCAA 2880

TTTGTCGATG TTGGTGACCC GTAATAAAGA TATCTATATC TTTAGAAAAC GCTTCTAACA 2940

TGGCATATCC TTCATTTTCA CCCGTTAATA CTTCGGTCGG CGTACCACTT TCTAAATCCT 3000

TTTCAAATCC ACCATGGTAA CAAACCACAA TGATATCTGC ATGTCGCTTC ATTTCAGGTA 3060 

AGTATTGTTG AAGTATTTCA AAAGCACTAT GAAACGTArT GnCnTGAATA TGCTCTGGTT 3120 

GTTCCCAATG GGGAATAAAT TGTGTCGTTA AACCTATCAC ACCAACAGTT TGATCTCCAA 3180 

CCTGAAAATA CTTCACACCG TTATCAGTCA ATGTACATC ATTTTCATAT ATATTAGCGC 3240

ACAAAACTGG ATAATTGAGT CTGCGTAAAG TGTCTTTTAA GTATGGTAAT CCATAATTAA 3300 

ATTCATGATT ACCAAGCGTA CCAAAGTCGA ATGCCATTCG ATTATAAAAA TCAACTAAAG 3360 

GCTGGCTACT GCCGCTATGC GCGATTAAGT AATTACAAAA TGGTGACCCT TGCAAAAAAT 3420

CACCATTATC TATTTTAAAA CTTTGGTCAT ACTGCCTTCT GTsTTGTTCT ATAACATGAT 3480

TCGCTAGTAA CAATCCCATA GGTTGATATT GATTTCTACT CGTAAAATCT GTTGGGAAAA 3540

TATAACCATG TACGTCACTC ACGACATAAA ATGCTATGTT TGACATCCTC ACTCACTCCT 3600 

TCAATCACAA ACATCTTTCT TATTTCTATT ATATATTTAT TTGAAGTCTG TTGTAATCAA 3660

GGTTTTGTCA CCGAGTTTTA AACGAATCTT TGAACCTTCC ATACTTTCAA GTACTTTAGC 3720

ATTGACCTTA ATTGTGACAT TTCCGTTTTC ATCTGCTTTA ACTGTTGGCA AAGTACTGTA 3780

ACCTGGTGGG TTATAATCGT TATCTTTACT TGAAAATTGT CCGIKTTGAC GTCCGCCTTC 3840

TTTGTAAATA CCAACTCTAA GATTTTCAAG TGTTTGATTT GCTTCAAATC CTTTTAAATG 3900

TATTGTCATT TCAAATGGCT CATTTACAGA AACATTTTGC GGGATATCAA ATGTTACTTT 3960

TTCGTTCTGA TTTGGTGGTG TATGATCATC TGGTGTGTTT GGCTGAGGAT CTGCGCCTTT 4020

TTCGCTGCCA TAACTACCTG CTTTAAATGT TGTTGGATCA TACCATTTAT AACCACTCGG 4080

CGGTTGTGAC CATGGCTCTT TTTCAGGCTC AGTTGAACGC TCTGGTCGTT CAAAATCAAG 4140

CAACTTAGTC TTTGTATCTA ATGTTAGGCT ACTCGCCTTA AGTGATTTCC CATCATTATC 4200 

TTTAGACATC CAAGCCGTTA TAT TAUT AA TAGCTTACCG TTGTCTTGTT CTTTAAAACC 4260
ATCATATGTT TTCTTCTTTT CTCCATTATC TTCTCTTACA TATTTGGGCG AACTATCTTC 4320

CACAAGTGAT GAATCACCGA TAAATGCTGC TTTACCTTTT CCAACTTTAG AAATTGCTAC 4380

ATAGGGGCCT TCTGCTTTAC CGCCCCCATT ATAAATACCT TGATCTACAG ETGTGACCA 4440
TTTACTTTTC GCTGGCAATT GTTCTGGTGT ATACACAATA CCTTTTGCTT TCTCTGGATT 4500

AGTAATTGCT AATGTCGATC CGGCATGCAT AGAGACAGAT TTCACACCTT CAGTAATACC 4560

GAAACTTTCT TTTGAAGAAA CAATATTGCT CGTATTTAAA TCACCTAGTG CATTATATCG 4620

AAAACGTACG CCAAAGTTTG TAGATAACCA ATCTGAACTT TTCACACCTT GCATTGCAGT 4680
AGAACTTTTT TCTTCTGCAT TCATACCTTT CGACATATCT TCATATGCTC CACGTCGATA 4740

ACCATTCATT GCCTCCGATG AATCAATACG ATTTAAATTT CGGTCAGCAT TGTAATGATC 4800 

TGAAATAAAG ACAACATTGC CACCTTGTTt CA(ATATTTA ACAATTGCTG CCTGTTCTGA 4860
TTCTTTGAAA GGAATGTTAG CCTCAGGAAT TACAAATATT TTGGAACTTT TCAAACTTGC 4920

TTCTGTTATG TTCGAATGAC CATCAATAGC TTTAACGTCA TAACCTTGTT TTTGTATTGA 4980

ATCCGCATAA TCTGAAAATG CACCATCACT AACCCAATCT GCAGCACCAG CTGTTTGAC 5040
ATGAGAACGA TCGAATAATA CCGTTCGCTG TTGCTTTGTA GGTTGCGATT CATGCGTTAT 5100
AGCTAAAGAT TGCGGTAAAG CACTTAATGA TACCGTTGCA ACAATTGCAG AGACAGTTAA 5160 
TGACTTATAT ATTTTTTTCA TTTTGTGAGG CTCCTTTTAA AATAAATTTG TTCTTGAATT 5220 

ATAGGATAAA AATT03TTGC ATATGAGCAA TTTAACGAAA AATTTACAAA ATCTTATCAA 5280
ACTCTTAAAG AAAGTTATTA AAATTCATTT TTATAAAATA CTTTTTAACA TTTAAATGTG 5340

GTACGCTATA AGTGTAATTT CATTGCATAC ATATTACACG ATTAAGAATG TGAAGGGGAC 5400

AGTTATCAAA TGAAAAATTT TAAGTGTTTA TTTGTATTAA TGTTAGCAGT CATTGTTTTT 5460
GCAGCAGCAT GTGGAAACTC AAGTTCTTTA GATAATCAAA AGAACGCTAG TAATGATTCG 5520
GATTCTAAAT CAGGAGGATA CAAACCTAAA GAATTAACCG TTCAATTTGT ACCTTCGCAA 5580
AATGCTGGAA CATTAGAAGC TAAAGCAAAA CCATTAGAAA AATTACTATC TAAAGAATTA 5640
GGGATTCCAG TTAAAGTGTC TGTATCAACT AACTACAATA CAATTGTTGA AGCTATGAAG 5700

TCTAAAAAAG TTGATGTTGG TTTCTTACCA CCAACGGCAT ACACATTAGC ACATGATCAA 5760

AAAGCAGCTG ATTTATTATT ACAAGCACAA CGTTTCGGTG TAAAAGAAGA TGGTTCAGCA 5820 

AGTAAAGAAC TTGTAGATAG TTATAAATCA GAAATTCTTG TTAAAAAAGA CTCAAAAATT 5880

AAAAGCTTGA AAGATTTAAA AGGTAAGAAA ATTGCCTTAC AAGATGTAAC ATCAACTGCT 5940
GGATATACAT TCCCACTTGC GATGTTAAAA AACGAAGCAG GTATTAATGC AACTAAAGAT 6000

ATGAAAATTG TGAATGTTAA AGGTCATGAC CAAGCAGTTA TCTCATTKT AAATGGAGAt 6060 
GTAGATGCTG CGGCTGTATT TAACGATGCA CGTAATACTG TGAAAAAAGA CCAACCAAAT 6120
GTATTTAAAG ACACACGAAT TTTAAAATTA ACACAAGCTA TTCCGAATGA CACAATTTCT 6180

GTAAGACCAG ATATGGATAA AGATTTTCAA GAAAAATTGA AAAAAGCTTT TATAGACATT 6240

GCTAAATCAA AAGAAGGTCA CAAAATTATT AGCGAAGTTT ATTCACATGA AGGATACACA 6300 

GAAACGAAAG ATTCAAATTT CGACATTGTA AGAGAGTACG AAAAATTAGT TAAAGATATG 6360

AAATAATCAT TATTTAACAA ATGAATCATT AGCGAATTTG GTATTAAAAG CTTTCGTTCA 6420

ATAGATATAT TCTAGATTAA TATTGAAAAG CTAGGCGCTA AACTGAAACA GATATAGAAA 6480

GGTGTCGCTG TACATTTGAA ACCATTTGTA CACAGAAACC CAATGTCTAT GATATTTCAG 6540

TTTACCTTGG CTTTTCTTTA TTAAAGAAAG GTGTCAAACA TGAGTCAAAT CGAATTTAAA 6600 

AACGTCAGTA AAGTCTATCC TAACGGTCAT GTAGGCTTGA AAAATATTAA CTTASTATT 6660

GAAAAAGGTG AATTTGCAGT TATTGTCGGA CTATCTGGTG CTGGGAAATC CACGTTATTA 6720

AGATCTGTAA ATCGTTTGCA TGATATCACG TCAGGTGAAA TTTTCATCCA AGGTAAATCA 6780

ATCACTAAAG CCCATGGTAA AGCATTATTA GAAATGCGCC GAAATATAGG TATGATTTTC 6840

CAACATTTTA ATTTAGTTAA ACGGTCAAGT GTATTACGAA ATGTACTAAG TGGACGTGTA 6900 

GGTTATCACC CTACTTGGAA AATGGTATTA GGTTTATTCC CAAAAGAAGA CAAAATTAAG 6960

GCAATGGATG CACTAGAACG CGTCAATATC TTAGATAAAT ATAATCAACG CTCTGATGAA 7020 

TTATCAGGTG GCCAACAACA ACGTATATCT ATTGCAOTG CGCTATGCCA AGAATCTGAA 7080

ATTATTCTTG CAGATGAACC AGTTGCTTCA TTAGACCCAT TAACTACGAA ACAGGTTATG 7140

GATGATTTAA GAAAAATCAA CCAAGAATTA GGCATCACAA TTTTAATTAA TTTACATTTT 7200

GTTGACTTGG CAAAAGAATA TGGCACACGC ATCATTGGTT TACGTGATGG TGAAGTTGTC 7260

TATGATGGTC CTGCATCTGA AGCAACAGAT GACGTATTTA GTGAAATATA TGGACGTACA 7320

ATTAAAGAAG ATGAAAAGCT AGGAGTGAAC TAACATGCCT TTAGAAATAC CTACAAAGTA 7380

TGACTCCCTT TTAAAGAAAA AGGTTTCTTT AAAAACGAGT TTTACCTTCA TGTTAATCAT 7440

TGTGCTTATC ATTTGGAGCT TCATTTATAC GGGATTTAAT TTTGGAGATT TAATGATAGG 7500

AATACCTCAA ATAGGTGATC TATTCAAACA AATGATTCCA CCTGATTTCG AGTATTTACA 7560

ACAAATTACA ACGCCAATGT TAGATACCAT TCGAATGGcT ATCGTAAGTA CAGTATTAGG 7620 

TAGCATCGTT TCAATACCAA TTGCGTTATT ATGTGCTAGC AATXCGTTC ATCAAAAGTG 7680

GATTTCAATA CCCTCGCGCT TTATTTTAAA TATAGTTCGT ACTATTCCAG ATTTGTTATT 7740

AGCAGCAATC TTTGTGGCTG TATTTGGAAT CGGTCAAATT CCAGGGATAT TAGCACTGTT 7800

TATTTTAACT ATCTGTATTA TTGGAAAATT ATTATATGAA TCATTGGAAA CGATAGATCC 7860

AGGTCCAATG GAAGCAATGA CGGCTGTTGG CGCTAATAAA ATAAAATGGA TTGTTTTCGG 7920
TGTTGTACCA CAAGCCATAT CGTCATTTAT GTCATACGTA TTATATGCAT TTGAAGTAAA 7980

TATACGTGCT TCAGCTGTGC TTGGATTAGT CGGCGCTGGC GGTATTGGAT TGTTTTATGA 8040

TCAAACACTT GGTTTATTTC AATATCEAAA AACAGCAACG ATTATTTTAT TTACTTTAGT 8100 

TATCGTCGTC GTCATTGATT ACATCAGTAC GAAAGTGAGG GCACATCTCG CATGACACAG 8160

GAAATAGCAA AATATAATGT TCACACAAAA GCACACAAAC GAAAATTGAT TAAAAGATGG 8220 

CTTATTGCAA TTGTCGTCTT AGCTATTATC ATCTGGGCAT TTGCAGGTGT SCAAGTTTA 8280 

GAACTTAAAA GTAAATCATT AGAAATCTTA AAATCCATAT TCAGCGGATT ATTCCATCCT 8340

GATATCAGCT ATATCTATAT ACCAGATGGC GAAGACTTAT TACGTGGTTT ACTTGAAACC 8400

TTTGCGATAG CCGTTGTAGG TACTTTCATC GCCGCAATTA TCTGTATTCC ATTAGCATTT 8460

CTAGGTGCAA ATAATATGGT AAAGCTACGC CCAGTTTCAG GTGTTAGCAA ATTTATTTTA 8520 

AGTGTTATAC GTGTCTTCCC AGAAATTGTA ATGGCACTTA TATTTATCAA AGCTGTTGGC 8580 

CCAGGTTCAT TTTCAGGTGT ATTAGCTTTA GGTATCCATT CCGTAGtATG CTTGGGAAAC 8640

TTTTAGCTGA AGATATTGAA GGTCTAGATT TCiSTGCTGT AGAATCATTA AAGGCCAGTG 8700

GTGCGAATAA GATTAAAACA CTCGTATTTG CAGTCATACC ACAAATTATG CCTGCCTTTC 8760

TATCACTCAT ACTTTATCGC TTTGAACTAA ACTTACGTTC AGCTTCTATA CTGGGGCTAA 8820 

TTGGGGCTGG TGGTATCGGG ACACCACTCA TATTTGCCAT TCAAACACGT TCTTGGGAC 8880 

GTGTAGGTAT TATATTAATC GGTTTAGTAC TAATGGTCGC AATTGTCGAT TTAATTTCCG 8940

GTTCAATCCG AAAACGTATT GTTTAACATT AAATCAGGAT ACTCCTAAAT AAGAAGTCCT 9000 

ACCGTCTTAC GTTTCTCTAT TATAATAAAA ACAGCAGTGA AGAAAACTAT TGTTATAGTT 9060 

AACTTCACTG CTGTTTTTAT AATATCTAAA TTTATTCTAT TTCAATTCCT TTAAATAACT 9120 

TTTACCGAAC TCTGGTAATG TTACGTTGAA ATTATCTGCT ATAGTTGCAC CGATAGAACT 9180

GAATGTAGTA TCACTTTCTA GTGCATGACC ACCTTTAAAT TTCGGACTGT ACATAATTAC 9240

TGGGATATAT TCTCTCGTAT GGTCCGTACC TGGCGCTGTC GGGTCATTAC CATGGTCTGC 9300

TGTAATAATT ACTAAATCGT CTTCTTTTAA GTTGCTAAAC AGTTCTGGCA AGCGATCATC 9360 

GAAATCTTTA ATTGCTTGTG CATAACCTGG TTTATCACGA CGATGACCGT ATAATGCATC 9420 

AAAGTCTACT AAGTTTAAGA AGCTAATACC TGTGaAATCT TTCTTAACAA TTTTCATCAA 9480

TTGATCCATA CCGTCCATGT TACTCTTCGT ACGAACCGCT TCTGTTACAC CTTCACCATC 9540

ATAAATGTCA TTAATTTTAC CGATGGCAAT AACATCATAA CCACCGTCTT TCAAATGATC 9600 

TAAGACAGTT TTACCAAAAG GTTTTAACGC ATAGTCATGT CGATTAGATG TACGTGTAAA 9660 

GTTTCCTGGT TCACCAACAT ATGGACGTGC GATAATACGA CCAATTAAAT ATTTAGGGTC 9720 

TTTTGTCAAC TCACGAACCT TTTCACAAAT ATCATATAAC TCTTCTAATG GGATAATGTC 9780
TTCATGTGCA GCAATTTGCA ATACTGGGTC TGCACTTGTA TAAACAATTA AGTCACCAGT 9840

TTTCATTTGG TGCTCGCCCC ACTCATCGAT AATTTGCGTA CCCGATGCG GTTTGTTAGC 9900 

AACAACTTTA CGACCTGTCA TTTCTTCAAT TTGTTGAATT AACTCTTCAG GGAATCCATT 9960 

AGGGTATACT TTAAAAGGTT GCATAATATT TAATCCCATA ATTTCCCAGT GACCAGTCAT 10020 

TGTATCTTTA CCAACTGAAG CTTCACTCAA TTTAGTATAG TATGCTTCTG GTTGTTCAAC 10080 

TGCATTTACT ACTGGTAATT TATCGATGTT CCCTAGACCT AACTTTTCAA GGTTTGGTAA 10140 

AGTTTGATCG AAACCTTCTA AGGTATGTCT TAAAGTATGT GAACCTTCAT CTTTAAAATC 10200 

AGCTGCGTCT GGCGCTTCAC CAATACCTAC TGAATCCATT ACGATTAAAT GTACACGATT 10260 

AAATGGTCTT GTCATAGCTA TCACTCCCAA AATTTATATA TATTAGTAAT CTGAATCTGC 10320 

TTCTAAACCT TGCATAATTT GAACACCTGC GCTCGCACCA ATACGTGTCG CACCTGCTTC 10380 

AACCATTTTA TTGAAATCTT CTAAATTACG TACGCCACCT GATGCTTTTA CTTCTACATC 10440 

AGCACCTACT GTATCTTTCA TTAATTTAAC GTCTTCTGCA GTCGCACCGC CACOTCAAA 10500 

ACCTGTTGAA GTTTTAACGA AGTCCGCACC AGCCGCTTTT GTTAATTCAC TCGCTTTTAC 10560

AATTTCGTCA TGGTCCAACA ATACCGTCTC AATAATCACT TTTACTGTGT GACCTTTCGC 10620 

AGCTTTAACC ACTGCTTCAA TGTCTTGTTG TACATCATCA AAACGTCCAT CTTTTAATGC 10680 

GCCGATGTTG ATGACCATGT CAATTTCATC TGCACCATTT TGAATTGCAT CTTCTGTTTC 10740

AAATGCTTTC GTTGCAGTTG TCGACGCACC TAATGGGAAT CCTATTACCG TACAAACGAG 10800

CACCTCTGAA TCAGCTAGTC GCTCTGCTGC ATATTTAACA TGTGTTGGAT TCACACATAC 10860 

AGATTTAAAA TTGTATGctT TCGCTTCATC GATGATTGA TCGATTTGCG TACGTGTTGA 10920 

CTCAGGCTTC AATAAAGTGT GATCTATATA TTTCTCAAAT TTCATACTTA CTACTCCTCG 10980 

TGTTATATAA TCTCTTTATT TAATTTTACT ATAAATACGA ATATATCTCG CGAATTTATA 11040

TCTATTTATT TTTCAGATTG TGTCCAACGA TTTAAGCTGA CGAATAAAAC TAATCCGACA 11100

ATACTCATTA AACCTAAAAT AATTAAAATA ATACCGAAAT GTGAACTTAA TGCATCATTG 11160 

CCTGGGAAAT TTAATGCTTT AAAATCGATT AGAGCCGCAG CAATCGCAAT ACCTACAGAT 11220

ACCGCCACAT TAATAATTAA ATTATAAAAA CCAATAGCCA CACCTGTCAT ATTAAGATCT 11280

ATTGTTTTAA TGGCTTCGTT AAGTAAAGGT GCATACATTA AAGCAAAGCT ACCTGCAAAG 11340

AATATCATAG AAATGACGAA GATTGAAATG TGATTACCTA CTGCAAATGC AGGTAAAATC 11400

AAGCTCAGTG CTATTAAAAT AATTGCTGTG ATAATCGCTT GTTTTGAATT CAGATATTCG 11460

CCGATTTTAC CACTTAGTGC ACCAACAATG ACTGCTACTA TATACCCGG TACTAATAAC 11520

AGTGATGTTG TGTCTAGTTG CAGATGATAA ATTTGCTCCA TTATGAATGG GAACGTAAAA 11580

ATATAACCCA ATTGGATAGC ATACATTACA AATACTATAA ATAAAAATGA AGCATAACGT 11640

TTATTTTGGA AAAATGATTT ATTTACTAAT GGACGTTGCG CATTTTTAAT ATATAGCGCA 11700

AAAACGATAA TCGCAATTAA GGCACCAATC ATATATAACC AATTAAAGTT CGTAATAAAC 11760

AGCATGACTG TTGTAGCAGG GGATCCTCTA GAGTCGAnCC TG 11802



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

CTAAAGAAGA TGCGAAACAA GATGTTGATA AACAAGTTCA AGCTTTAATT GACGAAATCG 60 

ATCAAAATCC AAATCTAACA GATAAGGAAA AACAAGCACT TAAAGATCGT ATTAATCAAA 120

TACTTCAACA AGGTCATAAC GACATTAACA ATGCGATGAC AAAAGAAGCA ATTGAACAAG 180

CAAAAGAACG TTTAGCGCAA gCATTGCAAG ACATCAAAGA TTTAGTGAAA GCTAAAGAAG 240

ATGCGAAAAA TGATATTGAT AAACGTGTAC AAGCTTTAAT TGACGAAAT GATCAAAATC 300

CAAATCTAAC AGATAAGGAA AAACAAGCAC TTAAAGATCG AATTAATCAA ATACTTCAAC 360

AAGGTCATAA CGACATTAAC AATGCGCTGA CTAAAGAAGA AATTGAGCAG GCAAAAGCAC 420

AACTTGCACA AGCATTGCAA GACATCAAAG ATTTAGTGAA AGCTAAAGAA GATGCGAAAA 480

ATGCAATAAA AGCCTTAGCT AATGCGAAgc GTGATCAAAT CAATTCAAAT CCAGATTTAA 540

CACCTGAGCA AAAAGCAAAA GCGCTCAAAG AAATTGACGA AGCTGAAAAA CGAGCACTAC 600

AAAACGTTGA GAATGCTCAA ACTATAGATC AATTAAATCG AGGATTAAAC TTAGGTTTAG 660 

ATGACATTAG AAATACACAT GTATGGGAGG TTGATGAACA ACCTGcTGTA AATGAAATTT 720

TTGAAGCAAC ACCTGAGCAA ATCCTAGTTA ATGGTGAACT CATTGTACAT CGTGATGACA 780 

TCATTACAGA ACAAGATATT CTTGCACACA TAAACTTAAT TGATCAGCTT TCAGCAGAAG 840

TCATCGATAC ACCATCAACT GCAACGATTT CTGATAGCTT AACAGCAAAA GTTGA&TTA 900

CATTGCTTGA TGGATCAAAA GTGATTGTTA ATGTTCCTGT AAAAGTTGTA GAAAAAGAAT 960 

TGTCAGTAGT CAAACAACAG GCAATTGAaT CAATCGAAAA TGCGGCACAA CAAAAGATTA 1020 

ATGAAATCAA TAATAGTGTG ACATTAACAC TGGAACAAAA AGAAGCTGCA ATTGCGnAAG 1080

TTAATAAGCT TAAACAACAA GCAATTGGAT CATGTTnAAC AATGGCACCT GGATGTTCCA 1140

TTCAGTTGAA GGAAATTTCA ACAACAAGGA ACAAGCGCCn GATTGGAACA ATTTGA 1196 
(2) INFORMATION FOR SEQ ID NO: 72:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1519 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

CAATCGTTTC AACGCTATTA TCTTTAGACA ACAATTGTAA GCGTGTATGT GCAGTTTCTA 60 

AACAGTCTAT AATTCGAGTT CTTAATTCAG CTGGACATC TTTAAAAATA AAATCCATCG 120 

CTGCAACTTT GTAGACAAAT GTTAAATAGG TAAGTTCACT GTGACTCGTA ACGAAAATAA 180 

TGTTACCAAC TGGGTCATGC TTACGAATTT CACTGCCTAA TTTGATACCA TTAATATCAG 240

TTGAAAGTTG AATATCTAAA AAGTAACAGC CTATGTCATT CATATTTTTA GCTTGCTCAA 300

GCACCTCATA AGGATTATCA GTTGCGAGGG CAATTTCCAT AGGCTTTTCT TCTATCATTA 360

TATAATTTTT AATAATGGTA ACCATGTTTT CTCTTTGTTT TGGATCGTCT TCGCAAATGA 420

AAATTTTCAT ACATTCACAT CCTTATGGCT AGTTGTTAAT AATTTCAACT TTTTGAATAA 480

AGAAACCATT TTCGATAATT GTATCTAATA AGACATTGTC TGCATTATCA GCAATTTCTT 540

TTAAAGTTGA TAGACCTAAA CCACGACCTT CACCTTTAGT AGAAAAACTT TCTTGGAACA 600

ATTCATGAAT GCGTGGTATA TCATCAGCGC ATTTATTCAT AACAATAAAC GTTACTGAAT 660

TTTCACTTTC AATAAATGCA ACGCGAATGA TAGGGTCATC ASTTCAGTT GATGCCTCAA 720

TTGCATTATC AAGAATAATA CCAATACTGC GACTTAAATC GATCATATTC AAGTTAATGC 780

TACTTACTTC ATCGGGTATT TCGATACTAA TCGGAATATT CATTTCTTGT GCACGTAAAA 840 

TTTTCGCAGT AATTAAGCCT TTAATTTCAC GTACTTTAAG ATTCTCGATA CCATTTAATT 900 

TTATAGCATT CATTTGTAAA TTGTCTTTCA TAGGTACAaT ATTTTTATTG AAATAATCAC 960

GTAGGCCAGG CATGTCATCT TCTCGAATGT ATTCTGAAAG TGTCGTTAAG ATATTGACAT 1020 

AATCATGACG GAACTTGCGC ATTTCGTTGT TGATAGCTTC AATCTTCAAT GTATATTCAT 1080

AATAGGTTTC AATTTCTTCT TGATTACGTT TATATTTCAT CTCTTTAAGG AGAAATTGAG 1140

AAATAACAAA TGTTAATATA CTTAAAAATA TAGTGATACC AATAAAAATA AAAGAATACT 1200

GCCTTATTAC TTTAGCTTCA TCCGAGTTTA TTTGTGAATA AAAGAAAAAT AATGAAAAAG 1260

TAAGCAGTAA GATAGTCGAA ATAACTATTA AAAATCCTTT GTTTAGTATT AGATATGGTG 1320

TGCTAATTTT TTTGAGAACT CTATTTATTA TATATGAGAA TAGTATACTA ATAGTCACAT 1380 

AAACTACAAA AAAGCTAGGG AATATTACAA ATATACTATC AGAAATTTTG GTGGATATAT 1440

GCATATATAA CTATATACCT GTAGTTAGCA CnGTnATAGG AATAATCnGG CGAGGTCCAT 1500

AATCCACCAA AATAGAATA 1519



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5445 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

GTAGGAATCT CTTTGTCTTT TTGGGAGGAC ATTTAATATG AATGTATATT TAGCAGAATT 60 

CCTAGGAACT GCAATCTTAA TCCTTTTTGG TGGTGGCGTT TGTGCCAATG TCAATTTAAA 120

GAGAAGTGCT GCGAATGGTG CTGATTGGA? TGTCATCACA GCTGGATGGG GATTAGCGGT 180

TACAATGGGT GTGTTTGCTG TCGGTCAATT CTCAGGTGCA CATTTAAACC CAGCGGTGTC 240

TTTAGCTCTT GCATTAGACG GAAGTTTTGA TTGGTCATTA GTTCCTGGTT ATATTGTTGC 300 

TCAAATGTTA GGTGCAATTG TCGGAGCAAC AATTGTATGG TTAATGTACT TGCSCATTG 360 

GAAAGCGACA GAAGAAGCTG GCGCGAAATT AGGTGTTTTC TCTACAGCAC CGGCTATTAA 420 

GAATTACTTT GCCAACTTTT TAAGTGAGAT TATCGGAACA ATGGCATTAA CTTTAGGTAT 480

TTTATTTATC GGTGTAAACA AAATTGCCGA TGGTTTAAAT CCTTTAATTG TCGGAGCATT 540

AATTGTTGCA ATCGGATTAA GTTTAGGCGG TGCTACTGGT TATGCAATCA ACCCAGCACG 600 

TGATTTAGGT CCGAGAATTG CACATGCGAT TTTACCAATA GCTGGTAAAG GTGGTTCAAA 660 

TTGGTCATAT GCAATCGTTC CTATCTTAGG ACCAATTGCC GGTGGTTTAT TAGGTGCAGT 720 

GGTATACGCT GTATTTTATA AACATACATT TAATATGGT TGTGCAATTG CrATTGTTGT 780

AGTTATTATT ACTTTGATTT TAGGTTACAT TTTAAATAAA TCATCAAAAA AAGGTGATAT 840

CGAATCAATT TACTAAAATA AAAAGAAACG TAAATAGCAT AATTTAACAT GTTTGATTCA 900 

TGGATTATGC TATTTTTTCG CCAAAATTTA AC AG AT T T T G TACAATGGGT TAGCGATTAT 960 

TTTTTAATAA AGGAGATACT ACTAATGGAA AAATATATTT TATCTATAGA CCAAGGAACA 102 0 

ACAAGCTCAA GAGCGATTTT ATTCAATCAA AAAGGGGAAA TTGCAGGGGT AGCACAACGT 1080 

GAGTTTAAGC AATATTTTCC ACAATCAGGT TGGGTTGAAC ATGATGCAAA TGAAATTTGG 114 0 

ACATCTGTGT TAGCTGTAAT GACGGAAGTA ATTAATGAAA ATGATGTTAG AGCTGATCAA 1200

ATTGCAGGTA TCGGTATTAC AAACCAACGT GAAACAACGG TTGTTTGGGA CAAaCATACT 1260

GGCCGCCCAA TTTATCACGC AATTGTTTGG CAATCACGTC AAACACAATC AATTTGTTCA 1320

GAATTAAAAC AACAAGGATA TGAACAAACA TTTAGAGATA AGfiAGGATT ACTTTTAGAT 1380 

CCGTATTTTG CAGGTACAAA AGTTAAATGG ATTCTAGACA ATGTTGAAGG TGCACGAGAA 1440
AAAGCAGAAA ATGGCGATCT ATTATTTGGA ACGATTGATA CTTGGTTAGT ATGGAAATTA 1500

TCaGGaAAAg CtGCGCATAT TACTGATTAT TCaAATGCGA GTCGTACATT AATGTTTAAT 1560

ATCCATGATT TAGAATGGGA CGATGAGTTA TTAGAACTAt TACAGTACCT AAAAATATGT 1620

TGCCAGAAGT TAAAGCTTCG AGTGAAGTAT ATGGTAAGAC AATTGATTAC CACTTCTATG 1680

GTCAAGAAGT ACCAATCGCT GGAGTAGCTG GTGATCAACA AGCAGCATTA TTTGGACAAG 1740

CTTGCTTCGA ACGTGGTGAC GTGA£AAACA CATATGGAAC TGGTGGCTTC ATGTTAATGA 1800

ATACAGGTGA CAAAGCGGTT AAATCTGAAA GTGGTTTATT AACAACAATT GCTTATGGTA 1860

TTGATGGAAA AGTAAATTAT GCGCTTGAAG GTTCCATCTT TGTTTCGGGT TCAGCAATCC 1920

AATGGTTACG TGATGGATTA AGAATGATTA ATTCAGCACC ACAATCAGAA AGTTATGCGA 1980

CACGAGTTGA CTCTACTGAG GGTGTTTATG TTGTTCCAGC TTTTGTAGGT TTAGGAACAC 2040 

CATATTGGGA TTCTGAAGCA CGTGGTGCGA TTTTCGGTTT AACACGTGGA ACTGAAAAAG 2100

AGCACTTTAT CCGTGCAACT TTAGAATCAC TATGTTACCA AACTCGTGAC GTTATGGAAG 2160 

CAATGTCAAA AGACTCTGGT ATTGATGTCC AAAGTTTACG TGTCGATGGT GGTGCAGTTA 2220 

AAAATAACTT TATTATGCAG TTCCAAGCAG ACATTGTTAA TACTTCTGTT GAAAGACCTG 2280

AAATTCAAGA AACTACAGCT TTAGGTGCTG CATTTTTGGC AGGTTTAGCA GTTGGATTCT 2340 

GGGAGAGTAA AGATGATATC GCTAAAAACT GCAAATTAGA AGAAAAATTC GATCCGAAAA 2400

TGGATGAAGG CGAAAGAGAA AAATTATATA GAGGTTGGAA AAAAGCTGTT GAAGCAACAC 2460

AAGTTTTTAA AACAGAATAA ACTTGTAGAT TAGACTTTTG TATAAACATT GTGATACAAT 2520 

CAATTTAAGT TAATATTTGA ATCGAGAAGC GAGAGATTTG TTCGAACATG TACAATTKA 2580 

GGATTGTTTA TGTGGGACAA GTCTCTCGTT TTTACATTTT TTAGGAGGCG TTTTGGAATG 2640

GCATTGTCTA CTTTTAAGAG AGAACATATT AAAAAGAATT TAAGAAATGA TGAATATGAT 2700

TTAGTAATTA TTGGTGGCGG TATTACAGGT GCAGGTATTG CACTAGACGC GAGTGAAAGA 2760

GGAATGAAAG TTGCATTAGT TGAAATGCAA GACTTTGCAC AAGGAACAAG CTCAAGATCT 2820 

ACAAAATTAG TCCATGGTGG TTTGCGTTAC TTAAAACAAT TCCAAATTGG AGTAGTTGCC 2880 

GAAACTGGTA AAGAACGTGC GATTGTTTAT GAAAATGGGC CTCATGTTAC GACTCCAGAG 2940

TGGATGCTTT TACCAATGCA TAAAGGTGGA ACATTTGGTA AATTCTCAAC ATCAATTGGT 3000

TTAGGAATGT ATGATCGTTT AGCAGGTGTT AAGAAGTCTG AACGTAAAAA AATGTTATCT 3060

AAAAAAGAAA CTTTAGCTAA AGAACCATTA GTTAAAAAAG AAGGTCTAAA AGGCGGCGGT 3120

TACTATGTTG AATATCGTAC TGACGATGCG CGTTTAACTA TTGAAGTTAT GAAGCGTGCT 3180

GCTGAAAAAG GCGCAGAAAT TATCAACTAT ACTAAATCTG AACACTTCAC TTATGATAAA 3240
AATCAACAAG TAAATGGTGT TAAAGTTATA GATAAATTAA CTAATGAAAA TTATACAATT 3300

AAGGCTAAAA AAGTGGTTAA TGCAGCAGGT CCATGGGTTG ATGATGTTAG AAGTGGTGAT 3360 

TATGCACGCA ATAATAAAAA ATTACGTTTA ACTAAAGGTG TACATGTTGT TATTGATCAA 3420
TCAAAATTCC CATTAGGTCA AGCAGTATAC TTTGATACTG AAAAAGATGG AAGAATGATT 3480

TTTGCAATTC CACGTGAAGG AAAAGCGTAT GTAGGTACTA CAGATACATT CTATGACAAT 3540

ATCAAATCTT CACCATTAAC TACACAAGAA GACAGAGACT ATTTAACCGA TGCGATTAAT 3600
TACATGTTCC CTAGTGTTAA TGTTACAGAT GAAGATATTG AATCAACATG GGCAGGAATT 3660

AGACCATTAA TTTACGAAGA AGGCAAAGAC CCTTCTGAAA TCTCTCGTAA GGATGAAATT 3720
TGGGAAGGTA AATCAGGTTT ATTAACTATT GCAGGTGGTA AATTAACAGG CTATCGTCAC 3780

ATGGCTCAAG ACATTGTTGA TTTAGTATCT AAACGCTTGA AAAAAGACTA CGGTTTAACA 3840

TTTAGTCCAT GTAATACAAA AGGTCTGGCA ATTTCAGGTG GCGATGTAGG TGGTAGCAAG 3900

AACTTTGATG CGTTTGTAGA GCAAAAAGTA GATGTAGCTA AAGGATTCGG CATTGATGAA 3960 

GATGTTGCAA GACGTTTAGC ATCTAAATJT GGTTCAAATG TTGATGAATT GTTCAACATT 4020
GCGCAAACAT CTCAATACCA TGATAGCAAG TTACCATTAG AAATTTATGT AGAACTTGTT 4080

TATAGTATTC AACAAGAAAT GGTATACAAA CCTAACGATT TCTTAGTTCG TCGTTCTGGT 4140

AAAATGTATT TCAATATTAA AGATGTATTA GATTATAAAG ATGCTGTCAT CGAHTTATG 4200
GCAGATATGC TTGATTACTC TCCAGCTCAA ATTGAAGCAT ATACTGAAGA AGTTGAGCAA 4260

GCAATTAAAG AAGCGCAACA TGGaAATAAT CAACCAGCAG TTAAAGAATA AtTAATTTGT 4320

ACAATCATAA ACTGGTGTCC TGTTTTAAGG GCATCAGTTT TTTTATACGA GATACATTAG 4380

TCAATTGAAA CTATGCATCA CTAAATGTAT GATAATAATA AAAGCGAATA TAAAAGCGTT 4440

GTTATTAAAG GTGTGAGATG ATGACTGAAA AACAATTTAA ATTAACTGTA CAAGATAATA 4500

CGAATATTGA AGTTAAAGTG AATTTTACAG ATGTAGATTC AAAAGGAATT ATTCATATAT 4560

TTCATGGTAT GGCTGAACAT ATGGAACGTT ACGAT^ATT AGCACATGCA CTTTCAAAGC 4620

ATGGCTTCGA TGTGATACGT CATAATCATC GAGGACATGG TATTAATATT GATGAATCAA 4680

CAAGAGGGCA TTACGATGAT ATGAAACGAG TTATCGGTGA TGCCTTTGAA GTAGCGCAAA 4740

CAGTGAGAGG CAATGTTGAT AAACCATACA TTATAATCGG ACATTCAATG GGATCCGTTA 4800

TAGCTAGATT GTTTGTAGAA ACATATCCGC AATATGTTGA TGGTCTAATT TTAAGTGGTA 4860

CTGGTATGTA TTCATTATGG AAAGGTTTAC CAACCGTTAA AGTGTTACAA CTGATTACAA 4920

AAATTTATGG TGCTGAGAAA CGAGTTGAAT GGGTTAACCA GTTAGTATCA AATAGTTTTA 4980

ATAAAAnnAT ACGTCCATTA CGTACACAAA GTGATTGGAT TTCTAGTAAT CCAATTGAAG 5040

TAGATAaCTT TATTAAAGAT CCATATAGTG GaTTTAATGT GTCAAATCAA TTATTATATC 5100 
AAACAGCCTA TTATATGCTA CATACATCAC AATTAAAAAA TATGAAAATG TTAAaTCATG 5160

CCATGCCTAT ATTATTAGTT TCAGGATATG ACGATCCTTT AGQJGATTAT GGTAAAGGGA 5220 

TTTTAAAATT GGCGAATATA TATAGAAACG CTGGCATnAA AAATGTTAAA GTGAATCTTT 5280

ATCATCATAA ACGTCATGAA GTGTTATTTG AAAAnGATCA TGACnAAATT TGGGAAGACT 5340

TGTTTAAATG GTTGAATCAA TTTTATAAAA AATAAAGAAA GTGGAATTAA ATATGAATAA 5400

AAATAAGCCT TTTATTGTAG TAATTGTGGG GCCAACTGCT TGCAG 5445



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
TGGCTTGAAC TACGCCAATA AGTCCCCCTA GTACAAGAAT GAATACCATG ATATCGACCG 60
CTTCTATCGT ACCTTCAACC ATGCTACTTG TTATTTGTTC TGGTCCAGCT GGATGTTGCT 120 
TTAATCTTTC ATAAGTATTC GCAATTGATA CCGGCTTATT AATTGCACCT GATTTAAATT 180

GTTCAATCTT AATTTTAACC CCCATTTTGT CTAGTTCCTG TTGCGTACCC GGAACCTTTT 240 
TCACTTGGTT ATGAGGGTTA ACTATCTTTA GTTCTTGGGA TGAAGGTTCG TAAGAAAGTT 300 

TAGAATATGC ACCAGCAGGA ATAACCCATG TTGCTATAAC TGCAACAEC GTTAAAATGA 360
ATAAAATCAT AAATGCACCT GGCATTTTTA ATTTAAATCG TTTTTTCTTT TCACTTksrt 420 
TAATTGTATT TTCCACGGTT TCATCTCCTT CGACATTTAA CCTAGCATTT CTACCTTAAA 480

GATTTTATAA ATATAAATTA AGAAAGTGCA CCCCGCATCA AAATAGAGGC ATTATTTTCA 540

GGGGGTGCAC ATAAATAATA AAAATCATGC ATTTGACATA TAGTAATTGA AAAGCGTTTC 600
AATTCAATTA CTTTTTAATC ACAGTACCTA CTTTACCCTC TAAGGCAGCA TCTAATTCAT 660 
TTAATGATGT TATAAGCACA CTTCCTTTTG GATTGTTTTC AATAAATGAT ATGGCTGCTT 720 

CAATTTTTGG TAACATACTT CCTTTTGCAA ATTGATTTTC GTCTATATAT CGTTTTAATT 780
CATCAACATT TGTTGTTTTC AAAGGCTGTT GGTTTTCAGT GTTAAAATTA ATATATACAT 840 
AATCAATTGC TGTTAAAATA ATCAATTGAT CGCATTGAAT ATTAGCACCC AACAACGCAC 900 

TTGTTTTATC TTTGTCTATA ACTGCATCAA TACCTTTAAA ACCATCATGT TGCTTCTAA 960

TTACTGGTAT ACCTCCACCA CCAGCAGCAA TAACGAGTGT ATCATTTTTA ATAAGTGTTT 1020 

TAATACTCTC TAATTCAATA ATAGAGATGG GTTGTGGTGA AGGAACAACG CGTCTATATC 1080
CTCTTCCAGC ATCTTCAACA AATATAAATC CTTTTTCTTT TTGAATTTGT TCAGCTTCTT 1140

CTTTGTTGTA AAATAACCCA ATTGGTTTTG AAGGATTGTT AAATGCCGGA TCATTTTCAT 1200 

CAACTTCAAC TTGTGTCACT AGTGTTACCA CTTGTTTATC CATTCCAATA GAATGCAATT 1260

CATTTTGTAA GCTTTCTTGT AATTGATAGC CGATGTAAGC TTGACTCATT GCGCCACATT 1320

CAGCAAATGG AAATGCCGGA CCTTGGTTAT GTTCTG(AGC ATAGTTAAGT CCCAAATTAA 1380 

TGCTTCCAAC CTGTGGTCCA TTACCATGAC TAATAACAAT CTCATGTCCT TTTGTnATTA 1440

AyCCTACTAA TGATTtCGCA GTATTTTTAA CAAGCTCGAG TtGgTyCTTG aGGTGATTTn 1500 

CCTAAAGCAT TACCACCTAA TGCTACTACT ATTTTCGCCA TCATATTCAC TTCCTTATAT 1560

CATTTAAAAT TCACCCAATG TAGCAACCAT GaCTGCTTTG ATTGTATGCA TTCTGTTCTC 1620

AGCTTCTTGG AATACAACTG AAGCTTTACT TTCGAATACT TCATCTGTAA CTTCCATTTC 1680

TCGAATACCA TATTTTTCAA AAATTTGTTG ACCTATTTTC GTATCAGCAT TATGGAAAGA 1740 

TGGTAAGCAA TGCTCAAAfiA TAACATTTGG ATTACCAGTT TTATCCATTA TTTCTTTATT 1800 

TACTTGATAT GGTTTCAATA ATTCAAGTCG TTCTTTCCAT ACTTCATCAG GTTCACCCAT 1860

TGATACCCAA ACATCAGTGT AAATTACATC CGAACCTTTT ACaCCTTGGT CaATATCATC 1920

TGTGATTAAT ATGTTGCCaC CATTTTCaGC GGCAATATTT TTAfiGCGAT TTAATAATTC 1980 

ATCTGTTGGA TTTAATTCTT TTGGACAAAC TAAATGGAAG TTCATACCCA TAATGGCAGC 2040

ACCTTGCATT AATGCATTTG CAACGTTATT ACGACCATCT CCAACATATG TAAAGTTAAT 2100 

ATCTGCATAA TCTTTTTTTA AGACTTCTTT TGCTGTTAAG AAATCAGCAA GAACTTGAGT 2160 

AGGATGATCT TCATCAGTTA AACCATTCCA CACTGGTACG CCTGAATACT CAGCTAAAGT 2220 

TTCTACTGTT CTTTGTGAAA AACCACGGTA TTCAATGCCA TCATACATTC CACCAAGCAC 2280 

ACGTGCAGTA TCTTTAGTTG TTTCTTTTTT ACCCATTTGT GATCCAGTTG GGCCTAAATA 2340

AGTTACATTT GCACCTTGAT CATGCCETGC AACTTCAAAT GCACATCGCG TTCTTGTAGA 2400

ATCTTTTTCA AATAACAGTG CAATATTTTT ATTTTTTAAC ATAGGCTTTT CAGTGCCAAT 2460

ATATTTAGCA CGTTTTAAAT CCTCGGAGAG TGTTAATAAG GTTCTACCTC TTGTCGTGAA 2520 

AAGTCTAATA AAGTTAAAAA ACTTCTGTTT CGTAnATTTT TCATTAAnA 2569 
(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1273 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CCTGGAACCA TCCaATCGtG CaAATCtTGa AAGaGAATAC GCAACAACAA TTAAATGTAT 60

TGGAACACTA TATTCCAAAT GACCATCCAG CACTCGTTGA ATTAAAAATA TGGGAACGTT 120

GGTTACATAA ACAAGGTTAC AAAGACATCC ATTTAGATAT TACTGCGCAC CACCTAGATC 180 

CTATTACACA GGTTTATTTA TTCAATGTCA TTTTGCTGAA AATGAATCTC GAGTTTTAAC 240

AGGTGGTTAT TACAAAGGAA GCATCGAAGG GTTTGGATTA GGATTAACAC TTTAAGTAAG 300 

GGAGTATGCA CAATGTTAAG AATCGCCATA GCCAAAGGAC GTCTAATGGA TAGTTTAATT 360

AACTATTTAG ATGTAATTGA ATATACGACA TTATCAGAAA CATTAAAAAA TAGAGAACGC 420

CAATTATTAT TAAGTGTAGA TAATATTGAA TGCATTTTAG TAAAAGGAAG TGACGTGCCA 480

ATCTATGTGG AACAAGGAAT GGCAGACATA GGCATTGTTG GTAGCGACAT ATTAGATGAG 540

CGCCAATATA ATGTTAATAA TTTGTTGAAT ATGCCTTTTG GAGCATGTCA TTTTGEGTT 600 

GCAGCGAAAC CTGAAACGAC CAATTATCGT AAAATCGCAA CGAGTTATGT TCATACTGCT 660 

GAAACATATT TTAAATCAAA AGGTATTGAT GTCGAATTGA TTAAATTGAA TGGCTCTGTT 720 

GAATTGGCCT GTGTTGTAGA TATGGTAGAC GGAATTGTCG ACATCGTTCA AACAGGTACT 780 

ACGCTAAAAG CGAACGGACT GGTTGAAAAG CAACATATTA GTGATATCAA TGCAAGATTA 840

ATAACTAATA AAGCAGCTTA TTTTAAAAAA TCACAATTAA TAGAGCAATT TATTCGCTCT 900 

TTGGAGGTGT CTATTGCCAA TGCTTAATGC ACAACAATTT TTAAATCAAT TTTCATTAGA 960

AGCACCATTA GATGAGTCAT TGTATCCaAT TATTCGC(AT ATTTGTCAGG AAGTTAAAGT 1020

TCATGGAGAT AAGGCTTTGA AAATGTATAA TCTAACATTC GATCATACGA AAACaGATCA 1080

TTTAGaAATT AGTCATGAmC AAATTAAAGC AGCATTTGAC ACATTAGATG AAAAAACAAA 1140

ACAAGCATTA CAACAAAGTT ATGAAAGAAT TAnAGCATAT CAaGAAaGTA TtaAACAGaC 1200

GaATCAACAG TTAGAAGaAT CAGTGGaGTG tTrTGaAATA TACCATCCmC taGaAAGTGT 1260

CGGTATTTAT GTG 1273 
(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

GTTGATAAAT TAAAAATGTT TTTATCAGAT ATTCAAAGTT ACCAACAATA TAGTAAAGAT 60 

CATCCGGTGT ATCAGTTAAT TGATAAATTT TATAATGATC ATTATGTTAT TCAATACTTT 120 
AGTGGACTTA TTGGTGGACG TGGACGACGT GCAAATCTTT ATGGTTTATT TAATAAAGCT 180

ATCGAGTTTG AGAATTCAAG TTTTAGAGGT TTATATCAAT TTATTCGTTT TATCGATGAA 240

TTGATTGAAA GAGGCAAAGA TTTTGGTGAG GAAAATGTAG TEGTCCAAA CGATAATGTC 300 

GTTAGAATGA TGACAATTCA TAGTAGTAAA GGTCTAGAGT TTCCATTTGT CATTTATTCT 360

GGATTGTCAA AAGATTTTAA TAAACGTGAT TTGAAACAAC CAGTTATTTT AAATCAGCAA 420

TTTGGTCTCG GAATGGATTA TTTTGATGTG GATAAAGAAA TGGCATTTCC ATCTTTAGCT 480

TCGGTTGCAT ATAGAGCTGT TGCCGArAAA GAACTTGTGT CAGAAGAAAT GCGATTAGTC 540

TATGTAGCAT TAACAAGAGC GAAAGAACAA CTTTATTTAA TTGGTAGAGT GAAAAATGAT 600

AAATCATTAC TAGAACTAGA GCAATTGTCT ATTTCTGGTG AGCACATTGC TGTCAATGAA 660

CGATTAACTT CACCAAATCC GTTCEATCTT ATTTATAGTA TTTTATCTAA ACATCAATCT 720 

GCGTCAATTC CAGATGATTT AAAATTTGAA AAAGATATAG CACAAATTGA AGATAGTAGT 780

CGTCCGAATG TAAATATTTC AATTGTGTAC TTTGAAGATG TGTCTACAGA AACCATTTTA 840 

GATAATGATG AATATCGTTC GGTTAATCAA TTAGAAACTA TGCAAAATGG TAATGAAGAT 900

GTTAAAGCAC AAATTAAACA CCAACTTGAT TATCGATATC CATATGTAAA TGATACTAAA 960

AAGCCCTCAA AACAATCTGT TTCTGAATTG AAAAGACAAT ATGAAACAGA AGAAAGTGGC 1020

ACAAGTTACG AACGAGTAAG GCAATATCGT ATCGGTTTTT CAACGTATGA ACGACCTAAA 1080 

TTTCTAAGTG AACAAGGTAA ACGAAAAGCG AATGAAATTG GTACGTTAAT GCATACAGTG 1140

ATGCAACATT TACCATTCAA AAAAGAACGC ATATCTGAAG TTGAGTTACA TCAGTATATC 1200 

GATGGATTAA TCGATAAACA TATTATCGAA GCAGATGCGA AAAAAGATAT CCGTATGGAT 1260

GAAATAATGA CATTTATCAA TAGTGATTAT /TTCGATATT GCTGAAGC 1308 
(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1431 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

GATGCCATTn ATnnGTATGC AAGAAGTTGT TCCGGGTTCA GGTGGATTaC CAGTTGGTAC 60 

TGGTGGTAAG ACGTTACTAA TGCTTTCAGG CGGTATAGAC TCACCAGTTG CTGGGATGGA 120

AGTGATGAGA CGTGGCGTAA CAATTGAAGC GATTCATTTC CATAGTCCAC CATTACAAG 180

TGATCAAGCA AAAGAAAAAG TTATTGAATT GACACGTATT TTAGCTGAAC GTGTTGGACC 240 

AATTAAATTG CATATTGTAC CATTTACAGA ATTGCAAAAA CAGGTAAATA AAGTTGTACA 300 
TCCAAGATAT ACAATGACTT CAACGAGACG TATGATGATG CGTGTTGCTG ATAAATTAGT 360

ACATCAAATA GGGGCTTTAG CTATTGTAAA TGGTGAAAAC CTAGGGCAGG TAGCCAGTCA 420

AACACTTCAT AGCATGTATG CAATTAATAA TGTAACTTCT ACTCCTGTAT TACGTCCTTT 480 

ATTAACTTAC GATAAAGAAG AAATTATTAT TAAATCGAAA GAAATTGGTA CATTTGAAAC 540

ATCTATTCAA CCATTTGAAG ATTGTTGTAC AATTTEACC CCTAAAAATC CAGTAACCGA 600 

ACCAAACTTT GATAAGGTAG TCCAATATGA AAGTGTCTTT GATTTTGAAG AGATGATTAA 660

TCGTGCTGTT GAAAATATTG AAACACTTGA AATAACTAGT GATTATAAAA CTATTAAAGA 720 

ACAGCAAACA AACCAATTAA TAAACGACTT TTTATAAATA AAATCCTAGA GTAAATTTAA 780

ACATAAGGGG ATGTTAAACT ATGGATTTGA ACTTAACGAT GATTATAATC ATAATTTTAT 840

TTGGTTTTAT CGCGGCGTTT ATAGATTCGG TTGTAGGGGG TGGCGGTTTA ATTTCTACGC 900

CAGCATTATT AGCAATCGGT CTACCACCAT CTGTGGCTTT AGGTACAAAT AAATTGGCAA 960

GTTCGTTTGG TTCTTTAACT AGTACGATAA AGTTTATAAG GTCCGGTAAA GTGGACTTAT 1020 

ATGTTGTTGC CAAATTATTT GGTTTTGTAT TTTTGGCATC TGCATGTGGC GCATATATTG 1080

CAACGATGGT TCCGTCACAA ATATTGAAAC CTTTAATCAT CATTGCACTT TCGTCGGTGT 1140

TTATATTCAC ATTACTTAAA AAAGATTGGG GCAATACACG CAGTTTACT CAATTTACAT 1200

TTAAGAAAGC CATAATATTT GCAGCACTTT TTATATTAAT CGGCTTTTAT GATGGATTTG 1260

TAGGTGGGGG AACAGGTTCA TTTATGCTTT TTGTATTGTT AGTCTTTGGT TTTGATTTTT 1320 

TAAGTGCAGC AGGAAATGCT AAAGTTTTGA ACTTTGCTTC TAATATAGGT GCGCTTGTAT 1380

TATTTATGGT ATTAGGACAA GTAGATTATG TAATAGGTTT AATTATGGCT A 1431
(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4403 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

AATATTATTT TAAATTCAAT ATTTATTGGT GCATTTATTT TAAACTTATT ATTCGCCTTT 60 

ACCATTATTT TCATGGAAAG ACGTTCTGCC AATTCTATCT GGGCTTGGTT ACTAGTCTTA 120 

GTTTTCTTGC CTTTATTCGG CTTCATTTTA TACTTACTAT TAGGACGACA AATTCAACGT 180 

GACCAAATTT TCAAAATTGA TAAGGAAGAT AAAAAAGGAT TAGAGTTAAT CGTTGATGAG 240

CAATTAGCTG CTTTAAAAAA TGAAAACTTT TCAAATTCCA ATTATCAAAT TGTAAAATTT 300 
AAAGAAATGA TTCAAATGTT GTTATATAAT AACGCAGCAT TTTTAACAC AGACAACGAT 360

TTArrrrtAT ACACAGACGG CCAAGAAAAA TTTGATGACC TAATACAAGA CATCCGTAAT 420

GCTACTGATT ATATTCATTT TCAGTACTAT ATTATTCAAA ATGATGAATT AGGTCGTACC 480

ATTTTAAATG AACTTGGTAA AAAAGCGGAA CAAGGTGTAG AAGTTAAAAT TCTTTATGAT 540 

GACATGGGTT CTCGTGGACT GCGTAAAAAA GGCTTACGCC CGTTTCGCAA TAAAGGTGGA 600 

CATGCTGAAG CATTTTTCCC ATCAAAATTA CCTTTAATTA ACTTGCGTAT GAACAATCGA 660

AACCATCGAA AAATTGTTGT AATAGATGGG CAAATTGGAT ATGTTGGTGG TTTTAATGTT 720

GGTGATGAGT ACTTAGGTAA ATCAAAAAAA TTCGGCTATT GGCGAGATAC GCATTTACGA 780

ATTGTCGGGG ATGCAGTGAA TGCATTGCAA TTACGATTTA TTCTAGATTG GAATTCACAA 840

GCCACACGTG ACCACATCTC CTATGATGAT CGTTATTTCC CAGATGTAAA TTCTGGTGGA 900

ACAATTGGCG TTCAAATAGC TTCTAGTGGT CCTGACGAAG AATGGGAACA GATTAATAC 960

GGCTATTTGA AAATGATTTC ATCTGCTAAA AAATCGATTT ATATTCAATC TCCCTATTTC 1020

ATACCTGATC AAGCCTTTTT AGATTCTATT AAAATTGCGG CATTAGGTGG TGTTGATGTC 1080 

AATATCATGA TTCCTAATAA ACCTGACCAT CCGTTTGTTT TTTGGGCTAC TTTAAAAAAT 1140

GCAGCATCCT TATTAGATGC CGGTGTTAAA GTATTTCACT ACGACAATGG CTTTTTACAC 1200 

TCAAAAACAC TTGTTATAGA TGATGAAATT GCAAGTGTGG GAACAGCTAA TATGGACCAT 1260

CGCAGTTTCA CATTGAATTT CGAAGTCAAC GCTTTTATTT ATGACCAACA AATTGCCAAA 1320

AAATTAAAAC AAGCTTTTAT AGATGATTTA GCAGTAETT CTGAATTAAC AAAAGCACGT 1380

TATGCTAAGC GAAGTCTTTG GATTAAATTT AAAGAAGGTA TTTCACAATT ATTGTCACCT 1440

ATCTTATAAA ATAGAAATAT GAGGAGTGTA aCTTTAATGC AACAATCAGA CGTCATTAGT 1500 

GCTGCCAAAA AATATATGGA ATCTATTCAT CAAAATGATT ATACAGGCCA TGATATTGCG 1560

CATGTATATC GTGTCACTGC TTTAGCTAAA TCAATCGCTG AAAATGAAGG TGTTAATGAT 1620

ACTTTAGTCA TTGAACTCGC ATGTTTGCTT CATGATACCG TTGACGAAAA AGTTGTAGAT 1680

GCTAACAAAC AATATGTTGA ATTGAAGTCA TTTTTATCTT CTTTATCACT ATCAACCGAA 1740

GATCAAGAGC ACATTTTATT TATTATTAAT AATATGAGCT ATCGCAATGG CAAAAATGAT 1800

CATGTCACTT TATCTTTAGA AGGTCAAATT GTCAGGGATG CAGATCGTCT TGATGCTATA 1860

GGCGCTATAG GTGTTGCACG AACATTTCAA TTTGCAGGAC ACTTTGGTGA ACCTATGTGG 1920

ACAGAACATA TGTCACTAGA TAAGATTAAT GATGATTTAG TTGACAGTT GCCACCATCT 1980

GCAATTAAAC ATTTCTTTGA AAAATTACTT AAGTTAGAAT CTTTAATGCA TACAGATACG 2040

GCGAAGATGA TTGCTAAAGA ACGTCACGAC TTTATGATGA TGTACTTGAA ACAGTTTTTT 2100 

ACGGAATGGA ATTGTCACGA CTAGACATTG AAGTTGTAGT ATGATGATGC GATGTAATGG 2160
CGTGTTGTTG TGGAAGCTTG GTGTCATGCC ATGTTACTTT GATGTGTTGT TGTGGGAGCT 2220

TGGTGACATG TCATGCTACT TTGATGTGCT GGTACCACGA TGCGTCTTGA TGTAGTGCTA 2280

TGATGTGGCA TTGCGGTGTT ATGGTGTTAT AGACAGGTTT GGCGTTGATG CCATGTTACT 2340

TTGATGTGCT GGTACCACGA TGCGACTTGA TGTAGTGCTA TGATGTGGCA TTGCGGTGTT 2400 

ATGGTGTTAT AGACCGGTTT GATGTTGATG CCATGTTACT TTGATGTGCT GGTGCTACGA 2460

TGCGACTTGA TGTAGTGCTA TGATGTGGCG TTGCGCTGTT ATGGTGTTAT AGCCAGGTTT 2520

GGTGTTGATG TCATGCCGTT ACGATTCTAT GATATGTTGT TGGGACGTTG fiATGTGTAT 2580

TATGCCGTTG TGACGTTATT ATTTCACACT GTTACATGTA TAAGTGAATT GCTGTGGAAA 2640

TTTGCGACAT ATACTGCTAC ACTGATGAAT CATTGTGTCA AGATGACATT GCGATGAAGA 2700

ATGACAACTC TGTTATTAAC CACTTTTTAC ATACTGAAAA CTCGTTAATA TTATTTCAAA 2760

TAAAAACAGC AGTAGGATGA CTTTCACATT TGAAATCATC TTACTGCTGT TTCTATTTAT 2820

CACATATTGT ATAATGTGAC ACTAAGTTTC GCTATTGAAG CGAAAAATAA TGTGCGCCCT 2880

ATAAAGTTAA AATTATCTTC AACTTTTAGG GTGCACATTA TTTGGACTTG CTAAGGTTAT 2940

TTCTTTTTCT TTTTAGACAC AACTTGTGTG TTHTGCCTT TTTTATTGct GCCGCCGTTG 3000 

TGCTCTCTTT CATACGCTTC AATGAAAGGT TGTACTTCTT TTTTAGCGAC TTTTTCATAA 3060

TAAATGTTCG CAAAGTGTGT TTGAACTACA AGGAACGCCG CACTGACTGA CCAGTACAAA 3120 

CCAAGTGCTG ATGCTGAGCT TAATGAAATC CAGATAATCA TAATTGGTGA AATGACCAT 3180

ATCATGTAAC CCATTTGACG TTGTTCGTCT GGCATCGTTT TACTTGATAC ATATGCTTGG 3240

ATAAAGTATA AAACACCGGC AATAATTGTA ATCCAAATAT CAGGACGTCC TAAATCGAAC 3300

CATAAGAAGT GTGGATATTT AAACAAACCA TCTACAAGTT GGTCTTTAAG TACAAAGTAT 3360

AATCCCATGA TGAT1GGTAA TTGGATTAGC ATTGGTAAAC AACCCAACAT ACTCTTAATC 3420 

GGGTTCATGT CATACTTTTT ATATACTTGC ATTAATTCTT GGTTTGCAGC CATTTTTTCT 3480

TCTTGTGTAC GCGnCaCGTT cACTTTTTCT TGAATTTTTT CAACTTCTGG CTTTGCAACT 3540 

TTCATTTTTT GACGCATCAT ATGACTATTT TTATAGTTTG ACAACATGAA TGGTAATAAA 3600

ATAATACGAA TTACCAATAC AAGGATAATA ATAGCTAAAC CATAATTGTC GTTTAATAAG 3660

TTATTTCCCA ACCAATCCAA TACATTTTTC ATTGGATCTA CGAATGTATT GTAGAAAAAy 3720 

cwCtACGTTT TTCAGGTTTA GAATAGTCAC AACCAGCCAA AAAGACCATA ATACCTAAAA 3780

ATAATGGTAG TAACGCTTTT TTCTTCATTT TTCCACCTCT ATCATTATAT TCACATAGGA 3840

TTTATTCTAT CACATTAATG AGTACGTATG AAACAATAAG TGGAAAAATT TAACTAATTA 3900 

TTAAAAAAAT CTTTGAATCG ATTAACAGTC TTTTCAATAT TTTCACTTTT AGAAATGGCT 3960
GAAATGACTG AAATTCCATT GGCACCTGCT TCTACAATCG GCGCCACATT ATTAGTATTG 4020

ATACCGCCAA TAGCTACAAT CGGTAGTTGC GGATTCATTT CTTTAAACGT TGCAATCATT 4080

TCTGGACCTA CTGGTATATG CGCGTCATGC TTCGACGGCG TAGGATAGAT TGGTCCAACA 4140

CCTATATAAT CmACATGAGT TAAATCAGAT TTTGCATACT CATCTAASC ACTAATACTA 4200

AGTCCAATAA TTTTATCAGT GAAATATTGT GCTATCTCTT TGACTTTCGC ATCATCTTGA 4260

CCGACATGTA TACCATCCGC GTTAATTTCT TTTGCCAAGG ATACATCATC ATTAACGATA 4320

AAAGGCACAT CATATTGATG ACAGAGATGC TGTAATTCTT TAGCTAATAC AAGTTTATCG 4380

TTTCCTTTTA AAGCTGATTC ACC 4403



(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1808 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

TGGAnCCAAT ATTAGAAATG ATTAAAACAT TAACAGGTAT TAATAGTCCT TCAGGAGnCA 60

CAGAnGAAGC AATTCAATTT GTCGAAAAAT ATGCAAAAGA CTTGGGTTAT CAAACAACAC 120

TAACAAATAA AGGTGCGTTA TTAATAA:AG TGCCAGGCAA AAATGATGAA GTACAACGCT 180

GTATTACTGC TCATGTTGAT ACTTTAGGTG CaATGGTTAA AGAAATTAAA GAAGATGGTC 240

GCTTaGCAAT AGAATTAATT GGAGGATTCA CGTATAACGC GATTGAGGGT GAATATTGCC 300 

AAATTAAAAC TGATGCTGGT CAAATATATA CAGGAACAAT TTGTCTGCAT GAACAAGTG 360

TTCATGTATA TAGAAATAAT CATGAAATAC CTAGAGATCA AAAGCATATG GAAATAAGAA 420

TTGATGAAGT AACTACATCA GAAGAAGATA CAAAGAGTTT AGGTATTTCA GTAGGTGATT 480

TTGTTAGCTT TGATCCACGT ACAGTTATCA CGTCATCAGG TTTTATTAAA TCTCGTCATT 540 

TAGATGATAA AGCTAGCGTA CGgTtGATAC TACAATTACT AAAGAAATTA AAAGAAGAGC 600

AAATAATATT ACCACATACA ACGCAATTTT ATATTTCTAA TAACGAAGAA ATAGGTTACG 660

GTGCAAATGC ATCAATTGAT TCGAAAATCA AAGAATATAT TGCATTAGAT ATGGGCGCGT 720

TGGGAGACGG TCAAGCATCG GATGAATATA CAGTTCTAT TTGTGCCAAA GATGCTTCAG 780

GTCCATATCA TAAGCAATTG AAATCGCACC TAGTTAATCT TTGCAAAATA AATAACATTC 840

CATATAAAGT AGACATATAT CCATATTATG GTTCAGATGC TTCAGCAGCT TTACATGCTG 900

GTGCGGATAT CAGACATGGT TTATTTGGCG CTGGCATTGA ATCATCTCAT GCAATGGAAC 960 

GAACACATAT TGATTCTATT AAAGCGACAG AGAAATTACT ATATGCATAT TGCTTATCAC 1020
CAATTGAGTA AACAATTAGT GTTGACAAAT GTGaACGACC TATGTAATAT AATGAACTAT 1080

AAAAATAATT AGAATTTTCT AAAGAAATAG TAGCAGATAT GAAACGTAGC AAATAGAAAG 1140

CTAATGGGTG ATGGGAATTA GCACGCCATA TCTTGTGAAT TGGACTTTGG AAAACAATTG 1200

AATGAGTTTT GAAAGTGAAC ATGAATTATG TTAACTAAGG TGGCACCACG GTAACGCGTC 1260

CTTACAGGTA TATGCGTTAT GTGGTGTCTT TTTATTTAGA CAAAATGTAG TAGTTAATTA 1320

AAGGTAGCAA CAGAAAGTTA GTGGATGATG TGAACTAACA CGAGATTAA TGAAATTGGG 1380 

TTTTGTCTGC AACAGAAAAA TTATATATAG TAAAGAGTGA ACTATGAATA TTTCGAATAT 1440

TCGGTTAATT TAGGTGGTAC CACGCGTCAc nTCCTTTATA TTGATAAGGA TGCTGGCGCT 1500 

TTTTTGAAAG GAGCGTATAG AATGGATATA TTTTATAAAA AAATAAAAGC AAATGTAACG 1560

CCCGAAGTTT TAGCACAACT TCATTCCAAG AAGaTCATTT TGGAAAGTAC AAATCAACAA 1620

CAAACTAAAG GTCGCTATTC AGTTGTTATT TTTGATATTT ATGGCACTTT AACTTTAGAT 1680

AATGATGTAT TATCAGTAAG TACTTTAAAA GAATCGTATC AAATCACTGA AAGACCGTAC 1740

CATTATTTAA CGACTAAnAT AAATGAAGAC TACCATAATA TTCCAAGATG AGGCAACTTA 1800 

AGTCATTA 1808
(2) INFORMATION FOR SEQ ID NO: 80:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1320 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

TGGTCGTCAA TTTCTTGATT ATATCTATAA TCCTCATTTT CAATATTAGA GTCTGTAGAA 60 

TCATCGATAT TATTATCATT CGCATGACTA GAAGCAGAAT CATTATTTT ATCATTGCTT 120

TCTTCTTTTT TGAAGTCTTT ATTTATCAAG TAAATTTCTT CATCAAAATC AGCTTGTTGA 180 

GATGTATCAT CTTTATTTTG ATTAGAAAAA TGTGTAGCCT TTGATCTTTT TCTTTGCCGT 240

CTTTTCTTAG ATGTATTCCT CGTAAATAAT TCTAATTCAT CTTTATCTTC ATTTGATTCT 300 

TGTTGATCGT TCTTCGTTTT ATCATCCATC AATACTCACA CCCTTTAATA AGATGGTAAA 360

TGGGCACGGA ATCTTTCAAT AAATTTCTCT CCACGCTCTT CAAAAGTACT ATATTGATCC 420

CAACTCGCAC AAGCAGGTGA CAATAATACA ACATCATTTG GTTCTATAAT ATCTTGTACT 480

TTATCAACAG CGTCTTCGAC ATTGTTCGTT TCAATGACCG ATTTCCCTTG ACTATTACCT 540

AGTTTAGCAA ACTTAGCTTT CGTTTGTCCG AATACAACCA TCGCGCGAAC ATTTTCCATA 600 
TAAGGAATGA GTTCGTCAAA TTCATTCCCT CGATCCAAAC CACCACATAA CCAAATGATT 660

GGTTGATTAA ATGAATTTAA GGCAAACTGT GTTGCTAGCG TGTTTGTTGC TTCPGAATCA 720 

TTATAATATT TATTAGTTCT ATTAGTACCA ACATATTGCA ATCTATGCTC TATTCCTGAA 780

AATGTAGTTA AACTATCAAT AATTGCtTTA ATAGGTACAC CAGCanAATA CAAGCAAGCA 840 

CAGCTGCTAA TATATTTcTA AATTATGTTC ACCAGGCAAT ACTAGAtCTT CAGTGTTAAT 900 

AATaCGAACA CCTTTATaAA CGATAAAACC ATCTTtAATA TAAaTACCAT CArCTtCTTG 960

TTGAGTTGAG AAATACAATG TCTTAGCTTT TAATTCTTCC GACTCTATCA CTTGTCTTTG 1020 

ATGATAATTA CAAATCAAAT AATCCTCTTC CGTTTGATTT TTATATATTT GCTTTTTAGC 1080

ATTTTGATAG TTTTCTAAAT TTTCATGGTA ATCTiCATGC GCCGAATAAA TGTTAGTAAT 1140

TATAGCAATG TGTGGTTTAT ACTTTTCGAT TCCAAGTAAC TGGAATGACG ACAACTCTGT 1200

AACTAAATAA TCTGTAGGCT TTACTTCTTG TGCTACTTTA GATGCAACAT AACCAATATT 1260

GCCGGATAAT CTTCCAGTTA AGCGACTTTT TTTAAACATA TCTCCAATTA GAGAAGTAAC 1320
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4280 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

TTTACACCAA TCAAAAAATC GAACTGATAT AAATAAGTAC AAAGCTTATC TATCAATCCG 60 

ATTTAGTTAT AAAACAAAAA AAGCCACAGT AATGTGGCTT TTTGTTATAT TCAGTATCAA 120 

AATGGTATCA ATAGCCATTT TCGGAAGTCA AGAATGGCTT AACAACGCGG TTTAAAGCTA 180 

TCCAATACTA CCTTCCATTT CGAACTTGAT TAAACGGTTC ATTTCGACCG CGTATTCCAT 240

TGGAAGTTCT TTTGTAAATG GTTCGATGAA TCCCATAACA ATCATTTCTG TCGCTTCTTC 300 

TTCAGAAATA CCACGACTCA TTAGATAGAA TAATTGTTCT TCAGAAACTT TTGAAACCTT 360

GGCTTCATGT TCTAATGATA TTTGATCGTT GAATACTTCG TTATATGGAA TTGTATCTGA 420 

TGTTGATTCG TTATCTAAGA TTAATGTATC ACATTCAATA TTTGAACGAG CACCTTTTGC 480

TTTACGTCCA AAATGAACAA TACCGCGATA AATAACTTTA CCACCATTTT TAGAAATAGA 540 

TTTAGAAACA ATTGTAGAAG ATGTATTAGG TGCTTTATGA ATCATTTTAG CACCGGCATC 600 

TTGAACTTGT CCTTTACCAG CAAATGCAAT AGATAATGTA CTACCTTTTG CACCTTCACC 660

TAAAAGAACA CAGTTTGGAT ATTTCATCGT TAACTTAGAA CCTAAGTTAC CATCTACCCA 720

TTCCATATTT CCGTTTTCAT AAACAAAAGT ACGTTTTGTA ACTAAATTGT ATACATTGTT 780
CGCCCAGTTT TGAATCGTAG TATAACGAAC GTGCGCATCT TTATGCACAA TGATTTCCAC 840

AACAGCAGAG TGTAAAGAAC TAGTTGTATA AACTGGTGCA GTACAACCTT CTACGTAATG 900

TACAGAAGCA CCTTCATCAG CAATGATTAA TGTACGTTCA AATTGACCCA TGTTCTCAGA 960

GTTAATACGG AAATAAGCTT GTAGTGGCGT ATCTAGTTTG ATATTTTAG GTACATAAAT 1020 

GAACGAACCA CCTGACCATA CTGCTGAGTT TAACGCCGCA AATTTGTTAT CTGCTGCAGG 1080

TACTACAGAA GCAAAGTATT TTTTGAATAA TTCTTCATTT TCTTGTAAAG CACTATCTGT 1140

ATCTTTAAAG ATAATACCTT TTTCTTCAAG TTCTTTTTCC ATATTATGGT AAACAACTTC 1200 

AGATTCATAT TGAGCAGAAA CACCAGCTAA ATATTTTTGT TCAGCTTCAG GAATTCCTAA 1260

TTTATCGAAA GTTCTTTTAA TTTCTTCTGG CACTTCATCC CATGAACGTT CAGCTTGTTC 1320 

TGAAGGCTTT ACATAGTAAG TAATGTCATC GAAATTCAAT TCTGATAAGT CGCCACCCCA 1380 

TTGAGGCATT GGCATTTTAT AAAACAATTf TAATGATTTA AGACGGAAAT CTAACATCCA 1440

TTCCGGCTCA TTTTTCATGT TAGAAATTTC TCTAACGATA TTCTCAGTTA AACCACGTTC 1500 

TGATCTGAAA ATGGACACAT CATCGTCGTG GAATCCATAT TTATAATCCC CAACATCAGG 1560

TGCTTTTTTA GCCATTTCAA TCACTCCTTA AAAATATATT TACAACGGCG TGCEAAGTT 1620

TTTAATTCAT GATGTAAACC ATATTATAAC AATGACATGA CATCTTATAA AAATTTTTAT 1680

ACTTTTATAT GTCTAATATC AAAATTATCT ATGATTAACA GCATTCTATT CTTCTTCAGT 1740 

CGTACCTTCT GCTTTACCTT CTTTAGCAAC AGTACCTTTT TCCAATGCTT TCCAAGCTAA 1800 

TGTGGCACAT TTAATACGAG CTGGGAATTG AGATACACCT TGCAATGCTT CAATATCTCC 1860

CATTTCTTCT GTAATCACAT AGTCTTCACC AAGCATCATT TTCGTAAATT CTTGGCTCAT 1920

TTGCATTGCT TCTCCAAGTG AATGACCTTT AACAGCTTGT GTCATCATCG ATGCACTTGC 1980 

CATTGAAATC GAACAACCTT CACCTTCAAA CTTAGlATCT TTTATAATGC CGTCTTCTAT 2040

ATCAAATGTT AGTCGTATAC GGTCACCGCA TGTCGGGTTA TTCATATCTA CTGTCATAGA 2100 

CCCGTTATCT AATACACCTT TATTTCTAGG ATTTTTATAA TGATCCATAA TGACAGATCT 2160 

ATATAATTGA TCTAGATTAT TAAAATTCAT AAGAGAAAAA CTCCTTCGTT TGTTTCAAGG 2220

CATTTATTAA CTGATCAACG TCTTCTTTCG TGTTGTATAT ATAAAAACTC GCTCTAGCTG 2280 

TTGAAGACAC ATTTAACCAT TTCATTAACG GTTGCGCACA ATGATGCCCA GCTCTAACCG 2340

CTACACCTTC TGTATCTACG GCTGTAGCAA CATCGTGTGG ATGTACATCT TGTAAATTAA 2400

ACGTTATTAC ACCTGCACGA CGATCCTTTG GCGGGCCATA AATTTCAATT CCTTCAATTG 2460

CAGACATTTG CTCATAAGCA TATATCGTTA ATTCTTGTTC ATATTTATGA ATTGCATCAA 2520

AACCTATGCG TTCTAAATAG CGAATAGCTT CTGCAAGCCC AATTGCTTGA GCAATTAATG 2580 
GAGTACCCGC CTCAAATTTA GTAGGTAAAT CAGCCCATGT TGfiTCATAC TTACTTACAA 2640
AATCAATCAT GTCGCCACCG AACTCAATCG GTTCCATTTT TTGTAGTAAC TCACGTTTAC 2700

CAAATAATAC GCCAATACCT GTTGGTCCAA GCATTTTATG ACCACTAAAA CTATAAAAAT 2760

CAGCATTCAT TTCTTGCATA TCAAGTTTCA TATGTGGTGC TGctTGCGCC CCATCAACAC 2820 

TGATAATTGC ACCATGTTGA TGAGCTATTT CTGCAATGGT TTTAACATCA TTAATTGTAC 2880

CGAGCACATT AGATATATGT GCAATAGCAA CGATCTTTGT TTTATCATTA ATCGTTTGCT 2940

TAATATCCTC GATGTTTAAT TCACCGTCAG CTGTCATTGG TATAAATTTC AATGTCGCAT 3000 

TTTTACGCTT TGCTAACTGT TGCCffiGGAA CAATATTGGC ATGATGTTCC ATTTCAGTGA 3060

CAACAATTTC ATCGCCCTCT TCAACATTTG CATCACCATA GCTATGTGCT ACAAGGTTAA 3120 

TCGACGCAGT TGTTCCGCGT GTAAAAATGA TTTCTTCAAA ATACTTCGCA TTAATAAAAC 3180 

GACGAACGGT TTCACGGGCA TTTTCATAAC CATCAGTTGC CAATGATCCT AATGTATGAA 3240

CACCACGATG AACGTTTGAA TTATAACGCT TGTAGTAATC TTCTAAAACA TTTAACACTT 3300

GCACAGGCGT TTGACTTGTC GCTGTTGAAT CAAGATATGC TAAACGTTTG CCATTGACTT 3360 

TTTGATCTAA TATCGGAAAA TCCTTGATTA CTTCATTAAC GTCAAATGAG TGTTCGGCCA 3420

CTTCATTCAC GACCTTTCTT AAATAAAAAT CCTAATCATT TAAATACTGA CGTTGTATTA 3480

GTCTTATACC AATATCGACA GTCTATATCT ATTACAAACT TTTATTTTCA AAATATTATT 3540

TAGAAACTTT GCGTTCAATT ACTTCTCTCA ATTGACGTTT AACGTCTTCG ATAGGTAATT 3600 

CACGTACTAC TGGATCTAAG AAACCATGTA TffiCAAGACG TTCCGCTTCT CTTTGAGAAA 3660

TACCACGACT CATTAAATAG TAAAGTTGAT CTGGATCAAC ACGACCTACT GATGCAGCAT 3720

GACCAGCTTG TACATCATCT TCATCAATTA ATAAAATAGG ATTCGCGTCA CCACGAGCAT 3780

GTTCAGATAA CATTAATACA CGTGATTCCT GATTAGCAAT TGATTTAGTT CCACCATCT 3840

TAATGTAGCC GATACCATTA AATACAGACG ATGCATGTTC TTTCATAACA CCATGTTTAA 3900 

GGATATAACC ATCTGTTTCT TTACCATATT GTACGATTTT AGATGTTAGA TTAATTTTTT 3960

GTTCGCCTGT ACCTACAACT ACTGATTTAA GTGAACTTGT TGAACGATCA CCAAATAAAT 4020

TTGTTGTATT ATCAATAATT TGGCTACCCT CATTCATTAA ACCTAGTGCC CAATTAATTG 4080

AGGCATCCGC TTCAGTAATA CCACGTCGAA TGATATGACC TGTAAAGCCT TTATCCATAT 4140

AGTCCACTGA GCCATATGTG ATATTTGAAT TTGCACCAGC AATCACTTCA GAAATAATAT 4200

TtAATTGATT TCCTTCACCA GATGCATTTG mTAAGTAATT TTCAACATAT GTGACTTCGG 4260

CGCTTTCTTC AGTAACGATG 4280
(2) INFORMATION FOR SEQ ID NO: 82: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15598 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

TCnGACTCGA ACGGTGmAAC TAttCCGTTG TaATTCCgGA GgAAsCAAGG TATGCCCATC 60 

TGCaAAGAAA gaATGsAATG AACTTTTTGG AAATGTAGAA GTGGTAAATA AAGATAAAGG 120

ATATTACATT CTGAGAAGTA TAAAAGCTTG AAATGAAATG GATATTCTGT TATAGTTATA 180

TAATGTAAAA ATTTATGTTC AATAAGTGTG TACTTTTACG TTAAATAGAT AAGTTAATTA 240

AGAATAAATA TAGAATCGAA AATGGTGTCA TCATTAGTGT TGCCGTTTTC TTTTTGTCTT 300 

TTTATTAATA TGCTTATG3T ATTTAGCTAA AAGCGGATCA CATAATTTTT GAGGGGTGAA 360

TCTGTTTGGC AGGTCAAGTT GTCCAATATG GAAGACATCG TAAACGTAGA AACTACGCGA 420

GAATTTCAGA AGTATTAGAA TTACCAAACT TAATAGAAAT TCAAACTAAA TCTTACGAGT 480

GGTTCCTAAG AGAAGGTTTA ATCGAAATGT TTAGAGACAT TTTCCAATT GAAGATTTTA 540

CTGGTAATTT GTCATTAGAG TTTGTGGATT ACCGTTTAGG AGAACCAAAA TATGATTTAG 600

AAGAATCTAA AAACCGTGAC GCTACTTATG CTGCACCTCT TCGTGTAAAA GTGCGTCTAA 660 

TCATTAAAGA AACAGGAGAA GTTAAAGAAC AAGAAGTCTT TATGGGTGAT TTCCCATTAA 720 

TGACTGATAC AGGTACGTTC GTTATCAATG GTGCAGAACG TGTAATCGTA TCTCAATTAG 780

TTCGTTCACC ATCCGTTTAT TTCAATGAAA AAATCGACAA AAATGGTCGT GAAAACTATG 840

ATGCAACAAT TATTCCAAAC CGTGGTGCAT GGTTAGAATA TGAAACAGAT GCTAAAGATG 900 

TTGTATACGT AC G T AT T GAT AGAAOACGTA AACTACCATT AACAGTATTG TTACGTGCAT 960 

TAGGTTTCTC AAGCGACCAA GAAATTGTTG ACCTTTTAGG TGACAATGAA TATTTACGTA 102 0 

ATACTTTAGA GAAAGACGGC ACTGAAAACA CTGAACAAGC GTTATTAGAA ATCTATGAAC 108 0 

GTTTACGTCC AGGTGAACCA CCAACTGTTG AAAATGCTAA AAGTCTATTG TATTCACGTT 1140

TCTTTGATCC AAAACGCTAT GACTTAGCAA GCGTGGGTCG TTATAAAACA AACAAAAAAT 1200 

TACATTTAAA ACATCGTTTA TTTAATCAAA AATTAGCTGA GCCAATTGTA AAT AC T G AAA 12 60 

CTGGTGAAAT TGTAGTTGAA GAAGGTACAG TGCTTGATCG TCGTAAAATC GACGAAATCA 132 0 

TGGATGTACT T G AAT C AAAT GCAAACAGCG AAGTGTTTGA ATTGCATGGT AGCGTTATAG 138 0 

ACGAGCCAGT AG AAAT T C AA TCAATTAAAG TATATGTTCC TAACGATGAT GAAGGTCGTA 14 4 0 

CGACAACTGT AATTGGTAAT GCTTTCCCTG ACTCAGAAGT TAAATGCATT ACACCAGCAG 1500 

ATATCATTGC TTCAATGAGT TACTTCTTTA ACTTATTAAG CGGTATTGGA TATACAGATG 1560 
ATATTGACCA TTTAGGTAAC CGTCGTTTAC GTTCTGTAGG TGAATTACTA CAAAACCAAT 1620

TCCGTATCGG TTTATCAAGA AT GGAAAGAG TTGTACGTGA AAGAATGTCA ATTCAAGATA 168 0 

CTGAGTCTAT CACACCTCAA CAATTAATTA ATATTCGACC TGTTATTGCA TCTATTAffiG 17 4 0 

AATTCTTTGG TAGCTCTCAA TTATCACAAT TCATGGACCA AGCAAACCCA TTAGCTGAGT 1800 

TAACGCATAA ACGTCGTCTA TCAGCATTAG GACCTGGTGG TTTAACACGT GAACGTGCTC 18 60 

AAATGGAAGT ACGTGACGTT CACTACTCTC ACTATGGCCG TATGTGTCCA ATTGAAACAC 192 0 

CTGAGGGACC AAACATTGGA TTGATTAACT CATTATCAAG TTATGCACGT GTAAATGAAT 1980 

TCGGCTTTAT TGAAACACCA TATCGTAAAG TTGATTTAGA TACACATGCT ATCACTGATC 204 0 

AAATTGACTA TTTAACAGCT GACGAAGAAG ATAGCTATGT TGTAGCACAA GCAAACTCTA 2100 

AAT TAG AT G A AAATGGTCGT TTCATGGATG ATGAAGTTGT ATGTCGTTTC CGTGGTAACA 2160 

ATACAGTTAT GGCTAAAGAA AAAATGGATT ATATGGATGT ATCGCCGAAG CAAGTTGTTT 2220 

CAGCAGCGAC AgcATGTATT CCATTCTTAG AAAATGATGA CTCAAACCGT GCATTGATGG 22 80 

GTGCGAACAT GCAACGTCAA GCAGTGCCTT TGATGAATCC AGAAGCACCA TTTGTTGGTA 2340

CAGGTATGGA ACACGTTGCA GCACGTGATT CTGGTGCGGC TATTACAGCT AAGCACAGAG 2 4 00 

GTCGTGTTGA ACATGTTGAA TCTAATGAAA TTCTTGTTCG TCGTCTAGTT GAAGAGAACG 24 60 

GCGTTGAGCA TGAAGGTGAA TTAGATCGCT ATCCATTAGC TAAATTTAAA CGTTCAAACT 2520 

C AGGT AC AT G TTACAACCAA CETCCAATCG TTGCAGTTGG AGATGTTGTT GAGTATAACG 2580 

AGATTTTAGC AGATGGACCA TCTATGGAAT TAGGAGAAAT GGCATTAGGT AGAAAC G T AG 2 64 0 

TAGTTGGTTT CATGACTTGG GACGGTTACA ACTATGAGGA TGCCGTTATC ATGAGTGAAA 2700 

GACTTGTGAA AGATGACGTG TATACTTCTA TTCATATTGA AGAGTACBAA TCAGAAGCAC 27 60 

G T GAT AC T AA GTTAGGACCT GAAG AAAT C A CAAGAGATAT TCCTAATGTT TCTGAAAGTG 2 82 0 

CACTTAAGAA C T TAG AC GAT CGTGGTATCG TTTATATTGG TGCAGAAGTA AAAGAT GGAG 2880 

ATATTTTAGT TGGTAAAGTA ACGCCTAAAG GTGTAACTGA GTTAACTGCC GAAGAAAGAT 2940 

TGTTACATGC AATCTTTGGT GAAAAAGCAC GTGAAGTTAG AGATACTTCA TTACGTGTAC 3000 

CTCACGGCGC TGGCGGTATC GTTCTTGATG TAAAAGTATT CAATCGTGAA GAAGGCGACG 30 60 

ATACATTATC ACCTGGTGTA AACCAATTAG TACGTGTATA TATCGTTCAA AAACGTAAAA 3120 

TTCATGTTGG TGATAAGATG TGTGGTCGX: ATGGTAACAA AGGTGTCATT TCTAAGATTG 3180 

TTCCTGAAGA AGATATGCCT TACTTACCAG ATGGACGTCC GATCGATATC ATGTTAAATC 32 4 0 

CTCTTGGTGT ACCATCTCGT ATGAACATCG GACAAGTATT AGAGCTACAC TTAGGTATGG 3300 

CTGCTAAAAA TCTTGGTATT CACGTTGCAT CACCAGTATT TGACGGTGCA AACSTGACG 33 6 0 

ATGTATGGTC AAC AAT T G AA GAAGCTGGTA TGGCTCGTGA TGGTAAAACT GTACTTTATG 34 20 
ATGGACGTAC AGGTGAACCA TTCGATAACC GTATTTCAGT AGGTGTAATG TACATGTTGA 3480

AACTTGCGCA CATGGTTGAT G AT AAAT T AC ATGCGCGTTC AACAGGACCA TATTCACTTG 354 0 

t T AC AC AAC A ACCACTTGGC GGTAAAGCGC AATTCGGTGG ACAACGTTTT GGTGAGATGG 3600 

AGGTATGGGC ACTTGAAGCA TATGGTGCTG CATACACATT ACAAGAAATC TTAACTTACA 3660 

AATCCGATGA TACAGTAGGA CGTGTGAAAA CATACGAGGC TATTGTTAAA GGTGAAAACA 372 0 

TCTCTAGACC AAGTGTTCCA GAATCATTCC GAGTATGAT GAAAGAATTA CAAAGTTTAG 37 80 

GTTTAGATGT AAAAGTTATG GATGAGCAAG ATAATGAAAT C G AAAT G AC A GACGTTGATG 38 4 0 

ACGATGATGT TGTAGAACGC AAAG TAG AT T TACAACAAAA TGATGCTCCT GAAACACAAA 3 900 

AAGAAGTTAC TGATTAATAC GCAATTTACA AAACAGGCAA AAAGAT AC T A AGCTGAATTT 3 9 60 

TATTGATGAT TCAGTTTAGT ACTTTAAGCC ATTTTAAATA AATGCAAATC AATCAAATAG 4020 

CACAGCTAAT CTAAATTGAA GGAGGTAGGC TCCTTGATTG ATGTAAATAA TTTCCATTAT 4 08 0 

ATGAAAATAG GATTGGCTTC ACCTGAAAAA ATCCGTTCTT GGTCTTTTGG TGAAGTTAAA 414 0 

AAACCTGAAA CAATCAACTA CCGTACATTA AAACCTGAAA AAGATGGTCT ATTCTGTGAA 4 20 0 

AGAATTTTCG GACCTACAAA AGACTGGGAA TGTAGTTGTG GTAAATACAA ACGTGTTCGC 4 2 60 

TACAAAGGCA TGGTCTGTGA CAGATGTGGA GTTGAAGTAA CTAAATCTAA AGTACGTCGT 4 32 0 

GAAAGAATGG GTCACATTGA ACTTGCTGCT CCAGTTTCTC ACffiTTGGTA TTTCAAAGGT 4 38 0 

ATACCAAGTC GTATGGGATT ATTACTTGAC ATGTCACCAA GAGCATTAGA AGAAGTTATT 4440 

TACTTTGCTT CTTATGTTGT TGTAGATCCA GGTCCAACTG GTTTAGAAAA GAAAACTTTA 4 500 

TTATCTGAAG CTGAATTCAG AGATTATTAT GATAAATACC CAGGTCAATT CGTTGCAAAA 4 5 60 

ATGGGTGCAG AAGGTATTAA AGATTTACTT GAAGAGATTG ATCTTGACGA AGAACTTAAA 4 620 

TTGTTACGCG ATGAGTTGGA ATCAGCTACT G G T C AAAG AC TTACTCGTGC AATTAAACGT 4 680 

TTAGAAGTTG TTGAATCATT CCGTAATTCA GGTAACAAAC CTTCATGGAT GATTTTAGAT 4 74 0 

GTACTTCCAA TCATCCCACC AGAAATTCGT CCAATGGTTC AATTAGATGG TGGACGATTT 4 8 00 

GCAACAAGTG ACTTAAACGA CTTATACCGT CGTGTAATTA ATCGAAATAA TCGTTTGAAA 4 8 60 

CGTTTATTAG ATTTAGGTGC ACCTGGTATC ATCGTTCAAA AC G AAAAAC G TATGTTACAA 4 920 

GAAGCCGTTG ACGCTTTAAT TGATAATGGT CGTCGTGGTC GTCCAGTTACTGGCCCAGGT 4 980 

AACCGTCCAT TAAAATCTTT AT C T CAT AT G TTAAAAGGTA AACAAGGTCG TTTCCGTCAA 504 0 

AACTTACTTG GTAAACGTGT TGACTATTCA GGACGTTCAG TTATTGCAGT AGGTCCAAGC 5100 

TTGAAAATGT ACCAATGTGG TTTACCAAAA GAAATGGCAC TTGAACTATT TAAACCATTC 5160 

GTAATGAAAG AATTAGTTCA ACGTGAAATT GCAACTAACA TTAAAAATGC G AAG AG T AAA 522 0 
ATCGAACGTA TGGATGATGA AGTTTGGGAC GTATTGGAAG AAGTAATTAG AGAACATCCT 5280

GTATTACTTA ACCGTGCACC AACACTTCAT AGACTTGGTA TTCAAGCATT TGAACCAACT 534 0 

TTAGTTGAAG GTCGTGCGAT TCGTCTACAT CCACTTGTAA CAACAGCTTA TAACGCTGAC 54 00 

TTTGACGGTG ACCAAATGGC GGTTCACGTT CCTTTATCAA AAGAGGCACA AGCTGAAGCA 54 60 

AGAATGTTGA TGTTAGCAGC ACAAAACATC TTGAACCCTA AAGATGGTAA ACCTGTAGTT 5520 

ACACCATCAC AAGATATGGT ACTTGGTAAC TATTACCTTA CTTTAGAAAG AAAAGATEA 55 8 0 

G T AAAT AC AG GCGCAATCTT TAATAATACA AATGAAGTAT TAAAAGCATA TGCAAATGGC 5 64 0 

TTTGTACATT TACACACTAG AATTGGTGTA CATGCAAGTT CGTTCAATAA TCCAACATTT 5700 

ACTGAAGAAC AAAACAAAAA GATTCTTGCT ACGTCAGTAG GTAAAATTAT ATTCAATGAA 5 7 60 

ATCATTCCAG ATTCATTTGC TTATATTAAT GAACCTACGC AAGAAAACTT AGAAAGAAAG 58 2 0 

ACACCAAACA GATATTTCAT CGATCCTACA ACTTTAGGTG AAGGTGGATT AAAAGAATAC 5880 

TTTGAAAATG AAGAATTAAT TGAACCTTTC AACAAAAAAT TCTTAGGTAA TATTATTGCA 594 0 

GAAGTATTCA ACAGATTTAG CATCACTGAT AC AT C AAT GA T G T TAG AC C G TATGAAAGAC 6000 

TTAGGATTCA AATTCTCATC TAAAGCTGGT ATTACAGTAG GTGTTGCTGA TATCGTAGTA 60 60 

TTACCTGATA AGCAACAAAT ACTTGATGAG CATGAAAAAT TAGTCGACAG AATTACAAAA 612 0 

CAATTCAACC GTGGTTTAAT CACTGAAGAA GAAAGATATA ATGCAGTTGT TGAAATTTGG 6180

AC AG AT G C AA AAG AT C AAAT TCAAGGTGAA TTGATGCAAT CACTTGATAA AACTAACCCA 62 4 0 

ATCTTCATGA TGAGTGATTC AGGTGCCCGT GGTAACGCAT CTAACTTTAC ACAGTTAGCA 6300 

GGTATGCGTG GATTGATGGC CGCACCATCT GGTAAGATTA TCGAATTACC AATCACATCT 6360 

TCATTCCGTG AAGGTTTAAC A3TACTTGAA TACTTCATCT CAACTCACGG TGCACGTAAA 6420 

GGTCTTGCCG ATACAGCACT TAAAACAGCT GACTCAGGAT ATCTTACTCG TCGTCTTGTT 64 80 

GACGTGGCAC AAGATGTTAT TGTTCGTGAA GAAGACTGTG GTACTGATAG AGGTTTATTA 65 4 0 

GTTTCTGATA TTAAAGAAGG T AC AG AAAT G ATTGAACCAT TTATCGACG TATTGAAGGT 6600 

CGTTATTCTA AAGAAACAAT TCGTCATCCT GAAACTGATG AAATAATCAT TCGTCCTGAT 6 6 60 

G AAT T AAT T A CACCTGAAAT TGCTAAGAAA ATTACAGATG CTGGTATTGA ACAAAT GT AT 6720 

ATTCGCTCAG CATTTACTTG TAACGCACGA CATGGTGTTT GTGAAAAATG TTACGGTAAA 67 80 

AACCTTGCTA CTGGTGAAAA AGTTGAAGTT GGTGAAGCAG TTGGTACAAT TGCAGCCCAA 68 4 0 

TCTATCGGTG AACCAGGTAC ACAGCTTACA ATGCGTACAT TCCATACAGG TGGGGTAGCA 6900 

GGTAGCGATA T C AC AC AAG G TCTTCCTCGT ATTCAAGAGA TTTTCGAAGC ACGTAACCcT 6960 

AAAGGTCAAG CGGTAATTAC GGAAATCGffi GGTGTCGTAG AAGAT AT T AA AT T AGCAAAA 7020 

GATAGACAAC AAG AAAT T G T TGTTAAAGGT GCTAATGAAA CAAGATCATA CCTTGCTTCA 7 080 
GGTACTTCAA GAATTATTGT AGAAATCGGT CAACCAGTTC AACGTGGTGA AGTATTAACT 7140

GAAGGTTCTA TTGAACCTAA GAATTACTTA TCTGTTGCTG GATTAAACGC GACGAAAGC 7200 

TACT TAT TAA AAGAAGTACA AAAAGT T T AC CGTATGCAAG GTGTAGAAAT CGACGATAAA 72 60 

CACGTTGAGG TTATGGTTCG ACAAATGTTA CGTAAAGTTA GAATTATCGA AGCAGGTGAT 7 32 0 

ACGAAGTTAT TACCAGGTTC AT TAG T T GAT ATTCATAACT TTACAGATGC AAATAGAGAA 738 0 

GCATTTAAAC ACCGTAAGCG TCCTGCAACA GCTAAACCAG TATTACTTGG TAT T AC T AAA 74 40 

GCATCACTTG AAACAGAAAG TTTCTTATCT GCAGCATCAT TCCAAGAAAC AACAAGAGTT 7 500 

C T T AC AG AT G CAGCAATTAA AGGTAAGCGT GATGACTTAT TAGGTCTTAA AGAAAACG T A 7 5 60 

ATTATTGGTA AGTTAATTCC AGCTGGTACT GGTAT<AGAC GTTATAGCGA CGTAAAATAC 7 62 0 

GAAAAAACAG CTAAACCAGT TGCAGAAGTT GAATCTCAAA CTGAAGTAAC GGAATAACAA 7680

GTATATAaCA GAGGCTAATG CTTTAGCCTC TTGTTATTTT TATGTAAATT ATTTGATTTA 77 4 0 

ATGTTGACGA ATTCTCTTGT TCAATGTTAA TATATTAAAG GTTGATGCAA GCAGAACTTT 7800 

GGAGGATAAA TTATTGTCTA AG G AAAAAG T tGCACGCTTT AACAAACAAC ATTTTGTAGT 78 60 

TGGTCTTAAA GAAACGCTTA AAGCGTTAAA GAAAGATCAA GTTACATCTT TGATTATTGC 7 920 

TGAAGACGTT GAAGTATATT TAATGACTCG CGTGTTAAGC CAAATCAATC AGAAAAATAT 7 98 0 

ACCTGTATCT TTTTTCAAAA GCAAACATGC TTTGGGTAAA CATGTAGGTA TTAACGTCAA 8 04 0 

TGCGACAATA GTAGCATTGA TTAAATGAGA ATTAGTAAGT GTTTTACTTA CTAAATTTTA 8100 

TTTAACCTAA AAATGAACCA CCTGGATGTG TGGGATTAAA AAGT GAAGAG AGGAGGACAT 8160 

ATCACATGCC AACTATTAAC CAATTAGTAC GTAAACCAAG ACAAGCAAA ATCAAAAAAT 822 0 

CAGATTCTCC AGCTTTAAAT AAAGGTTTCA ACAGTAAAAA GAAAAAATTT AC T G AC T TAA 82 8 0 

ACTCACCACA AAAACGTGGT GTATGTACTC GTGTAGGTAC AATGACACCT AAAAAACCTA 834 0 

ACTCAGCGTT ACGTAAATAT GCACGTGTGc gTtTATCAAA CAACATCGAA ATTAACGCAT 8 4 00 

ACATCCCTGG TATCGGACAT AACTTACAAG AACACAGTGT TGTACTTGTA CGTGGTGGAC 8 4 60 

GTGTAAAAGA CTTACCAGGT GTGCGTTACC ATATTGTACG TGGAGCACTT GATACTTCAG 8520 

GTGTTGACGG ACGTAGACAA GGTCGTTCAT TATACGGAAC TAAGAAACCT AAAAACTAAG 85 8 0 

AATTTAGTTT TTAATTAAAT CTTAJACTTA AAATATTTAA TATAAGGAAG GGAGGATTTA 8 64 0 

CATTATGCCT CGTAAAGGAT CAGTACCTAA AAGAGACGTA TTACCAGATC CAATTCATAA 8 7 00 

CTCTAAGTTA GTAACTAAAT TAATTAACAA AATTATGTTA GATGGTAAAC GTGGAACAGC 8 7 60 

ACAAAGAATT CTTTATTCAG CATTCGACCT AGTTGAACAA CGCAGgtTCG TGATGCATTA 8820

GAAGTATTCG AAGAAGCAAT CAACAACATT ATGCCAGTAT TAGAAGTTAA AGCTCGTCGC 88 80 
GTAGGTGGTT CTAACTATCA AGTACCAGTA GAAGTTCGTC CAGAGCGTCG TACTACTTTA 8940

GGTTTACGTT GGTTAGTTAA CTATGCACGT CTTCGTGGTG AAAAAACGAT GGAAGATCGT 9000 

TTAGCTAACG AAAT T T TAG A TGCAGCAAAT AATACAGGTG GTGCCGTTAA GAAACGTGAG 90 60 

GACACTCACA AAATGGCTGA AGCAAACAAA GCATTTGCTC ACTACCGTTG GTAAGATAAA 912 0 

AGCTTTTACC CTGAGTGTGT TCTATATTAA TGAATTTTCA TTAAGCGTTC ATGCTTAGGG 918 0 

CATCGCCATA TCTATCGTAT TTATTCAGTA AIVTAAACTG GAAGGAGAAA AAATACATGG 92 4 0 

CTAGAGAATT TTCATTAGAA AAAACTCGTA ATATCGGTAT CATGGCTCAC ATTGATGCTG 9 30 0 

GTAAAACGAC TACGACTGAA CGTATTCTTT ATTACACTGG CCGTATCCAC AArGknGGTG 9360 

AAa C AC AC G A AGGTGCTTCA CAAATGGACT GGATGGAGCA AGAACAAGAC CGTGGTAHA 94 2 0 

CTATCACATC TGCTGCAACA ACAGCAGCTT GGGAAGGTCA CCGTGTAAAC ATTATCGATA 94 8 0 

CACCTGGACA CGTAGACTTC ACTGTAGAAG TTGAACGTTC ATTACGTGTA CTTGACGGAG 954 0 

CAGTTACAGT ACTTGATGCA CAATCAGGTG TTGAACCTCA AACTGAAACA GTTTGGCGTC 9 60 0 

AGGCTACAAC TTATGGTGTT CCACGTATCG TATTTGTAAA CAAAATGGAC AAATTAGGTG 9660 

CTAACTTCGA ATACTCTGTA AGTACATTAC ATGATCGTTT ACAAgCTAAC GCTGCTCCAA 9720 

TCCAATTACC AATTGGTGCG GAAGACGAAT TCGAAGCAAT CATTGACTTA GTTGAAATGA 97 8 0 

AATGTTTCAA ATATACAAAT GAT T TAGGT A C T G AAAT T G A AG AAAT T G AA ATTCCTGAAG 98 4 0 

ACCACTTAGA TAGAGCTGAA GAAGCTCGTG CTAGCTTAAT CGAAGCAGTT GCAGAAACTA 9 90 0 

GCGACGAATT AATGGAAAAA TATCTTGGTG ACGAAGAAAT TTCAGTTTCT GAATTAAAAG 9 9 60 

AAGCTATCCG CCAAGCTaCt AcTAACGTAG AATTCTACCC AGTACTTTGT GGTACAGCTT 10020

TCAAAAACAA AGGTGTTCAA TTAATGCTTG ACGCTGTAAT TGATTACTTA CCTTCACCAC 10080 

TAG AC G T T AA ACCAATTATT GGTCACCGTG CTAGCAACCC TGAAGAAGAA GTAATCGCGA 1014 0 

AAGCAGACGA TTCAGCTGAA TTCGCTGCAT TAGCGTTCAA AGTTATGACT GACCCTTATG 10200 

TTGGTAAATT AACATTCTTC G3TGTGTATT CAGGTACAAT GACATCTGGT TCATACGTTA 10260 

AGAACTCTAC TAAAGGTAAA CGTGAACGTG TAGGTCGTTT ATTACAAATG CACGCTAACT 10 320 

CACGTCAAGA AATCGATACT GTATACTCTG GAGATATCGC TGCTGCGGTA GGTCTTAAAG 10 380 

AT ACAGG T AC TGGTGATACT TTATGTGGTG AGAAAAAT G A CATTATCCTG GAATCAATGG 10440 

AATTCCCAGA GCCAGTTATT CACTTATCAG TAGAGCCAAA ATCTAAAGCT G AC C AAG AT A 10 50 0 

AAAT G AC T C A AGCTTTAGTT AAATTACAAG AAG AAG AC C C AACATTCCAT GCACACACTG 10560 

ACGAAGAAAC TGGACAAGTT ATCATCGGTG GTATGGGTGA GCTTCACTTA GACATCTTAG 10 620 

TAGACCGTAT GAAGAAAGAA TTCAACGTTG AATGTAACGT AGGTGCTCCA ATGGTTTCAT 10 68 0 

ATCGTGAAAC ATTCAAATCA TCTGCACAAG TTCAAGGTAA ATTCTCTCGT CAATCTGGTG 1074 0 
GTCGTGGTCA ATACGGTGAT GTTCACATTG AATTCACACC AAACGAAACA GGCGCAGGTT 10800

TCGAATTCGA AAACGCTATC GTTGGTGGTS TAGTTCCTCG TGAATACATT CCATCAGTAG 108 60 

AAGCTGGTCT TAAAGATGCT ATGGAAAATG GTGTTTTAGC AGGTTATCCT TTAATTGATG 10920 

TTAAAGCTAA ATTATATGAT GGTTCATACC ATGATGTCGA TTCATCTGAA ATGGCCTTCA 10 980 

AAATTGCTGC ATCATTAGCA CTTAAAGAAG CTGCTAAAAA ATGTGATCCT GTAACTTAG 11040 

AACCAATGAT GAAAGTAACT ATTGAAATGC CTGAAGAGTA CATGGGTGAT ATCATGGGTG 11100 

ACGTAACATC TCGTCGTGGA CGTGTTGATG GTATGGAACC TCGTGGTAAT GCACAAGTTG 11160 

TTAATGCTTA TGTAC CACTT TCAGAAATGT TCGGTTATGC AACATCATTA CGTTCAAACA 11220 

CTCAAGGTCG CGGTACTTAC ACTATGTACT TCGATCACtA TGCTGAAGTT CCaAAATCaA 11280 

TCGCTGAAGA TATTATCAAG AAAAATAAAG GTGAATAATA TAACTTGTTT TGACTAGCTA 11340 

GCCTAGGTTA AAATACAAGG TGAGCTTAAA TGTAAGCTAT CATCTTTATA GTTTGATTTT 11400 

TTGGGGTGAA TGCATTATAA AAGAATTGTA AAATTCTTTT TGCATCGCTA TAAATAATTT 114 60 

CTCATGATGG TGAGAAACTA T CAT GAG AG A T AAAT T T AAA TATTATTTTT AATTAGAATA 11520 

GGAGAGATTT TATAATGGCA AAAGAAAAAT TCGATCGTTC TAAAGAACAT GCCAATATCG 11580 

GTACTATCGG TCACGTTGAC CATGGTAAAA CAACATTAAC AGCAGCAATC GCTACTGTAT 11640 

TAGCAAAAAA TGGTGACTCA GTTGCACAAT CATATGACAT GATTGACAAC GCTCCAGAAG 117 00 

AAAAAGAACG TGGTATCACA ATCAATACTT C T C AC AT T G A GTACCAAACT GACAAACGTC 117 60 

ACTACGCTCA CGTTGACTGC CCAGGACACG CTGACTACGT T AAAAAC AT G ATCACTGGTG 11820 

CTGCTCAAAT GGACGGCG3T ATCTTAGTAG TATCTGCTGC TGACGGTCCA ATGCCACAAA 1188 0 

CTCGTGAACA CATTCTTTTA TCACGTAACG TTGGTGTACC AGCATTAGTA GTATTCTTAA 11940 

AC AAAG T T G A CATGGTTGAC GATGAAGAAT TATTAGAATT AGTAGAAATG GAAGTTCGTG 12000 

ACTTATTAAG CGAATATGAC TTCCCAGGTG ACGATGTACC TGBATCGCT GGTTCAGCAT 12 0 60 

TAAAAGCTTT AGAAGGCGAT GCTCAATACG AAGAAAAAAT CTTAGAATTA ATGGAAGCTG 12120 

TAGATACTTA CATTCCAACT CCAGAACGTG ATTCTGACAA AC CAT T CAT G ATGCCAGTTG 12180 

AGGACGTATT CTCAATCACT GGTCGTGGTA CTGTTGCTAC AGGCCGTGTT GAACGTGGTC 12240 

AAAT C AAAG T TGGTGAAGAA GTTGAAATCA TCGGTTTACA T G AC AC AT C T AAAACAACTG 12 300 

TTACAGGTGT TGAAATGTTC CGTAAATTAT TAGACTACGC TGAAGCTGGT G AC AAC AT T G 12360 

GTGCATTATT ACGTGGTGTT GCTCGTGAAG ACGTACAACG TGGTCAAGTA TTAGCTGCTC 12420 

CTGGTTCAAT TACACCACAT ACTGSVTTCA AAGCAGAAGT ATACGTATTA TCAAAAGACG 124 80 

AAGGTGGACG TCACACTCCA TTCTTCTCAA ACTATCGTCC ACAATTCTAT TTCCGTACTA 1254 0 
CTGACGTAAC TGGTGTTGTT CACTTACCAG AAGGTACTGA AATGGTAATG CCTGGTGATA 12600

ACGTTGAAAT GACAGTAGAA TTAATCGCTC CAATCGCGAT TGAAGACGGT ACTCGTTTCT 12660

CAATCCGTGA AGGTGGACGT ACTGTAGGAT CAGGCGTTGT TACTGAAATC ATTAAATAAT 12720

TTCTAATTTC TTAGATTTTA TATAAAAAGA AGATCCCTCA ATCGAGGGGt CTTTTTTTAA 12780

TGTGTAAATT TTGTAATGGC TATTCGATTT AGAAGAACAA TAATTGATGA AAGAC T G AC T 12840 

AATAAAACTT ATAACTGATA ATACTGTTTA AATAAAATTG TTGAGTCTTG GACATTGTAA 12900 
AATGCTCCCT TCAAAGTTTT CATTTTTTCa ATGTCTACTT TGAAGGGAGC ATTTCATTAG 12 9 60 
TTTATGTCTC AGATTCATAT CTTTCAATTA ATTTAAATGC TTAATTTGTT TTAAATACTT 13020 

GCTCTAATTC TATGATTTTT AAAAATACAG CTlCAGCGTA TTTTAATGAT TTTTCATCAA 1308 0 
TATCAAATTT GGGATTATGG TGTGGCGCTG TAATACCTTT ACTTTCATTA CCACAACCAG 1314 0 

TCAGAAAGAA TGCACCTGGT CGTACTTTCA AATAATGTGA AAAATCTTCT C C AAT CAT C A 132 00 

TTAAATCTGA TTCATTAAAG CGTACATGTA AGTCATTTGT TGCTTCTTTA ATAACTTjET 132 60 
ATGCTTTCTC GTTATTATGG ACAGGCAAAT ACCCTTTAAT AT AAT T C AAA TCATAGTTAA 13320 
TATCATTTGC TATTGCTAAA CCTTGTAGAA GCTTATCCAT TTTGTCCATT ACATGATTCT 13380 
GTATATCTGA ATCGAAAGTT CTAACTGTAC CTTTACAAAA TGCTTGATCA GGAATAACGC 13440 

TATCTGTGGT GCCTGCTTGA ATCATTCCAA AT GAAAG T AC AGCTTGTTTA ACTGGATCGA 13500 
TCGTACGTGA AATTATTTTT TGTGCACTTA AAATGAACTC TGCCATGATT ACTATTGGGT 135 60 

CAATGGTTTC ATGAGGTTTG GCACCATGAC CACCACGACC TTTAAATGTG ACGCTAAATT 13 620 

CATCTGGAGA GGCCATGATT GCCCCCGCAC G T G AAT G AAT AG T T C C AG T A GGATAACCAC 13680 

TCCATAAATG TGTACCGTAA ATTCTATCTA CATTTTCCAG AC AT CCAGC A TCTATCATTT 13740 

CTTGAGAACC ACCTGGCATG ATTTCTTCAC CGTACTGGAA TATTAATACA ACATTACCTT 13800 

CTAATAAATG TTTATGTTCA TCTAAAATCT CTGCTACAGT AAGTAAAATT GCTGTATGAC 13860

CATCATGCCC ACACGCATGC ATACATCCTG GATTTTTAGA CTTATAAGGC ACATCGTTTA 13920 

ATTCCTCGAC AGGTAACGCA TCAAAGTCAG CTCTTAATGC AATGGTAGGT CCTGTGCCCA 13980 

AGCCTTTAAA TGTGGCTTTG ATACCATTGC GGCCGATAGG AGTTTCAATA TCACAAGATA 14 04 0 

ACTGGCTTAA TTGGTTAACA ATATAATCAT GTGTTTGAAA TTCTTCAAAA GATAACTCAG 14100 

GATATTGGTG TAAATAACGT CTGAGTTGAA TTGTTTTATT TTCTTTATTA TTTGCTAGTT 14160 

GGAACCAATC TAACACCCTT ATCACTACTT TCTAAAATAA TGTTTATAGT ATAACATTTT 14 220 

ATGAAATTAT CGTACTAAAT GATTGCTTTG AGATATTTTA TCTATGATG ATAAGGCTTT 14 28 0 

CAAGTTATGT AGAATTACTG TATGATAAAG GTATTACCAA ACAATACTTA AGGGGGATTA 14 340 

TATACTGTGG TTCAATCATT ACATGAGTTT TTAGAGGAAA ATATAAATTA TCTAAAAGAA 14 4 00 
AATGGTTTGT ATAATGAAAT AGATACAATT GAAGGTGCAA ACGGACCAGA AATCAAAATC 14460

AATGGGAAAT CAT AC ATT AA CTTATCTTCA AATAATTATT TAGGACTAGC AACAAATGAA 14 520 

GATTTGAAAT CaGctGCAAA AGCAGCTATT GAT AC AC AT G GTGTAGGTGC AGGCGCTGTT 14 580 

CGTACAATCA ATGGTACATT AGATTTACAC GACGAATTAG AAGAAACACT AGCAAAATTT 14 640 

AAAGGAACAG AAGCTGCAAT AGCTTATCm TCAGGATTTA ATTGTAATAT GGCTGCTATT 14 700 

TCAGCTGTCA TGAATAAAAA TGATGCTATT TTATCAGATG AGCTTAATCA TGCATCAATT 14 7 60 

ATTGATGGAT GTCGCTTATC TAAAGCTAAA ATTATTCGAG TTAACCATTC AG AC AT GG AT 14 820 

GATTTACGTG C G AAAGC AAA AGAAGCAGTT GAATCAGGTC AATACAATAA AGTSTGTAT 14 8 80 

ATCACTGATG GCGTTTTTAG TATGGATGGT GATGTGGCTA AATTACCTGA AATTGTAGAA 14 94 0 

ATTGCAGAAG AATTTGGTTT ATTAACTTAT GTTGACGACG CTCATGGTTC AGGTGTTATG 15000 

GGTAAAGGCG CTGGTACGGT TAAACATTTT GGTTTACAAG ATAAAATCGA TTTCCAAATA 15060 

GGTACGCTTT CTAAAGCAAT TGGTGTCGTT GGCGGTTATG TAGCAGGTAC AAAAGAGTTA 15120 

ATAGATTGGT TAAAAGCACA AT C AC G AC C A TTCTTATTCT CTACATCATT AGCACCTGGG 1518 0 

GATACCAAAG CAATAACTGA AGCAGTTAAA AAGTTAATGG ATTCAACTGA ATTACATGAT 15240 

AAATTATGGA ACAAT GCAC A ATATTTAAAA AATGGSTGT CAAAATTAGG ATATGATACA 15300 

GGTGAGTCAG AAACTCCAAT TACACCAGTA ATTATTGGTG ATGAAAAAAC AACTCAAGAA 15360 

TTTAGTAAGC GTTTAAAAGA CGAAGGTGTC TATGTGAAAT CTATCGTTTT CCCAACAGTA 154 20 

CCAAGAGGTA CAGGACGTGT AAGAAATATG CCTACAGCTG CACATACAAA AGACATGTTA 15480 

GATGAAGCAA TTGCGGCTTA T GAAAAAG T A GGAAAAGAAA TGAAGTTGAT TTAATATTTA 1554 0 

TTTATTCCCA CGGCAAATAT TGTCGTGGGC TTTTTTTAAT GTTTAGTTTA TTAACAGT 15598 
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 661 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

AAGTAAATCA ACTTACTGGG ATAAGAATAA AGGCGATTAT AGTAACAAGT TGATTTTATT 60 

CGAAAAACAT TTTGAACCGG TTCTGGGTAT CAAGATGCAA CATAGTGGAG GTCATAGCTT 120

TGGCCACACG ATTATTACGA TTGAAAGTCA AGGAGATAAA GCAGTTCATA TGGGTGATAT 180

ATTCCCAACT ACTGCACATA AAAATCCTCT ATGGGTAACG GCATATGATG ATTATCCTAT 240
GCAATCGATT CGTGAAAAAG AACGCATGAT ACCATATTTT STCAGCAAC AATATTGGTT 300

CTTGTTTTAT CATGATGAAA ACTACTTTGC TGTAAAATAC AGCGATAATG GTGAAAACAT 360

AGATGCATAT ATTTTACGTG AAACATTAGT TGATAATAAC TAAAATAAAG ATGTATTACT 420

AAACAAATTT TCAAAAATAA AAAATTGAGC CACATCCAAT CTTACTAATT AGGGTGTGGC 480

TCATTTTTAA GTTTTACgAT CCAAATCAAA TATGGaTAAA ATTCgTATTA ACGCTCTACa 540

ATGtTAATGA CTTCACCAGT ATATGCATCT GCATAAAAAT CATAATGAAT ATTTTGACCA 600

TTTTTAATAG TTGTAATTCC ACCTTGATAA ACTAAACGGT ATTTATCAGT TTCAGGATGA 660

A 661



(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

GCAGACGGTA CAGCAGTTAA AGTCGCACCA AaACTGTAGT GAATcTAATC GGTGcATTCT 60 

TTTTAGGATT AGTTGTCGCG CTTATATATA TCTTCTTCAA AGTAATTTTC GATAAGCGAA 120 

TTAAAGATGA AGAAGATGTA GAGAAAGAAT TAGGATTGCC TGTATEGGT TCAATTCAAA 180 

AATTTAATTA AGGATGGTTG CTACTTATGT CAAAAAAGGA AAAT AC GACA ACAACACTAT 24 0 

TTGTATATGA AAAACCAAAA TCAACAATTA GTGAAAAGTT TCGAGGTATA CGTTCAAACA 300 

TCATGTTTTC AAAAGCAAAT GGTGAAGTAA AGCGCTTATT GGTTACTTCT GAAAAGCCTG 360 

GTGCAGGTAA AAGTACAGTT GTATCGAATG TAGCGATTAC TTATGCACAA GCAGGCTATA 4 20 

AG AC AT T AG T TATTGATGGC GATATGCGTA AgcCAACACA AAACTATATT TTTAATGAGC 4 80 

AAAATAATAA TGGACTATCA AGCTTAATCA TTGGTCGAAC GACTATGTCA GAAGCAAT T A 54 0 

CGTCGACAGA AATTGAAAAT TTAGATTT3C TAACAGCTGG CCCTGTACCT CCAAATCCAT 600 

CTGAGTTAAT TGGGTCTGAA AGGTTCAAAG AATTAGTTGA TCTGTTTAAT AAACGTTACG 660 

ACATTATTAT TGTCGATACA CCGCCAGTTA ATACTGTGAC TGATGCACAA CTATATGCGC 720 

GTGCTATTAA AGATAGTCTG TTAGTAATTG ATAGTGAAAA AAAT G AT AAr AAGAAGTTA 780 

AAAAAGCAAA AGCACTTATG GAAAAAGCAG GCAGTAACAT TCTAGGTGTC ATTTTGAACA 840 

AGACAAAGGT CGATAAATCT TCTAGTTATT AT C ACT ATT A TGGAGATGAA TAAGTATGAT 900 

TGATATTCAT AACCATATAT TGCCTAATAT CGATGACGGT C C G AC AAAT G AAACAGAGAT 960 

GATGGATCTT TTAAAACAAG CGACAACACA AGGTGTTACA G AAAT CAT TG T AAC AT C AC A 1020 
TCACTTACAT CCTCGATATA CCACACCTAT AGAAAAAGTG AAATCATGTT TAAACCATAT 1080

TGAAAGCTTA GAGGAAGTAC AAGCACTAAA TCTAAAGTTT TATTATGGTC AGGAAATAAG 114 0 

AAT TACCGAT CAAATCCTTA ATGATATTGA TCGAffiAGTT ATTAACGGTA TTAATGATTC 1200 

ACGCTATTTA CTAATAGAAT TTCCATCAAA TGAAGTTCCA CACTATACTG ATCAATTATt 12 60 

TTTCGAATtA C AG AG T AAAG GCTTTGTACC GATTATTGCA CATCCAGAGC GGAATAAAGC 1320 

AATAAGTCAA AACCTTGACA TACTATACGA TTTAATTAAC AAAGGTGCTT TAAGTCAAGT 138 0 

GACAACGGcG TCATTAGCGG GTATTTCCGG TAAAAAAATT AGAAAATTAG CAATTCAAAT 14 4 0 

GATTGAAAAC AAT C T G AC AC ATTTCATCGG TTCAGATGCG CATAACACAG AAATCAGACC 1500 

GTTCTTAATG AAAG AC T TAT TTAATGATAA GAAATTACGT GATTATTATG AAGATATGAA 15 60 

CGGATTTATT AGTAATGCGA AGTTAGTTGT T GAT GAT AAA AAAATTCCTA AACGAATGCC 1620 

ACAACAAGAT TATAAACAGA AAAGATGGTT TGGGTTATAA ACAGCAAATG AGGGGTTTTA 168 0 

TGGCACATTT ATCTGTGAAA TTGCGGCTTT TAATACTAGC ATTAATCGAT TCACTGATAG 1740 

TGACATTTTC AGTATTCGTA AGTTATTACA TTTTAGAACC GIBTTTCAAA ACATATTCTG 1800 

TCAAATTATT AATATTGGCA GCTATATCAC TATTCATATC GC AT CAT ATT TCaGCATTTA 18 60 

TTTTTAATAT GTATCATCGA GCGTGGGAAT ATGCCAGTGT GAGTGAATTG ATTTTAATTG 192 0 

TTAAAGCTGT GACGACATCT ATCGTTATTA CGATGGTGGT CGTGACAATT GTTACAGGCA 198 0 

ATAGACCGTT TTTTAGATTG TATTTAATTA CTTGGATGAT GCACTTGATT TTAATAGGTG 2 04 0 

GCTCAAGGTT ATTTTGGCGT ATTTATCGGA AATACCTTGG AGGTAAGTCA TTTAATAAGA 2100 

AGCCAACTTT AGTTGTTGGT GCTGGTCAAG CAGGTTCAAT GCTGATTAGA CAAATGTTGA 2160 

AAAG T G AC G A AATGAAACTT GAACEGGTAT TAGCAGTCGA TGATGACGAA CATAAACGCA 2220 

ATATCACAAT TACTGAGGGT GTAAAAGTCC AAGGTAAAAT TGCGGATATT CCAGAACTAG 22 8 0 

TGAGGAAATA T AAG AT T AAA AAAAT CAT C A TTGCAATTCC AACTATTGGT CAAGAGCGTT 234 0 

TGAAAGAAAT TAATAATATT TGCCATATGG ATGGCGTTGA GTTATTGAAA ATGCCAAATA 2400

TAGAAGACGT CATGTCTGGT GAGTTAGAAG TGAACCAACT TAAAAAAGTT GAAG TAG AAG 24 60 

ATTTACTAGG CAGAGATCCT GTTGAATTAG ATATGGATAT GATATCAAAT GAATTGACGA 2520 

ATAAAACTAT TTTAGTTACG GGTGCAGGTG GTTCAATAGG ATCAGAAATT T G TAG AC AAG 2580 

TTTGTAATTT CTATCCAGAA CGTATTATTC TACTTGGCCA TGGTGAAAAC AGTATTTATT 2640

TAATCAATCG TGAATTGCGA AATCGCTTCG GwAAAAATGT TGATATCGTT CCTATTATAG 2700 

CGGATGTGCA AAATAGAGCG CGTATGTTTG AAATTATGGA AACGTATAAA CCATACGCAG 27 60 

TTTATCATGC AGCAGCACAC AAGCACGTGC (ETTAATGGA AGACAACCCT GAAG AAG C AG 2820 
TACGTAATAA TATTTTAGGT ACGAAAAATA CTGCTGAAGC TGCTAAAAAT GCAGAGGTAA 2880

AGAAATTCGT TATGATTTCT ACGGATAAAG CCGTTAATCC GCCTAATGTC ATGGGAGCTT 2 94 0 

CAAAGCGAAT TGCAGAAATG AT TAT T C AAA GTTTAAATGA TGAAACGCAT CGAACA&TT 300 0 

TTGTTGCAGT GAGATTTGGT AATGTACTTG GATCGAGAGG ATCTGTGATT CCACTTTTCA 30 60 

AAAGTCAAAT TGAAGAAGGT GGGCCAGTTA CTGTGACACA TCCTGAAATG ACACGTTACT 3120 

TTATGACAAT TCCTGAAGCT T C TAG AC TAG TTTTGCAGGC AGGGGCATTA GCAGAAGGTG 3180 

GCGAAGTATT TGTGCTAGAT ATGGGAGAAC C AG T G AAAAT TGTAGATTTG GCACGTAATT 324 0 

TAATTAAGCT AAGTGGTAAA AAAGAAGACG ACATACGCAT TACTTATACA GGGATTAGAC 3300 

CCGGCGAAAA AATGTTTGAA GAGCTTATGA AT AAAG AT G A GGTTCATCCT GAACAAGTAT 33 60 

TTGAAAAAAT TTATCGTGGC AAAGTACAAC ATATGAAAE TAATGAAGTT GAAGCGATTA 3420 

TTCAAGACAT CGTCAATGAC TTTAGTAAAG AAAAAATTAT TAACTATGCC AATGGCAAAA 34 8 0 

AGGGAGATAA TTATGTTCGA T G AC AAAAT T TTATTAATTA CTGGGGGCAC AGGATCATTC 3540 

GGTAATGCTG TTATGAAACA GTTTTTAGAT TCTAATATTA AAGAAATTCG TATTTTTTCA 3600 

CGCGATGAGA AAAAACAAGA TGACATTCGA AAAAAATATA AT AAT T C AAA ATTAAAGTTC 3660 

TACATTGGTG ATGTGCGTGA TAGTCAAAGT GTAGAAACAG CAATGCGAGA TGTTGATTAC 3720 

GTATTCCATG CAGCAGCTTT AAAACAAGTG CCGTCATGTG AATTCTTTCC AGTTGAGGCA 37 8 0 

GTGAAGACAA AT AT TATTGG TACAGAAAAT GTCTTACAAA GTGCTATTCA TCAAAATGTT 38 4 0 

AAAAAAGTCA TATGTTTATC TACAGATAAG GCAGCGTATC CTATTAATGC TAGGGGTATT 3900 

TCAAAAGCAA TGATGGAAAA AGTATTCGTA GCCAAATCAA GAAATATTCG TAGTGAACAA 3 960 

ACGCTTATTT GTGGTACAAG ATACGGTAAT GTGATGGCTT CAAGAGATC AGTAATACCT 4 020 

TTGTTTATCG AC AAAAT C AA AGCTGGAGAA CCTTTAACGA T T AC AG AT C C TGATATGACA 4 080 

AGATTTTTAA TGAGCTTAGA AGATGCGGTA GAACTAGTTG TTCATGCATT TAAGCATGCA 414 0 

GAGACAGGAG ATATTATGGT TCAAAAAGCA CCAAGCTCAA CGGTAGGGGA TCTTGCGACC 4 2 00 

GCATTATTAG AATTGTTTGA AGCTGATAAT GCAATTGAAA TCATTGGTAC GCGACATGGA 4 2 60 

GAGAAAAAAG CAGAAACATT GTTGACGAGA GAAGAATACG C AC AAT G T GA AGATATGGGT 4 32 0 

GATTATTTTA GAGTGCCGGC AGACTCCAGA GATTTAAATT ATAGTAATTA TGTTGAAACC 4 38 0 

GGTAACGAAA AGATTACGCA ATCTTATG^A TATAACTCCG ATAATACACA TATTTTAACG 4440 

GTGGAAGAGA TAAAAGAAAA ACTTTTAACA CTAGAATATG TTAGAAACGA ATTGAATGAT 4 500 

TATAAAGCTT CAATGAGATA GGAGAGATTG ACGTTGAATA T T G T AAT T AC AGGAGCAAAA 4 5 60 

GGTTTTGTAG GAAAAAACTT GAAAGCAGAT TTAACTTCAA CGACAGATCA TCAATTTTC 4 620 

GAAGTACATC GACAAACTAA AGAGGAAGAA TTAGAGTCAG CATTGTTGAA AGCAGACTTT 4 680 
GTCGTGCATT TAGCGGGTGT TAATCGACCT GAACATGACA AAGAATTCAG CTTAGGAAAC 4740

GTGAGTTATT TAGATCATGT ACTTGATATA TTAACTAGAA AT AC G AAAAA GCCAGCGATA 4 8 00 

TTATTATCGT CTTCAATACA AGCAACACAA GATAATCCTT ATGGTGAGAG TAAGTTGCAA 4 8 60 

GGGGAACAGC TATTAAGAGA GTATGCCGAA GAGTATGGCA ATACGGTTTA TATTTATCGC 4 920 

TGGCCAAATT TATTCGGCAA GTGGTGTAAG CCGAATTATA AC T C AG T GAT AGCAACATTT 4 980 

TGTTACAAAA TTGCACGTAA CGAAGAGATT CAAGTAATG ATCGGAATGT TGAACTAACG 504 0 

CTAAACTACG TGGATGATAT CGTCGCTGAA ATAAAGCGTG CTATTGAAGG AACTCCAACG 5100 

ATTGAAAATG GTGTACCTAC AG T AC C AAAC GTATTTAAAG T G AC AT T GG G AGAAATTGTA 5160 

GATTTATTAT ACAAGTTCAA ACAGTCACGT CTCGATCGAA CATTGCCGAA ATTAGATAAC 5220 

TTGTTTGAAA AAGATTTGTA TAGTACGTAT TTAAGCTATC TACCTAGTAC aGACTTTAGT 5280 

TAyCCCTTAC TTATGAATGT GGATGATAGG GGTTCTTTTA CAGAATTTAT AAAAACACCG 5340 

GATCGTGGTC AAGTTTCTGT AAATATTTCT AAACCAGGTA TTACTAAAGG TAATCACTGG 54 00 

CATCATACTA AAAACG2AAA ATTTCTAGTC GTATCAGGTA AAGGGGTAAT TCGTTTTAGA 54 60 

CATGTTAATG ATGATGAAAT CATTGAATAT TATGTTTCTG GCGACAAATT AGAAGTTGTA 552 0 

G AC AT AC C AG TAGGATACAC ACATAATATT GAAAAT T TAG GC G AC AC AG A TATGGTAACT 55 8 0 

ATTATGTGGG TGAATGAAAT GTTTGATCCA AATCAGCCAG AACGTATTT CTTGGAGGTA 5 64 0 

TAGCGCATGG a AAAAC T G AA rTTAATGACA ATAGTTGGTA CAAGGCCTGA AATCATTCGT 57 00 

TTATCATCAA CGATTAAAGC ATGTGATCAA TATtTTAA 57 38 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9062 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

ATCATCAACA AGAATGATAT TTTTCCCATC TACTATATCT TTTACCGCAG ATAACTTCAC 60 

TCTCACACCT TGCTCACGTA ATTCTTGAGT TGGTTGAATA AATGTTCTTG CAACATATTG 120 

ATTTTTAACT AGTCCCATTT CATATGGCAA ACCTATTTCT TCAGCATAAC CACTCGCAGC 180 

TGATAGCGAT gAATTGGGTA CACCGATGAC CATATCAGCA TTTACAGGGC TTTCTTGGGC 24 0 

TAATTTTTTA CCAGAAGCTT TACGTACTGC ATGGACATTT TTACCAGCTA TTGTTGAGTC 300 

TGGTCTAGCA AAATAAATAT ATTCCATCGC AGAAATTGCA GTTGTCGTAT GATGTGTATA 3 60 
AGATTTAACT GTAATACCTT TATCGTTAAT CACGACATAT TCACCTGCAT GAATATCTTG 420

AACAAATTCT GCACCTAACA CATCTATTGC ACATGTTTCA CTTGCAEGA TGTATGTCCC 480 

ATCTTTCATT TTACCTACAA CAAGTGGTCT GATAGCATTT GGATCTACTG CGCCATATAA 54 0 

CGCATCTTTA GTTAAAATCG CAAATGTAAA ACCGCCTTTA ACTTTTCGCA AACTTTCTTT 600 

CAACGCTTCC TCAAAAGTAG GAGCTTTACT TCGACGTATC AAATGCATAA TGACTTCAGT 6 60 

AT CAG AAGAC GAATGGAAGA TAGCACCTTG TTTTTCTAAA TTCTGACGCA ATGATTTAGC 720 

ATTAATGAGG TTACCATTAT GACAAATACC TACACTCATA TCATAAAAGT GATACAGAAA 7 80 

CGGTTGAATA TTTTCAATAC CTTTATTACC TGAAGTAGCA TAACGGACGT GACCAATTGC 8 40 

ATGTTGATAT CCTTTTAATC GTTCCATT'E ATCATCTTTA ATCGCTTCAG TTAGTAAGCC 900 

TAATCCTCGC TCGCCTTTTA ATTCATTTTG AT C AGAAAC A ACTATACCTG cACCTTCTTG 9 60 

ACCACGATGT TGCAAACTAT GAAGTCCCAT ATAtGTTAGT TGCGCTGCtT CaGGATGATT 1020 

CCAAATACCA AACACGCCAC ATTCTTCGTT TAATCCTGAG TAGTTAAACA TTGQCAATT 108 0 

GCCCCtTCCC ATATTTGTTT AATATCTGAA ACATTTTCAC TAATCTCTGT aTATGGTGTT 1140 

GTTACCTTGr aATTATCACT ATCTGTTAAA AGTCCAATTT CTATTGCATT ATCAATATTT 1200 

AAAGTTTTAC CTGATTTAAC AGAAACAACA TATCGGCCTT GCGTCTCACT AAACAATTGT 12 60 

GCATTTGTTA TATCTATTGA AGATTTTAAT CCTAAACCGT AATGCGCACT TAGTTTAGCT 132 0 

AAGGTAATCA GTAAGCCACC TTTACCAACT GTTTGAACAT GTGATAATAG TCCTTCACGA 138 0 

ATAGCGGTCT TGATTGATTC ACCTTTTTCA ACTTCTGAAC TCAAATCTAA TGACTCAAAT 14 4 0 

TCATGATTAA CTTTGCCATA AATTAACTTT TCAAGTGAC TACCACCAAA GTCGTCCTTA 1500 

GTATCACCGA TTAAATATAA TTTATCTCCA ACTTGAGGTT CAAAATCATT TAAATAATTT 15 60 

ACATTTTCAA TCAAACCTAC CATTCCAACA ACTGGTGTTG GGAAAATAGA AGTACCTTTC 162 0 

GTTTCGTTAT ATAAAGATAC AT T ACCAGAA ACTACTGGTG TCTTAAGAAT GTCGCATGCT 168 0 

TCTGCCATAC CTTTCGTTGA ATCTATCAAC TGTTGATAGA TTTCTTTCTT TTCAGGAGAA 17 4 0 

CCATAATTTA AACAATCTGT CATTGCTAAT GGTGTTGCAC CCACGGCAAT TAAATTTCGA 18 00 

TAAGCTTCAG C T AC T AC CAT CTTTCCACCT TCATATGGAT TGTTATATAC ATAACGCGCT 18 60 

TCACCATCAA TTGTTGAAGC AATTGCCTTA TTTGTGCCTT CCACACGTAC TACCGATGCT 1920 

TGAAGTCCTG GCTTAATTAT CGTATTGGCA CCAACTTGTT GGTCGTATTG ATCATATAAA 1980 

TAGTGTTTAG ATGCTATAGT CGGATGCTTA AGTAATTTAA AGAAAGTATC TTTAACATCG 204 0 

ATGTGTGTAT AATCATTTTT AGAAGTATTA TAATCTTTTT CTCTCCTTC TAAAATATAT 2100 

ACAGGTGCTT CATCAGCTAG TGGTTCAACT GGAATGTCAG CATAAACTTC GTCATCATAT 2160 

GTTAAAACAA AACGATTTGT ATCTGTAACT TCACCTATAA CAGCACTATC CAATTCGTGC 2220 
TTATCAAATA AATCTAAGAA TTTTTGTTCA GTACCTTTTT CAACAACTAG TAACATACGT 2280

TCTTGAGTTT CTGAAAGCAT CATTTCATAA GGAGAAATAC CTGGCTCACG TGTTGGCACT 2340 

TGTTCTAATC TCAAATGTAA CCCACTACCA CCTTTTGCCG CCATTTCAGA CGATGAAGAT 2400 

GTTAAACCAG CAGCACCCAT ATCTTGAATA CCAACTAATT CATCAAATGT AATTGCTTCA 24 60 

AGTGTTGCTT CCATTAATTT TTTACCTACA AATGGATCAC CGATTTGTAC AGAAGGTCGT 2520 

TTACTTTCGC TTTCTTCCGT CAATTCTTCA GATGCAAAAG TAGCACCATG AATACCATCT 2580 

CGACCAGTTT TCAAACCAAC ATAAATGACC GAATTACCTA CACCTTTTGC TGTGCCTTTT 2 64 0 

TGAATCATGT CGTGATTGaT AACACCAACA CACATTGCAT TAACAAGTGG ATTGCCATCA 2700

TAACGTTCAT CAAATTCGAT TTCACCAGCA GTTGTTGGaA TACCAATGCA GTTACCATAA 2 7 60 

CCTCCGATAC CCTTTACAAC ACCTTTAAGT AATCTTTGGT TTTGTTTATT ATCTAATTCT 2820 

CCAAATCTAA GACTGTTTAA CAAATTAATA GGTCTAGCCC CAATAGAGAC AATGTCACGA 2880 

AT GAT T CC AC CAACGCCTGT AGCAGCCCCT TGATATGGTT CAATTGCTGA TGGATGATTG 2 94 0 

TGAGACTCTA CTTTAAATAC TACGGCTTGA TTATCACCTA TATCGACTAC CCCTGCACCT 3000 

TCACCAGGCC CCATAAGCAC ATGGTcACCT GACGTAGGAA ATTGCTTTAA AAACGGTTTA 3060 

GAATGTTTAT AAGAGCAATG TTCACTCCAC ATaACAGAAA AGATACCTGT TTCTGTAAAG 312 0 

TTAGGTTGTC TGCCTAAAAT ATCGCAAACT TTTTCATATT CTTGATCaCT TAATCCCATA 3180 

T C T T GAT AT A CTTTTTCAAG TTTAATTTCT TCAACGCTTG GTTCGATAAA T T TAG AC AT G 32 4 0 

TTGTTCCCTC CAACTTTTTA CCATCGCTTC AAATAATTTC ACACCACTAT CAGTACCAA 3300 

CAACGTTTCT AAAGCTCTTT CagGATGtGG CATCATGCCA CATACATTGC CTTTTTCGTT 3360

AACAATTCCT GCAATATCAT CATATGAACC GTTCGGATTA TTCACATATT TCAGAATAAT 3420

TTGATTGTTA GCTTTTAATT GTTGATATAT TTCATCAGTA CAATAATAAT GACCTTCACC 3480

GTGAGCTACA GGATATATAA CTTTTTCACC TTGTTCATAA AGATTTGTAA ATGCCGTTTG 3540

ATTATTCACT ATTTCTAACT CTTCATTTCT ACTAATAAAT AAATGTGAAT CGTTATGCAA 3600

TAATGCACCA GGTAATAAGC CTATTTCAGT TAAAATTTGA AACCCATTAC AAACACCTAA 3660

TACTGGCTTA CCTTCAGCTG CAAGACGTTT AACTTCCGAA ATAATCGGsG CTACACTAGC 3720

CATTGCCCCA GATCTTAAGT AATCCCCGAA TGAAAATCCA CCAGGAATAA GTACGCCATC 3780 

AAATCCACTT AGTGATGTTT CTCTATAATC TACATATTCC GCTTCAACAC CACTTTTAAT 3840 

AGCAGCATTA AACATGTCTC TATCACAATT CGAACCTGGA AAAACAAGAA CCGCAAATTT 3900

CATTTTATGC ATTCTCCTTT TCATCATCTA ACACTTTATA GCTATATTCT TCAATCACTG 3960

TATTTGCAAA CAATTTTTCA CTTAGAGTTG TAATAATGTT GTGTACCTTT TCATCACTAA 4020
CCTCATCCAC TGTCATATAT AATACTTTTC CTACACGAAT ATCATTCACT TGTGCATAAC 4080

CTAAGTCATG TACAGCTCGA GTAAGCGTTT GTCCTTGCGT ATCTAATACT TGTGGTTGTA 4140

ATGTGATATG TAGTTCAATT GTTTTCATTA TTTTAAATCC TCCAATTTGT TTAAAAATAT 4200

TTGATATGTT TCAATCAGTG ATCCAGTGTT ATTTCTATAT ACATCTTTAT CAAAGTTTGC 4260

ATTGGTAGCT TTATCCCAAA TTCGACATGT ATCTGGAGAT ATTTCAdCG CTAACAAAAT 4320

TTGACCAGTC TCAGTCTTTC CAAATTCGAT TTTAAAATCT ACTAATTTTA AATTCATAGC 4380

ATCCATTAAT TGTTTCAACA CATTATTAAT CTTTAATGCT TTGGATTTTA GTATTTCAAT 4440

ATCTTCATCT GATGCTATAT TGAGCAATTT AACATGGTCA TCCGTTATCA ACGGATCATT 4500

TAACGCATCA TTTTTATAGA AAAATTCTAC AAGTGGTTCT CTAAAAACTT CACCATTTTC 4560

AAAACCTAAA CGCTTTGTAA TAGATCCACT AGCAATATTA CGAACAACTA CTTCTAATGG 4620

AATTATTTTC ACAGGCTTAA CTAATTGTTC TGTTTCAGAT AATTGTTTAA TAAAGTGACT 4680

TTCTATTCCA TTTTCTTGTA AATATTTA7A TATAATAGAA GTAATTTGAT TATTTAATCG 4740 

CCCCTTACCT GCCATTGTGT CTTTCTTAGC CCCGTTTCCA GCAGTAACTT CATCTTTATA 4800

TTCAACTCTT AATTCATTTT CTTGATTTGT TGAGAAAATG CGcTTCGCTT TTCCTTCATA 4860

TAATAATGTC ATGCTTTAAT TACTCCCCTC AAATTTAGCG TACATATCTT GTTKGTTTG 4920

GTTTACATCA TTCGTTAGTA CAGTCATATG CCCCATTTTT CTGCTATCTT TACGCTCAGA 4980

CTTACCATAA ATATGTAAGT GCCACTCTGG ATGTTCATTA AATTCATTTT CCAATAAATC 5040

TAAATCTTTA CCTAGTAAGT TCATCATGAC TGCTGGCTTT AATAATTCAA TTGAATTTGG 5100 

TAATGATTGT CCGGTAACTG CTAAAATATG AGTATCAAAT TGTGAATAAT CACATGCTTC 5160

AATTGAATAA TGTCCGGAAT TGTGAGGCCT TGGTGCTATC TCGTTCACAT ACAATTGGTT 5220 

GTTACTATCT ATAAAAAATT CAACTGTAAA TGTTCCAATG AAATGAATCG ATTGGATAAT 5280

TTTATTAACT TGCTCTTTCG CCTCAGCTGT TTTATOATT CTCGCTGGAA CAATTGTTTT 5340

GAAAAGTATT TGATTTCTAT GCTCATTTTC TTGTAATGGG AAAAAAGTGA TTTGATTGTT 5400

GTTTCCTCTT GTAACAGTAA GAGATACTTC TTTCTTGATA TTCAAATATT TTTCAGCTAC 5460

GCATTCACTA GTTTCAATTA ATTTAAAACC TTCTTGTAAG TCTTTTTCGT TGTTAATTAA 5520 

AACTTGACCT TTGCCATCGT AGCCACCAAA TCTAGTTTTT ACAATAAAAG GATATCCTAA 5580 

TGTTTCAATT GCTTTGTCAA TATCTGTAGA TTCTTTTACT GAAATGAACG GGACAACTTT 5640

GGTACCAGCA CTTTTTAATG TTTCTTTTTC AGTTAAGCGA TCTTGTAATA ACTGTATAGC 5700

TTGGTAACCT TGCGGAATAT TGTACTTTTC ACATAATAGT TTTAATTGTT GGGCTGAAAT 5760

GTTTTCAAAT TCATAAGTAA TCACATCACA TTTTTGTCCT AATTGATTGA GTGCCTTTTC 5820 

ATCGTCATAC TTGGCTTGTA TAAATTCGTG TGCAACGTAT CTACATGGAC AATCTTCAGA 5880
AGGATCCAAT ACAACCACTT TATAACCCAT TTTTTGAGCT GATGTGCCA TCATCTTTCC 5940
AAGCTGACCA CCACCAATAA TGCCAATAGT CGCACCAAAC TTTAATTTAT TGAAGTTCAT 6000
TTTGCATGTC CTCCACTTTT TGAATTAACG AAGATTCATA CTGATTTAGT TTTTCAACTA 6060

AAGAAGGATT TTGAATACTT AACATTCTTG CTGCAAGTAT ACCTGCGTTT TTAGCACCTG 6120 
CTGCACCAAT TGCAGTCGTT GCAACCGGAA TACCTCCTGG CATTTGAACA ATTGATAATA 6180

AAGAATCTAT ACCCTTTAAA CTTTTTGTTT CAATCGGCAC TCCAATAACT GGTAGCGTCG 6240

TTAATGATGC AACCATACCT GGTAAATGTG CCGCACCGCC AGCGCCTGCA ATGATAATGT 6300 

TTATACCTCT TTCTCTCGCT TCAGi&GCAA ATTGAACCAT CATTTTTGGC GTACGATGTG 6360
CGGATACTAC TTGTTTTTCG TACGGAATTT CAAAATAATC CAACATGTTA CAACTCTCTT 6420

GCATAATTTT CCAATCGGAA GAACTGCCCA TAATGACTGC TACTTTCACT TTGTACACCC 6480

TTTCAAAAGT TTGAATTGTG AATTACTTTA GTTGTATATT ATAGATATAG CATAACAAGC 6540
AATTTCTGCT TTTTCAATCA AAAATCGAAC TTTATTTTGA TTTTTTATTT GAATTTACGT 6600 
CTTTTGCTAT GTAAATTAGT TTTATAAACT AACAAAGTTA GGATATTGAC AATAGGAGGA 6660 
GAAGTTTTTA TGGTTGCTAA AATTTTAGAT GGTAAACAAA TTGCCAAAGA CTACAGACAG 6720

GGGTTACAAG ATCAAGTTGA AGCGCTAAAA GAAAAGGGTT TTACACCTAA ATTATCCGTT 6780
ATATTAGTTG GTAATGATGG CGCTAGTCAA AGTTATGTTA GATCAAAAAA GAAAGCAGCT 6840

GAAAAAATTG GTATGATTTc AGAAATCGTA CATTTGGAAG AAACAGCTAC TGAAGAAGAA 6900 

GTATTAAACG AACTAAATAG ACTAAATAAT G^GATTCTG TAAGTGGTAT TTTGGTACAA 6960 

GTACCATTAC CAAAACAAGT TAGCGAACAG AAAATATTAG AAGCAATCAA TCCTGAAAAA 7020

GATGTGGACG GTTTTCATCC AATAAATATA GGGAAATTAT ATATCGATGA ACAAACTTTT 7080

GTACCTTGCA CACCGCTCGG CATCATGGAA ATATTAAAAC ATGCTGATAT TGATTTASA 7140

GGTAAAAATG CAGTTGTAAT TGGACGAAGT CATATTGTCG GACAACCAGT TTCTAAGTTA 7200 

CTACTTCAAA AAAATGCATC AGTAACAATC TTACATTCTC GTTCAAAAGA TATGGCATCA 7260

TATTTAAAAG ATGCTGATGT CATTGTCAGT GCAGTTGGTA AGCCTGGTTT AGTAACAAAA 7320

GATGTGGTCA AAGA^GGAGC AGTAATTATC GATGTTGGCA ATACGCCAGA TGAAAATGGC 7380

AAATTAAAAG GTGACGTTGA TTATGATGCG GTTAAAGAAA TTGCTGGAGC TATTACACCA 7440

GTTCCTGGTG GCGTTGGTCC ATTAACAATT ACTATGGTAT TAAATAATAC TTTGCTTGCA 7500 

GAAAAAATGC GTCGAGGTAT TGATTCGTAA AGAGCCTGAG ACATAAATCA ATGTTCTATG 7560

CTCTACAAAG TTATAATGGC AGTAGTTGAC TGAACGAAAA TTCGCTTGTA ACAAGCTTTT 7620

TTCAATTCTA GTCAACCTTG CCGGGGTGGG ACGACGAAAT AAATTTTACG AAAATATCAT 7680
TTCTGTCCCA CTCCCTAATA ACTGAGTTTT AATGAAGTCT TTTAACCCAC ATTAAATATT 7740

ATTTTGCAAT TGCAATGAAT AACAAGAAAA ATCTGGGACA TTAATCGATC AAATGCTCCC 7800

TTCAAAGTAG ACATTGAATA AATGAAGGCT TTGAAGGGAG CATTTCACTT TGTACTTGGC 7860

TCAACAATTT TATATAGACA GTAGTTAATT GAATGAAAAT AAGCTTGTAA CAAGTTTTCA 7920

ATTCTAGTCA GGGGCCCCAA CAAAGAAGCT GACGAAAAGT CAGCTTACAA TAATGTGCAA 7980

GTTGGGGATG GGCCCCAACA CAGAAGCTGT GACTATGATA AAGTACTACT ACATAGTTAA 8040

TCATTAGTGG TTCTTTATCA TTTTCGCCTC CCTTTTCTTA TTGTTTTGAT ACACAAAAAT 8100 

TTAAGTTCAA ACTGTCGAAT AAAGTTATAT TTGATTTCAA ATTATCCTA AATTATTAAT 8160 

TkTACAATTG TGGCAGATTT TCAAAATAAT AATTATTTCC TCATTATTTA TAAATTTATA 8220 

TTTAAATTTC ATTCTTTATA GGGTAAGATT AGGACTATAG TATGATGTGT ArATAATATA 8280

AATTAAGGTA TAGTAAAGCT AACTCAGAAA TGACTTATCA TTCGGAGGTT ACATTATGAA 8340

TAAACTATTA CAGTCATTAT CAGCCCTCGG TGTTTCTGCT ACACTAGTAA CACCAAATTT 8400

AAATGCAGAT GCAACGACGA ATACTACACC ACAAATTAAA GGCGCTAATG ATATCGTTAT 8460

TAAGAAAGGT CAAGATTATA ACCTTCTAAA CGGCATAAGT GCATTTGATA AAGAAGATGG 8520

AGATTTAACC GATAAAATTA AAGTCGATGS CCAAATTGAT ACATCTAAAT CTGGTAAATA 8580

TCAAATTAAA TATCATGTCA CTGATTCAGA TGGTGCAATT AAAATTTCCA CTAGGTATAT 8640

TGAGGTTAAA TAGCCCTCAT CACTATACTG CAAATAAAAT GGTAGCAAAC GAACATGTTT 8700

TGCTACCATT TTATTTGTTA TTCTAACTTC ATCTGCAACT TTAACCCAAA TATGTATTT 8760

TTTCTGTATA CCAAAGGACT ACCTATCAAA TTATTAAAAC TTAACTGCTC TTTTTAAAAA 8820

AATGTTTTGA TTTTGAACAA ACAAATTTCC ACTTTTCATT GTTTAACGAT AAATTACTTT 8880

TGGCAAATTC CTTATTAAAA TGTTTGCGCT TCCTTTCAAT CAACTAGCCA TCATTTTCAA 8940

TTTATTAGAC AATTTCAAAC TTTTTTTATT TTCATTCAAT TAACCTTTAA TTGAAAGCTA 9000

TTCTCAACTT TCCTTTTAAA TATGAAGCAA TTTTTTCAAA AACGCTATTA GTCACAAAAT 9060 

GT 9062 
(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
AAATATTTTT TCAAAACTAT GTGAAAATGG aCCATGTCtA aATCATGTAA TAATGCAGyA 60
CATAATGCCA ACGGTCTmTC TTTATTGTCC CATGCATCAT GACCAATAAA TGACTCATCA 120

ATTAATCGTC TAACTATTTC ATACACACCT AAAGAATGTC CAAAGCGACT ATGTTCTGCT 180 

GTGTGAAAAG ATAGGTACAG TGTTCCTAGT TGTCTAATTC GACGTAACCT TTGGAATTC 240

TTAGTTTTAA TTAAATCCCa TATcAATTGA TCTTCAACAT GAATATATCG ATGTATCGGG 300

TCTTTAAAAA CTTTTTCTTC TACTAATTTT AAATCTACAT ATGCGTTAGT CATTATTCCC 360

CTCCTTTTCG TTTAATATAA TATTTAATTT ACTTAAAATG CTTTGTACAT AAGTGCTAAG 420

TCTAACTTTT CGCCATACAT TTCTGGCTCA TAAGAGCGTA AGATTGTAAA ACCTTGCTCT 480

TTATAGTAAG CTACTGCTTC TTCATTTTTA TTATCTACTT CTAAGTAAAC ACCTTCAAAT 540

TTATCTTCAA AACGTGATAA TCCTTCATTT AACAATGCTG TACCATAACC TGTATGTTGC 600 

GATTCTGGTT TAACATAATG AGCTGATAAA TATAATTCTT CACCGTAAAT AAAGTTAGCA 660

AAGCCAACGA TGTCATTACC TTCTTCAACG ACTAAGAATA ATTGTTCTTG AAGTCTTTTC 720 

TTTAAATGAT GTTCATTATA TGAAGCTtCT AACAAGTGAT TAACTGTTGT CGCAGCGTAT 780

ATATTTAAGT ATGTATTAAA CCAAGCTTTA GTTGCGACAT CTCTAATTTG AACAACATCT 840

TTTTCAGTTG CTTGTCTTAC CTTGAACATG ACTTTCTCCC CTTATTAACA AGTTTTAATA 900 

ACGGCATTAT ACCACAACTT GCTCAATACT TAATAAACAA TGATTGTCTA TTCAATTTAT 960

ATATtTATAT TTTCCGTTAA AATTAAAAAT AAAAAATAAC GAAGCAAAAA AtCACTTCGT 1020 

TTAGTATGAG GTATGTCTTA TTGCAATATA CTATTCCACT CAGTTGCACG TGCTAAGGCA 1080

TAGTTGTCTT TCATGATGTC ACCAGGCTTT TCAGCAGTTC CAATAATATA ACCATTTAAA 1140

GTGGCACCTA rAAAGTCTAA ACTATATTTC ATTTGCGTAA TTGCTGGTTC GCTTTTATTT 1200 

TTGGACAATC TCCACCAACT AAAATAACTC TAAAATCCTT TTCGGCC5RT TGTGCCTTAA 1260 

AATTAGGATA TCGTTTATCT TGTAATGTTT CTGACCAATG TTCGATAAAT GCTTTCAATG 1320 

GTGCTGAAAT GCTATACCAA TACACTGGTG ATGCAAAAAT AATTGTATCA CTAGCCAATA 1380 

TTTTATCTAG AATCGGCAAA TAGTCATCGT CATATGAAGT AATAGTCTCT GCTGTATGTC 1440

TCACGTCACG TATCGGTTTA AACTGATGTT GTGTCACGTC AATCCATTGA TACTCTAAAT 1500 

CTTGCAAAGC GAATTTTGTT AATTGTGCAG TATTACCGTT TGGTCTACTC CCACCAAACA 1560 

AAACAGTAAT CATTTTAGCC TAACCTCACT TTTGATTAAT AAATATCTGT GTTTTTCGTT 1620

ACCTAATTAT ACTATCATAA GCTTTGCCTA CCGAATAGTA AAACGCTTAC AACTTTTATA 1680 

TAAATTTGAC GAAATTTCGT CATGCCTTAT ATAACGTCGT TTGTGATACG GGGCTAATTC 1740

ATGATGAAAT TAGATACATA TATCACCATT AAATACAATT CATTTAGTCT TCAATCGGAA 1800

ACAGTTCATC GATATATTGA ATCTCATCAT CTGATAAAAC GATATCTGCA GCTTAATAT 1860
TTTCAACGAC TTGTTCTGCA CGTTTTGCAC CAGGAATAAT CAGATCGATA GCTGGTCTCG 1920

TTAAATAAAA TGCTAATACA ATGTTCGCAA TTGAAGTTTG ATGTGCTGCA GCTATGCTTT 1980 

CCAAAGCTTT TACGCGACGC ACATTTTCTT CAAATACACC TGGTTTAAAA TCACGACGTG 2040

TAGTACGATG GTCACTAAAa TTTAGTGTTC TCATCATATT TTCCAGCTAA AATACCGGAT 2100

GCTAATGGGA AATATGGAAT AAATGTGATT TGGTGATCAA CACAATATTG TAATACTGCC 2160 

TCATTTTCGC GATGCAATAA ATTATATTCT AACTGTACAA CATCAACGTA ACCATCTTTA 2220

TTTGCTTCTT TAAGTTGATC TAATGTGAAA TTTGATiEAC CAATTGCTTT AATCTTCCCT 2280 

TGTTCCTTAA GCTCTTGTAA TGCTGCAACT GCTTGATCTT TCGGAGTGTT GTTATCCGGA 2340

AAATGAATAT AATATAAATC GATATAATCA GTTTGTAGAC GTTTCAAACT ATTCTCAACT 2400

TGTTGTTTTA AATATTCCGG TTGATTGTTC TGATGTACTT CTTGATTTTC ATCAAATTCA 2460

TGAGACCCTT TCGTAGCAAT TTTAATTTGC TCTCGCGGAT ATTCTTTAAC AACTTCTCCA 2520 

ACCAATTCTT CTGATCGTTC TGGCCCATAA ATATATGCCG TATCTAATAA ATTAATACCA 2580

TGATTAATGG CTTGACGAAC AACATCTTTT CCTTGTTCTT CATCTAAGTT CGGATATAAA 2640

TTATGCCCAa CCTAtGCGTT CGTCCCAAGT GCGATTGGAA ACACTTCAAC ATCAGATTTA 2700

CCTAAGTTTA CAAATTGCTn CATTAGACCC AGCnCCTT 2738
(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

GATTAGATGA TATTTAACGA AAATTAaGrT GmAATACTtG AATGTArGAa GTCTGATGTC 60

GAAAATAGCT ATTAAAATAG AGTAGACGTA ATGtAAATGA AGCACCTAA AATAGAAAAA 120

TTTCAAAAAT AGCGTAATTA TTATAATAAA TAGACTGCCA ATAAAATGCA ATTTTTCACT 180 

TATAACATTC TTCAAAAAAT AATAGCAAAA TTATGTAAAA AATATCTTGT CATGGCAAGA 240

TTGGCTGTGC TATAATCTAT CTTGTGCTTA AGAACGGCTC CTTGGTCAAG CGGTTAAGAC 300

ACCGCCCTTT CACGGCGGTA ACACGGGTTC GAGTCCCGTA GGAGTCACCA TTTTTTAGGT 360

CTCGTAGTGT AGCGGTTAAC ACGCCTGCCT GTCACGCAGG AGATCGCGGG TTCGATTCCC 420

GTCGAGACCG TACAAATGCC TATCCAAGAG GATAGGCATT TTTTTGCGTT TAATATTATA 480

TTAATAAAAG ATATATGGAC GAATGATAAT CATATTGATT TATCTGTTCG TCCATTTTCT 540

TTAAAATGTA TGAACCTCAA GTAACTTAGT GGTTGGATAT GAAAGATAAA CGTAGACAAT 600
AAAATCTTTA TTAGACGTAC AAACATATGC TACTGTCAAC ATATTTCTTC GTTGTGATAT 660

GCCACCAGTC CTCCATAACA TCAATTGTTA AAGTAACGAA TAACGAATA TGATATTTAT 720 

TTTCTGAGCA ATGACGTGCA cTAGAAGTTG CCaTTATCCT AATTTTATTA TTGGAATAGA 780

GACCTCATCA TTGTGTTAAA TATCATTGTC ACAATCCGCC GTGAGAAACT AATAAAAAAT 840

AGTAATATAT AAGTTTATAT TGGAAAATAG AATTAATAGC TTATAAATGG TAAATTATAT 900

AATAGGTTAC TATACGTTAT AAGACGGAAA ATGCGCACAA TAACAAAAAT AGTAAGCGAC 960 

ATCCTGTGAT TTTTTACACA AACATAAACG ATAAAGAACA AAAAATGATA AAATAATATT 1020

AATGATTTAA GAAAAGAGGT TTATGCAAAT GGCTAGAAAA GTTGTTGTAG TTGATGATGA 1080

AAAACCGATT GCTGATATTT TAGAATTTAA CTTAAAAAAA GAAGGATACG ATGTGTACTG 1140 

TGCATACGAT GGTAATGATG CAGTCGACTT AATTTATGAA GAAGAACCAG ACATCGTATT 1200 

ACTAGATATC ATGTTACCTG GTCGTGATGG TATGGAAGTA TGTCGTGAAG TGCGCAAAAA 1260

ATACGAAATG CCAATAATAA TGCTTACTGC TAAAGATTCA GAAATTGATA AAGTGTTGG 1320

TTTAGAACTA GGTGCAGATG ACTATGTAAC GAAACCGTTT AGTACGCGTG AATTAATCGC 1380 

ACGTGTGAAA GCGAACTTAC GTCGTCATTA CTCACAACCA GCACAAGACA CTGGAAATGT 1440

AACGAATGAA ATCACAATTA AAGATATTGT GATTTATCCA GACGCATATT CTATTAAAAA 1500

ACGTGGCGAA GATATTGAAT TAACACATCG TGAATTTGAA TTGTTCCATT ATTTATCAAA 1560

ACATATGGGA CAAGTAATGA CACGTGAACA TTTATTACAA ACAGTATGGG GCTATGATTA 1620

CTTTGGCGAT GTACGTACGG TCGATGTAAC GATTCGTCGT TTACGTGAAA AGATTGAAGA 1680

TGATCCGTCA CATCCTGAAT ATATTGTGAC GCGTAGAGC GTTGGATATT TCCTCCAACA 1740

ACATGAGTAG AGGTCGAAAC GAATGAAGTG GCTAAAACAA CTACAATCCC TTCATACTAA 1800 

ATTTGTAATT GTTTATGTAT TACTGATTAT CATTGGTATG CAAATTATCG GGTTATATTT 1860

TACAAATAAC CTTGAAAAAG AGCTGCTTGA TAATTTTAAG AAGAATATTA CGCAGTACGC 1920 

GAAACAATTA GAAATTAGTA TTGAAAAAGT ATATGACGAA AAGGGCTCCG TAAATGCACA 1980

AAAAGATATT CAAAATTTAT TAAGTGAGTA TGCCAACCGT CAAGAAATTG GAGAAATTCG 2040

TTTTATAGAT AAAGACCAAA TTATTATTGC GACGACGAAG CAGTCTAACC GTAGTCTAAT 2100

CAATCAAAAA GCGAATGATA GTTCTGTCCA AAAAGCACTA TCACTAGGAC AATCAAACGA 2160

TCATTTAATT TTAAAAGATT ATGGCGGTGG TAAGGACCGT GTCTGGGTAT ATAATATCCC 2220

AGTTAAAGTC GATAAAAAGG TAATTGGTAA TATTTATATC GAATCAAAAA TTAATGACGT 2280

TTATAACCAA TTAAATAATA TAAATCAAAT ATTCATTGTT GGTAiSGCTA TTTCATTATT 2340

AATgCACAGT CATCCTAGGA TTCTTTATAG CGCGAACGAT TACCAAACCA ATCACCGATA 2400
TGCGTAACCA GACGGTCGAA ATGTCCaGAG GTAACTATAC GCAACGTGTG AAGATTTATG 2460

GTAATGATGA AATTGGCGAA TTAGCTTTAG CATTTAATAA CTTGTCTAAA CGTGTACAAG 2520

AAGCGCAGGC TAATACTGAA AGTGAGAAAC GTAGACTGGA CTCAGTTATC ACCCATATGA 2580

GTGATGGTAT TATTGCAACA GACCGCCGTG GACGTATTCG TATCGTCAAT GATATGGCAC 2640

TCAAGATGCT TGGTATGGCG AAAGAAGACA TCATCGGATA TTACATGTTA AGTGTATTAA 2700

GTCTTGAAGA TGAATTTAAA CTGGAAGlAA TTCAAGAGAA TAATGATAGT TTCTTATTAG 2760

ATTTAAATGA AGAAGAAGGT CTAATCGCAC GTGTTAACTT TAGTACGATT GTGCAGGAAA 2820

CAGGATTTGT AACTGGTTAT ATCGCTGTGT TACATGACGT AACTGAACAA CAACAAGTTG 2880

AACGTGAGCG TCGTGAATTT GTTGCCAATG TATCACATGA GTTACGTACA CCTTAACTT 2940

CTATGAATAG TTACATTGAA GCACTTGAAG AAGGTGCATG GAAAGATGAG GAACTTGCGC 3000

CACAATTTTT ATCTGTTACC CGTGAAGAAA CAGAACGAAT GATTCGACTG GTCAATGACT 3060

TGCTACAGTT ATCTAAAATG GATAATGAGT CTGATCAAAT CAACAAAGAA ATTATCGACT 3120

TTAACATGTT CATTAATAAA ATTATTAATC GACATGAAAT GTCTGCGAAA GATACAACAT 3180

TTATTCGAGA TATTCCGAAA AAGACGATTT TCACAGAATT TGATCCTGAT AAAATGACGC 3240

AAGTATTTGA TAATGTCATT ACAAATGCGA TGAAATATTC TAGAGGCGAT AAACGTGTCG 3300 

AGTTCCACGT GAAACAAAAT CCACTTTATA AT C ©AT G AC GATTCGTATT AAAGATAATG 3360 

GCATTGGTAT TCCTATCAAT AAAGTCGATA AGATATTCGA CCGATTCTAT CGTGTAGATA 3420

AGGCACGTAC GCGTAAAATG GGTGGTACTG GATTAGGACT AGCCATTTCG AAAGAGATTG 3480

TGGAAGCGCA CAATGGTCGT ATTTGGGCAA ACAGTGTAGA AGGTCAAGGT ACATCTATCT 3540

TTATCACACT TCCATGTGAA GTCATTGAAG ACGGTGATTG GGATGAATAA TAAGGAGCAT 3600 

ATTAAATCTG TCATTTTAGC ACTACTCGTC TTGATGAGTG TCGTATTGAC ATATATGGTA 3660

TGGAACTTTT CTCCTGATAT TGCAAATGTC GACAATACAG ATAGTAAGAA GAGTGAAACG 3720

rAACCTTTAA CGACACCTAT GACAGCCAAA ATGGATACAA CTATTACGCC ATTTCAGATT 3780

ATTCATTCGA AAAATGATCA TCCAGAAGGA ACGATTGCGA CGGTATCTAA TGTGAATAAA 3840

CTGACGAAAC CTTTGAAAAA TAAAGAAGTG AAGTCCGTGG AACATGTTCG TCGTGATCAT 3900 

AACTTGATGA TTCCTGATTT GAACAGTGAT TTTATATTAT tCGATTTTAC GTATGATTTA 3960 

CCGTTATCAA CATATCTTGG TCAAGTACTG AACATGAATG CGAAAGTACC AAATCATTTC 4020

AATTTCAATC GTTTGGTCAT AGATCATGAT GCTGATGATA ATATCGTGCT TTATGCTATA 4080

AGCAAAGATC GCCACGATTA CGTAAAATTA ACAACTACAA CGAAAAATGA TCATTTTTTA 4140

GATGCATTAG CAGCAGTGAA AAAAGATATG CAACCATACA CAGATATCAT CACAAACAAA 4200

GATACAATTG ATCGTACGAC GCATGTTTTT GCACCAAGTA AACCTGAAAA GTTAAAAACA 4260
TATCGCATGG TATTTAACAC GATTAGTGTT GAGAAAATGA ATGCTATACT ATTTGACGAT 4320

TCAACCATCG TTCGTAGTTC AAArAGTGGT GTTACAACCT ACAACAATAA TACAGGTGTC 4380

GCAAACTATA ACGATAAAAA TGAAAAATAT CATTATAAAA ACCTGTCCGA AGATGAAGCG 4440

AGTTCCAGCA AAATGGAAGA AACGATTCCA GGAACCTTTG ATTTTATTAA TGGTCATGGT 4500

GGTTTCTTAA ACGAAGACTT TAGATTGTTT AGTACGAATA ATCAGTCAG CGAGTTAACA 4560

TATCaACGTT TCCtTAATGG TTATCCAACG TTTAATAAAG AAGGTTCTAA TCAAATTCAA 4620

GTCACTTGGG GTGAAAAAGG CGTCTTTGAC TATCGTCGTT CGTTATTACG CACCGACGTT 4680

GTTTTAAATA GTGAGGATAA TAAATCGTTG CCGAAATTAG AGTCTGTACG TTCAAGCTTA 4740

GCGAACAATA GTGATATTAA TTTTGAAAAA GTAACAAACA TCGCTATCGG TTACGAAATG 4800

CAGGATAATT CAGATCATAA TCACATTGAA GTGCAGATTA ACAGTGAACT CGTACCGCGT 4860

TGGTATGTAG AATATGATGG CGAATGGTAT GTTTATAACG ATGGGaGGCT TGaATAAATG 4920

AACTGGaAAC TGACAAAGAC ACTTTTCATT TTCGTGTTTA TTCTTGTCAA CATCGTGTTA 4980

GTATCGATTT ATGTTAATAA AGTCAATCGC TCACACATTA ATGAAGTCGA GAGTAACAAT 5040

GAAGTTAATT TTCAGCAAGA AGAAATTAAA GTACCGACTA GTATATTGAA TAAATCAGTT 5100

AAAGGTATAA AATTAGAGCA AATTACAGGG CGATCAAAAG ACTTTAGTTC TAAAGEAAA 5160

GGCGATTCGG ATTTGACCAC ATCAGATGGT GGAAAATTAT TGAATGCGAA CATTAGTCAA 5220

TCGGTAAAGG TCAGTGACAA TAACTTAAAA GATTTGAAAG ATTATGTTAA CAAGCGCGTA 5280

TTTAAAGGTG CTGAATATCA ATTAAGCGAG ATTAGTTCAG ATTCTGTAAA ATATGAACAA 5340

ACGTATGATG ATTTTCCGAT TTTAAATAAC AGTAAAGCGA TGTTAAACTT TAATATAGAA 5400

GATAACAAAG CGACTAGTTA TAAACAATCA ATGATGGATG ACATTAAGCC CACAGATGGT 5460

GCAGATAAGA AGCATCAAGT GATTGGTGTG AGAAAAGCAA TCGAGGCATT ATATTATAAT 5520

CGTTACTTGA AAAAAGGTGA TGAAGTCATT AATGCTA&C TCGGTTACTA CTCAGTCGTG 5580 

AATGAAACGA ATGTTCAATT GTTACAACCA AACTGGGAAA TTAAAGTGAA GCATGACGGT 5640

AAGGATAAAA CGAATACTTA CTATGTCGAA GCGACAAATA ATAACCCTAA AATTATTAAT 5700

CATTAATATG AATCGTAATA AGCTAGCATT GCAAGCTCAT CATATGTGAG AAGCGGTGCT 5760

AGCTTTTTTG CTGGTACGGT TTATTATGGC TGATGTTTTT GCGTCTCCAA CGTGCGCATT 5820

TATTCATATT TTAAGTAGAA CCGCATTGTA AAATTAGTGT AACTGTTATT TTAAAAACTT 5880

TAGTATTTGT CTAATCATTG TTATAATAAT TAAGAAATTC ATTGCACGTG ATTATCAAAA 5940

TTTAAATATA AGAAACCGGT CGATGAACTA AAGTTACATA ATAGGAAAGG TATACAAAAC 6000

AGCTAATATA CTGATAGTTT CTGTAGGGAA AATCGTATAT TTGCACTGAT GTATATTGCA 6060
GTCATATAGA GAGATTGACT GTTTAAAGAG AAAGGATGAG CCGCTTGATA CGCATGAGTG 6120

TATTAGCAAG TGGTAGTACA GGTAACGCCA CTTTTGTAGA AAATSAAAA GGTAGTCTAT 6180 

TAGTTGATGT TGGTTTGACT GGAAAGAAAA TGGAAGAATT GTTTAGTCAA ATTGACCGTA 6240

ATATTCAAGA TTTAAATGGT ATTTTAGTAA CCCATGAACA TATTGATCAT ATTAAAGGAT 6300 

TAGGTGTTTT GGCGCGTAAA TATCAATTGC CAATTTATGC GAATGAAAAA ACTTGGCAGG 6360 

CAATTGAAAA GAAAGATAGT CGCATCCCTA TGGATCAGAA ATTCATTTTT AATCCTTATG 6420

AAACAAAATC TATTGCAGGT TTCGATGTTG AATCGTTTAA CGTGTCACAT GATGCAATAG 6480

ATCCGCAATT TTATATTTTC CATAATAACT ATAAGAAGTT TACGATTTTA ACGGATACGG 6540

GTTACGTGTC TGATCGTATG AAAGGTATGA TACGTGGCAG CGATGCGTTT ATTTTTGAGA 6600 

GTAATCATGA CGTCGATATG TTGAGAATGT GTCGTTATCC ATGGAAGACG AAACAACGTA 6660

TTTTAGGCGA TATGGGTCAT GTATCTAATG AGGATGCGGC TCATGCAATG ACAGACGTGA 6720

TTACAGGTAA CACGAAACGT ATTTACCTAT CGCATTTATC ACAAGACAAT AfiATGAAAG 6780

ATTTGGCGCG TATGAGTGTT GGCCAAGTAT TGAACGAACA CGATATTGAT ACGGAAAAAG 6840

AAGTATTGCT ATGTGATACG GATAAAGCTA TTCCAACGCC AATATATACA ATATAAATGA 6900

GAGTCATCCG ATAAAGTTCC GCATTGCTGT GAGACGACTT TATCGGGTGC TTTTTTATGT 6960

TGTTGGTGGG AAATGGCTGT TGTTGAGTTG AATCGGCTTG ATTGAAATGT GTAAAATAAT 7020

TCGATATTAA ATGTAATTTA TAAATAATTT ACATAAAATC AATCATTTTA ATATAAGGAT 7080 

TATGATAATA TATTGGTGTA TGACAGTTAA TGGAGGGAAC GAAATGAAAG CTTTATTACT 7140

TAAAACAAGT GTATGGCTCG TTTTGCTTTT TAGTSTAATG GGATTATGGC AAGTCTCGAA 7200

CGCGGCTGAG CAGCATACAC CAATGAAAGC ACATGCAGTA ACAACGATAG ACAAAGCAAC 7260

AACAGATAAG CAACAAGTAC CGCCAACAAA GGAAGCGGCT CATCATTCTG GCAAAGAAGC 7320

GGCAACCAAC GTATCAGCAT CAGCGCAGGG AACAGCTGAT GATACAAACA GCAAAGTAAC 7380

ATCCAACGCA CCATCTAACA AACCATCTAC AGTAGTTTCA ACAAAAGTAA ACGAAACACG 7440 

CGACGTAGAT ACACAACAAG CCTCAACACA AAAACCAACT CACACAGCAA CGTTCAAATT 7500

ATCAAATGCT AAAACAGCAT CACTTTCACC ACGAATGTTT GCTGCTAATG CACCACAAAC 7560

AACAACACAT AAAATATTAC ATACAAATGA TATCCATGGC CGACTAGCCG AAGAAAAAGG 7620

GCGTGTCATC GGTATGGCTA AATTAAAAAC AGTAAAAGAA CAAGAAAAGC CTGATTTAAT 7680

GTTAGACGCA GGAGACGCCT TCCAAGGTTT ACCACTTTCA AACCAGTCTA AAGGTGAAGA 7740

AATGGCTAAA GCAATGAATG CAGTAGGTTA TGATGCTATG GAGTCGGTA ACCATGAATT 7800

TGACTTTGGA TACGATCAGT TGAAAAAGTT AGAGGGTATG TTAGACTTCC CGATGCTAAG 7860

TAcTAACGTT TATAAAGATG GAAAACGCGC GTTTAAGCCT TCAACGATTG TAACAAAAAA 7920
TGGTATTCGT TATGGAATTA TTGGTGTAAC GACACCAGAA ACAAAGACGA AAACAAGACC 7980

TGAAGGCATT AAAGGCGTTG AATTTAGAGA TCCATTACAA AGTGTGACAG CGGAAATGAT 8040

GCGTATTTAT AAAGACGTAG ATACATTTGT TGTTATATCA CATTTAGGAA TTGATCCTTC 8100 

AACACAAGAA ACATGGCGTG GTGATTACTT AGTGAAACAA TTAAGTCAAA ATCCACAATT 8160 

GAAGAAACGT ATTACAGTTA TTGVTGGTCA TTCACATACA GTACTTCAAA ATGGTCAAAT 8220

TTATAACAAT GATGCATTGG CACAAACAGG TACAGCACTT GCGAATATCG GTAAGATTAC 8280

ATTTAATTAT CGCAATGGAG AGGTATCGAA TATTAAACCG TCATTGATTA ATGTTAAAGA 8340

CGTTGAAAAT GTAACACCGA ACAAAGCATT AGCTGAACAA ATTAATCAfi CTGATCAAAC 8400

ATTTAGAGCA CAAACTGCAG AGGTAATTAT TCCAAACAAT ACCATTGATT TCAAAGGAGA 8460

AAGAGATGAC GTTAGAACGC GTGAAACAAA TTTAGGAAAC GCGATTGCAG ATGCTATGGA 8520

AGCGTATGGC GTTAAGAATT TCTCTAAAAA GACTGACTTT GCCGTGACAA ATGGTGGAGG 8580

TATTCGTGCC TCTATCGCAA AAGGTAAGGT GACACGCTAT GATTTAATCT CAGTATTACC 8640

ATTTGGAAAT ACGATTGCGC AAATTGATGT AAAAGGTTCA GACGTCTGGA CGGCTTTCGA 8700

ACATAGTTTA GGCGCACCAA CAACACAAAA GGACGGTAAG ACAGTGTTAA CAGCGAATGG 8760

CGGTTTACTA CATATCTCTG ATTCAATCCG TGTTTACTAT GATATAAATA AACCGTCTGG 8820 

CAAACGAATT AATGCTATTC AAATTTTAAA TAAAGAGACA GGTAAGTTTG AAAATATTGA 8880

TTTAAAACGT GTATATCACG TAACGATGAA TGACTTCACA GCATCAGGTG GCGACGGATA 8940

TAGTATGTTC GGTGGTCCTA GAGAAGAAGG TATTTCATTA GATCAAGTAC TAGCASTTA 9000

TTTAAAAACA CCTAACTTAG CTAAGTATGA TACGACAGAA CCACAACGTA TGTTATTAGG 9060

TAAACCAGCA GTAAGTGAAC AACCAGCTAA AGGACAACAA GGTAGCAAAG GTAGTAAGTC 9120

TGGTAAAGAT ACACAACCAA TTGGTGACGA CAAAGTGATG GATCCAGCGA AAAAACCAGC 9180

TCCAGGTAAA GTTGTATTGT TgtAGCGCAT AGAGGAACTG TTAGTAGCGG TACAGAAGGT 9240

TCTGGTCGCA CAATAGAAGG AGCTACTGTA TCAAGCAAGA GTGGGAAACA ATTGGCTAGA 9300

ATGTCAGTGC CTAAAGGTAG CGCGCATGAG AAACAGTTAT TTCATAATCA ACAGTCATTG 9360

ACGTAGCTAA GTAATGATAA ATAATCATAA ATAAAAT^C AGATATTGAC AAAAAATAGT 9420

AAATA 9425
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3886 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88:

AGTTGTAATG TCACATTTCC AGAGTCTGAA ATTATCTTTA TCACGTTACA TTTACTAGGC 60 

TCTAAAATGA CTGAACATAC AGCATCTTCA ATTACCTTTG AATACCATGA TTTATCGCAA 120

AATATACATG AATTGATCAC TTGTGTTAGC CAAGAATTAG GCATTGATAT GTCAAAAGAC 180

AACAAGTTAC ATACCAGTCT GATCACACAT ATCAAACCAG CTATACATCG TATTAAATAC 240

GATATGCTAC AACCTAATCC TTTGAGGCAA GAAGTTATGC GTCGCTATCC TCAAATCATT 300

GAAGCCGTTA GCAAGCATAT TAGTCCAATT GAACAAGATG CTGCTATTCG CTTCAACGAA 360

GATGAATTAA CATACATTAC AATTCACTTC GCATCAAGTA TAGAGCGTGT TGCAACACAT 420

AAACAATCAA TGATTAAGGT TGTCTTACTA TGTGGTTCTG GTATAGGCAC GTCACAACTT 480

TTAAAATCAA AACTAAATCA CCTGTATCCT GaGTTnCACA TTGGGAtGc CTATTcCATT 540

TaTcAATTGG aAGaAAGTCG ATTATTGCAA GATAACATTG ATTATGTCAT TTCAACAGTA 600

CCTTGTGAAA TATCAGCTGT ACCAGTTATT CATGTCGATC CATTTATCAA TCAACAATCT 660 

CGTCAAAAAT TGAATCAAAT TATCAATGAC TCAAGAGAAC AACGAGTCAT GAAAATGGCA 720 

ACTGATGGCA AGTCACTCGC AGATTTATTG CCTGAACATC GCATCATTAT AAATAAACAA 780 

CCATTATCAA TTGAATCCGC AATTGCAGTG GCTGTGCAAC CTTTAATCAA TGATGGCATT 840 

GTCTATTCAA ATTATACAGC TGCAATTTTA AAACAATTTG AACAATTCGG GTCATATATG 900 

GTCATTAGTC CACATATTGC ACTTATTCAC GCTGGTACTG ATTATGTACA GAATGGTGTA 960 

GGTTTCGCAC TAACATATTT CACTGAAGGG ATTATCTTTG GTAGTAAAGC TAACGATCCC 1020 

GTTCACCTTG TAATTACATT AGCAACGGAC CACCCCAATG CACATTTAAA GGCATTGGGA 1080

CAGTTAAGCG AATGCTTAAG CAACGACTTA TATCGACAAG ATTTCTTAGA TGGGAATATT 1140

TTTAAAATTA AACAACACAT TGCTTTAACT ATGACAAAGG AGGCTTAATA ACGTGTCATT 1200

AGACATTTTG TCAACAACAC GCATCATTGT AAAAGAACAA GTAAATGATT GGACTGAAGC 1260

TATAACTATA GCTTCTCAGC CATTACTACA AGAACAAATT ATTGAACAAG GCTATGTTCA 1320

AGCAATGATT GATAGCGTTA ATGAACTTGG ACCTTATATC GTTATCGCAC CTGAAATTGC 1380 

AATTGCACAT GCAAGACCGA ACAATGACGT ACATCAAGTT GGTTTAAGTC TATTAAAGTT 1440

GAATCAACAT GTGGCATTTT GTGATGAAGA TCACTACGCA TCTCTCATTT TTGTATTGAG 1500

TGCCATCGAC AATCATTCAC ACTTATCTGT A"TACAAAAT TTAGCAACCG TACTGGGCGA 1560

TAACCAAACA GTCCAGCAAC TATTAACTGC AACAAATGCA CAAGACATTA AAAACATTTT 1620

AAAGGAGCAT GATTAATATG AAAATTTTAG TAGTATGTGG CCACGGTTTA GGAAGTAGTT 1680

TTATGGTAGA AATGAACGCA CAAGAAGCAC TTAGGCAACT TAATGCACCA TCTGATACG 1740
AAGTTGAACA TAGTGACATT ATGACAGCAA GTCCAGAGAT GGCTGACTTG TTTATTTGTG 1800

GTAGAGATTT AGCTGAAAAT GCCGAACGTC TAGGGGATGT CTTAGTTCTT GATAATATTT 1860

TAGATAAAGC TGAATTACAA CAAAAGCTCT CAGAAAAATT ACAACAACTT AACATGATTT 1920

AAAGGAGGTA CGACCTATGC AAGCAATCCT TAATTTTATA GTCGATATTT TAAGTCAACC 1980 

AGCCATTCTT GTTGCACTGA TTGCCTTTAT AGGTTTAATC GTTCAGAAAA AACCTGCCGC 2040

AACGATCACT TCAGGAACCA TTAAAACGAT ATTAGGCTTC TTAATTTTAA GTGCAGGTGC 2100 

TGATGTCGTC GTTCGATCTC TTGAACCATT CGGCAAAAA TTCCAACACG CATTTGGTGT 2160 

GCAAGGTATC GTACCTAACA ACGAAGCTAT CGTCTCACTA GCCTTAAAAG ATTTTGGAAC 2220 

AACAGCTGCA CTCATCATGG TCTGTGGCAT GATTGTTAAT ATTTTAATTG CCCGCTTCAC 2280 

TAATTTAAAA TATATCTTTT TAACAGGTCA TCATACATTT TACATGGCTG CGTTTTTAGC 2340

AATCATTTTA ACAGTCAGTC ATATTAAAGG CTGGCTAACG ATTGTTATCG GCGCACTCGT 2400

ATTAGGATTA ATCATGGCAG TATTACCTGC ATTACTCCAA CCTACGATGC GAAAAATTAC 2460

AGGGAATGAC CAAGTAGCTT TAGGTCATTT TGGCTCAATC AGTTACTTTG CCGCAGTGCT 2520

GTAGGTCAAT TATTCAAAGG TAAGTCTAAA TCAACGGAAG AGATTAAATT TCCAAAAGGC 2580

TTAAGTTTCT TACGAGAAAG TACAATTAGT ATCTCGATTA CGATGGCATT ACTTTACTTC 2640

ATCGCATGCT TATTTGCGGG CGTTAGTTAT GTACACGAAT CTATTAGTGA TGGTCAAAAC 2700

TTTATTGTCT TTTCATTAAT TCAAGGTGTG ACATTTGCTG CTGGTCEATT TATTATTTTA 2760

ACGGGCGTTC GTTTAATCTT AGCTGAAATC GTCCCAGCAT TTAAAGGAAT TTCTGAAAAG 2820 

CTTGTACCAA ATTCTAAACC TGCATTAGAC TGCCCTATTG TGTTCCCTTA TGCACAAAAT 2880 

GCAGTATTAA TTGGATTCTT TGTCAGCTTT ATTACAGGTG TCATCGGTAT GTTTATCTTA 2940

TTCTTATTTG GTGGCGTCGT CATTTTACCT GGCGTAGTTG CACACTTCTT CTTAGGTGCA 3000

ACGGCTGCTG TATTCGGTAA TGCAAGAGGC GGTATTAAAG GTGCTATTGc TGGCGCCGCT 3060

CTAAATGGTA TCCTAATCAC GTTTTTACCA TTATTATTCT TGCCATTTTT AGGCGAATTA 3120 

GGTGGTGCTG CAACAACATT CTCAGAT^CA GACTTTTTAG CTGTCGGTAT CGTGTTCGGT 3180 

AACGCAGTAA AATATATGGG ATTATTTGGT GCGATTCTAT TTATTATTAT CGTAGGTGCG 3240

ACAACAATTT TATTAAAAGG CCGTCAAAAA GAACAGCAAT AGTGTTAACG TAGAAATATA 3300

AAACACCGTC ACATATTGAG TGAATGCCCC TTTtATCAAG AGGAAAGCCA CTffCTTATG 3360

GACGGTGTTT TGTATTATAT TAAATGATAC TTAGCCATAC TATCGACAGC TGCTAAAATT 3420

GCTTCTTCTT GTGTCGCAAT CGGTTCCCAA CCAAGTAATG TTTTTgCACG TTCGTTACTT 3480

ACATTGCGGT TCATATCTAA TAAAAGTTTA CCTTCTTTAG CTTGATGATT AAATTTAGCA 3540
CCTAGACTCA AAATAAAGTC TGGTAATTTT TTAGTAGAAA CTTTTTGAGC TATTTCAGGT 3600

CTCTTTTCTT TAATTAATTT TGCAATTTCC AACAAATTAA TTTGTCCATC AGCCGTCGCA 3660

ATAAATCGCT TGCCATTAGC TTGTTCATTT GTCATTGCCA AAATGTGCAG TTCAGCTACG 3720

TCTCTCACAT CAACAACATT TAACGGAATT TGCGG'ACAC GTTTCATTGA ACCATTCAAT 3780

AAATTTTCTA ATAAATGAAA GCTTCCTGAA ACGTGTGCAT CTAATGATGG CCCAAAAATT 3840

GCAACTGGAT TGATTGTGGC AAATTCTACT GTTGTATTTT CATTCT 3886



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4879 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

GTCATCTATC AAAAATTTGG TATACAGACC GACAATTATT AATTAATAAT TTAATTTOC 60 

AGGCAATACC AGTGATTAAA TATCCACAAA TACAACATAA AGAACAACCA TTAGAATCTA 120

TTTCACAACT TATATTGTCT AAGATGACAT CTAATCAATA GTGTTTAAAT TTCTCAGTGG 180

CTGTGAATGA GGTTTAAAAG TACTATAAAA CGTAAACTTT GATACTTTAA AATACGCAAA 240

AAACGGTAAA CCCTAATTCA TATTATAGAG TTTACCGTTT TATTTTTTAA CTTGCATCAT 300

AGTTATATTA ACATTATTGT TGGTAGTTTG GATCAGTAAC CATTGCTTGT CCAGTATAAT 360

CAACCGTTAC AATTGAATAT TTTCCaTTTG CATTTGGGTC TTTAAAACTA AACACATACT 420

TATAGTTGCC ATTATGTTCT TCAATAGAAT AATCATTATA CACTTTATTA TTACTACCAA 480

ATTTATTTGC TTCATTATTA GCCGCATTTA AAGCTGTTTG GAAATTTGGC AATTGCTGTA 540

AAGCTTGATT TTTATTTCCA TTAAACGGAT AAATTTGACG TGCAACCGGC GCGGCATTTT 600 

GnCCATAATA TGGTGCAACG TAACTTGATT TTTGATTATT ATTCGCTTGG TTATTACTTG 660 

ATTGGTTATT ATTTGTTTGG TTTTGGTCAT TGTTTGTTGC ATTTGAATTA GATTGTTGCT 720 

GGTTATCGTT TGCACTATTA TCTTTATTAT CTTTGTTTAC GTCTTTACTA TCATCTTTAT 780

TATCTTTCTT ATCTTTAGAT GAATCATTTG TTTTTTTATC TTGTTGTTCA GTTTTCGCTT 840 

TATCATCTTT TTCTTTATTA CCGTCTTTTT GTTGGTCACT ATCTTGACCA CATGCAGCTA 900 

AAAATAATGA TAATGCTAGT AACCCTGTAA CTAATCTTTT CATACATATC TCCTCCTATA 960

ATTCGATATT CATTGAATAA TCTTGAAATA CATATCTACC ATGTGTATCT TTTCATGGCT 1020

TGTTACCAAA GcGTCAACAT TAATATGATC TGAATTATCA CAATATCAT AACTACAATC 1080

TAAGGTTCTT TTTATTATAC CCTAATTTTT GTTCATTATT ATTTAATTTT TGTGAATTTT 1140
ATGtTTkCTA TAAATTTAAT TATTTTACTT TAACAATTCA TTACGCATTT AGCATTTCAA 1200

GGTATACACA ATATTTATTA CTATGATTTC ATTTTATCTG CTGCAAAAAC AATCATTATA 1260

ACTCTTTTTC CATAATTAAA TCTGTATCCG TTACATCACC TGTTTGAAAA TGATGTTCAC 1320 

CAACCACTTT AAATCCATGA CGTTTATAAA ATGCTTGAGC ACGAGGATTA TGCTCCCAAA 1380 

CTCCTAGCCA AATTTTATGT TTATTATGTT CTTGAGCAAT TTTTTCGGCC AATTCTATCA 1440

ATTGTGAACC TCTTCCGCCA CCTTGAAAG 1 CTTTCAAAAA ATATATGCGC TGCACTTCTA 1500 

AATAGGTCTC CCCCATTTCT TCAGTTTGAG CACTATTAAT ATTCATCTTT ATATAACCAA 1560

CATTCGCACC ATCTTCTTGa TAAAAATAAT GAAATGAATC TACATGGTTA ATCTCTTGTG 1620

TAAATTTCTC TACAGTATAA TTGTCTTTAA AAAATTGATC AAAATCTTTG TCAOATAGT 1680

AAGAACCAAA CGTGTCATAA AATGTTCTAG TTGCTAATTC AACTAATTCA CTAGCATTTT 1740

GTTCTGAAAT TTCTTTGATT ATCCCAGCCA TATAAATCCT CCAATAAACA GTGATCGAAT 1800

CAAAATATTA CTTATGTTAT TTTTCAGCCA AAACTATTTA AAAATACATT AACACAAATC 1860

AATTACAAAT TGTATTGATT GTGTGTAACA TCAATAAATG ATACATTTAT TCCAGTAAAA 1920

TGGCCGTATT TTCAAAAGAG AAAAAGAGAG GATGTATCGT TGTGATAGAA ACATTTAAAG 1980

CGTTTGTAAT TGATAAAGAT GAGAGTGGTA AAGTGACACC AACTTTCAAA CAATTATCGC 2040

CTACTGATTT ACCTAAAGGA GATGTGCTGA TTAAACTACA TTACTCTGGT ATAAATTATA 2100 

AAGATGCTTT AGCGACTCAA GATCATAATG CAGTCGTAAA ATCGTATCCT ATGATTCCAG 2160

GAATAGATTT AGCTGGAACA ATTGTTGAAT cCGAAGCACC AGGCTTTGAa AAAGGAGAAC 2220 

AAGTAATTGT AACGAGTTAT GACCTAGGTG TCAGCCATTA TGGCGGTTTT AGTGAATATG 2280 

CGCGTGTAAA ATCAGAATGG ATTATCAAGC TTCCTGATAC TTTAACATTA GAAGAATCAA 234 0 

TGATATATGG CACAGCTGGT TATACTGCCG GTTTAGCAAT T G AAAG AC T T GAAAAAGTTG 2 4 00 

GAATGAATAT TGAAGATGGT CCTGTACTCG TTCGCGGTGC TTCAGGTGGT GTCGGTACTT 24 60 

TAGCAGTACT CATGCTTAAT GAACTTGGTT AT AAAG T TAT CGCAAGTACA GGTAAACAAG 2 52 0 

ATGTTAGCGA TCAATTACTT GAACTTGGTG CCAAAGAAGT TATCGATCGA CTTCCTGTTG 2580 

AAGATGATCA TAAAAAGCCA CTCGCATCAT CAACTTGGCA AGCTTGTGTA GACCCTGTTG 2640

GTGGCGAAGG TATTAATTAT GTTACAAAGC GTTTAAATCA TACEGGGTCA AT T AC AG T T A 2700 

TTGGTATGAC TGCCGGTAAT ACTTATACTA ATTCTGTATT CCCTCACATT TTAAGAGGTG 2 7 60 

TAAACATTTT AGGAATTGAC TCGGTATTTA CTGCTATGAA AT T AAG AC AG CGCGTTTGGC 2820 

GTCGTCTCGC AAAAG AT T T A ATGCCTGAAA ATTTACATGA GATCAAGCAA GT TAT T ACAT 2880 

TTGATGAACT T CCAGAAC AA C T T AAC AAAG TAATTAAACA TGAAAATAAA GGGCGCATTG 2 94 0 
TTATCGATTT CGGTGTAGAT AAATAGTATT CATGAAAAAG ACATCCCGTT ATGCGAGATG 3000

TCTTTTTTAA TTTAGTATTT GATATACATA CCGCCTGAAT CTGGTTCGGT AGGTATAAAT 3060 

CCAAATTTTG TATATAATTT ATCCCETGGG TAGTCTGCAA TCAGAcTAAC GTATGTACTC 3120 

TCAACAGCCA CACCTTTAAT ATATTGCATA ATATGCTCCA T AAT TAG AC T GCCGTAACCT 3180 

TGACCTTGGT AACTTTTCAA AACTGCAATA TCAACAATTT G AAAAAC AG T TCCGCCATCG 32 4 0 

CCAATCACTC TACCCATACC AATTAACCGA TCTTTATCAT ACAAGGTTACTGTAAATAAG 3300 

GCATTAGGTA ATCCTTTTTC aGCTGTTCGC GCGTCTTTGG ACTCATACCT GCGTTAATCC 3360 

TTAATGCGCA ATAATCCTCG CAAGTCGGAA TATCATATGT CACTTTAACC ATTATTTACC 3420

CCACTTTTCA TCACACAATA TATCAACCTA GTATAAATGT TTATTTACAA TAGTCTTATT 34 8 0 

CGCTTCTTTA AACACTTCAT GATGACTTGA AACATAACCC TCTGCATTCG CATCTGGTTG 354 0 

GATATATGTT TTAGCAAGGT TCGCTGCATT TGCACCATCA CTAAATGCAC TTGCAATTAG 3600 

ATGTGATTTT GC AT CAT GAT AAACAATATC TCCACACGCA T AG AT AC C AG GTATACTAGT 3660 

TGTCGTATTA CCAAATCCTT TAACACGACA ATTATCATGC ATATCTAGCT TTGAAGATGT 37 20 

TtCACTCAAT AATGTATTAC AACGATCAAA CCCATGACTA ATAATGACAT CGTCAAATTT 3780 

AACTGTATGC CTATCGCCAC TTTCAACATG TTCCAAAACA ACTTCACTTA TATGCGTTTC 3840 

ATCATCATTG CCGACCAAGT ATTTAATACG TGTTTTTGGG CATAGTTTCA CATTTAAEC 3900 

TGTCACCAAC GTTTTCATCG CTTCAT GACC ACTTACATCT TCTTTTCGAT AAACAACTGT 3960 

CACGCTTTTA GCAATCTTGG CAATATCATG CGCCCAATCT AATGCTGTAT TTCCTCCACC 4 02 0 

TGATATTAAT ACATCTTTAT CTTTGAAACG TCTGTAACTT TGTACAACAT AATGTAAATT 4 080 

AG T T AAT T G A TATCTCTCTA CACCTTTAAC ATCTAATTGT TTTGGATTAA TAATACCCGC 414 0 

ACCAATTGCA ATGATAACTG CTTTCGATGT ATATATTTCT CCCGCTTCTG TTTCAACTTC 4200 

GAAATGACGT TCTGCCTTTT TCCTAATATC TACCACACGT TCATTCAAAT GAACTTCCGG 4 2 60 

TTTAAAATAT AATCCTTGCT TAATTGTATC TTTTAAAATT TCATGACAAG GTTTTGGCGC 4 32 0 

AATGCCGCCA ATATCCCAAA TAATTTTTTC AGGGTAAATT CTCATCTTAC CCCCTAATTC 4380 

AGATTGAACA TCTATCAATC TTACAGACAT ATCTCGCAAT CCAGCATAAA AGCTTGCATA 4 4 40 

CAAACCAGAC GGACCGCCAC CAATGATTGT AACATCTTTC ATTATGTGCC TCCTATGACT 4500

CTCTATATTC ATTTCTTTCA TTAACGTGCT CAAATTGATA AT TAT TAT C A TTTAAAGCCA 4 5 60 

T TAT ACT ATT AAT ATT TAT A TTGTTAAAAT AAATCGCATA GTTAGCCATG AATTATCAAT 4620 

AACTATCCCA TC AC AAT AT A AAAACACACC TTTCAAAGTA AGAwACAACA ATGTTCCTTT 4 680 

GAAAGATGTG TATATTTTTT AGTTCTAGTT ATATTATTTT TTAAAAGACT CATCACGTGG 47 40 

TTCTTTAAGA ATTGCTTGTC TTAAAAGGAA AAATAGCAAC AATAAACCTG CAAGCATACC 4800 
TGTGTGCCCA ATACCTGCAA AGCCTGCnAA TGCTTCTGGA GAGTATGATT TACCAGTGAC 4860

TTGGAAGAAT CCTTTTGTC 4879



(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1560 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

ATAATGTCTT AGaTTGATTG GGAGTTTTTT TAATTTTTTT GAAATTAAAT TAATCTGTAs 60 

yTAATAAAAA ATTTGAATAA CTGACACAyT TTTTTGATCA TAGCTAyATA CTTTGTGAAT 120 

TAATTCACAT TATAATAAGA GTGAAGATAA GAGTATTATA AATnATCTTT AAATAAATAT 180 

ATGTGAAGTA AAAAT T AC AC GTTAGCATAT CGATTATGgT CATTTCkTTT AAC AT AT T AA 24 0 

CTgGGGaACG TTAAAAGTTA ACGGkTGATA TCyAACtAAA AACAAGGTCA C AG TAG TAT G 300 

TTTTAATCTG GCGTCTATTA CAAATAAAAA TTACATCTAT AATTATTCGT TTTCTTTTTT 360 

GAAAGTAATA GCCAATTAAT ATCATA~:ATA CTGGAGTGAC TATAAGGAGG ACATTATTAT 4 20 

GAGAGCAGCA GTTGTAACGA AAG AT C AC AA AGTAAGTATT GAG G AC AAAA AGTTAAGAGC 480 

TTTAAAACCT GGTGAAGCGT TGGTACAAAC GGAATATTGT GGCGTTTGTC ATACCGATTT 54 0 

ACATGTTAAG AATGCTGATT TTGGTGATGT TACAGGCGTT ACTTTAGGTC AGAAGGTAT 600 

TGGTAAAGTC ATCGAAGTTG CGGAAGATGT AGAATCATTA AAAAT TGG AG ACCGTGTGTC 660 

TATCGCTTGG ATGTTCGAAA GCTGTGGAAG ATGTGAATAT TGTACAACAG GTCGTGAAAC 720 

ACTTTGCCGT AGTGTGAAAA ATGCTGGTTA TACAGTAGAT GGTGCAATGG C T G AAC AAG T 780 

TATTGTTACT GCAGACTATG CTGTGAAAGT ACCTGAAAAA TTAGATCCAG CAGCAGCGTC 840 

TTCTATTACA TGCGCAGGTG TGACAACTTA TAAAGCTGTA AAAGTAAGTA ATGTAAAACC 900 

TGGACAATGG TTAGGTGTTT TTGGTATAGG TGGTTTAGGT AACCTAGCTT TACAATATGC 9 60 

TAAAAACGTT ATGGGGGCTA AAATTGTTGC CTEGACATC AAT GAT GAT A AATTAGCATT 1020 

CGCGAAAGAA TTAGGTGCTG ATGCTATTAT TAATTCTAAA GATGTTGATC CAGTTGCAGA 108 0 

AG T T AT G AAA TTAACTGATA ACAAAGGATT AG A T G C AAC A GTGGTAACTT CAGTTGCTAA 114 0 

GACGCCATTT AAC CAAGCGG TTGATGTTGT AAAAGCTGGT GCAAGAGTTG TTGCCGTTG 1200 

TTTACCTGTT GATAAAATGA ACTTAGATAT CCCAAGATt A GTGCTTGATG GTATTGAAGT 12 60 

AGTAGGTTCA CTTGTTGGTA CAAGACAAGA CTTACGTGAA GCGTTTGAAT TTGCTGCTGA 1320 
AAATAAAGTA ACACCTAAAG TTCAATTAAG AAAATTAGAA GAAATCAATG ATATTTTTGA 1380

AGAAATGGAA AATGGTACTA TAACTGGTAG AATGGTTATT AAATTTTAAA AATATCAACT 14 4 0 

GACTATATAG ATAAAGAAGG TAGTGCTCTG AACACTATCA TTATTAATCA AACCCCGAGG 15 00 

TTTTCCTGAA AAGATAGTGG nAAATCCCCG TGTTTTTTGG GTTTGAGGnG GTTGTnTGTA 15 60 
(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11014 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91:

GTCCTGTnGC TGCAATGAAT ACGCCTAAAA ATCCAGGAT GTAATGGATA CTTTGTGGTA 60 

GTACTAATGA TAGAAATGAT AAAAATGAAA TCACAAAGGC TACGCTCGCA AAAGCTTGAC 120 

ATGTACGCTT ATCGCCATAA TCTAACCCTG TACGTATATG TAATAAATAC TGTAATCCGA 180 

TACTTAAATA CATAATTGCC ACGCATAAGA AGAAT GGGAA GAATGTCTTT TCAAAGTCCG 24 0 

GATATAGGCT GTTAGATAGG AAGACCATGA TGAACATATT AAACATCATA AAC GAG AC G T 300 

CTTTGAATGT AACTTGACCA AATCGATTTG TAAAAAATGT TTGATGAGAC CACATTAACC 3 60 

ATAAGAACAA ACTCATGACG ATGTATTTGA AAAATAAATC AGCTGAAATG GAACCGTTTT 4 20 

GTGTTGTTAA AATCACATGT GCAATTTTTT GAATGGCATA GACGAAAATT AAATCAAAGA 4 80 

AC AAC T CAT G GAATCCTGCA CGCTTTTCAG CTAAATGTTT TGGTGTTAAT GCATTAACCA 540 

TAAAATTTTA ACTCCTTTAA GATGTGTAAT TAATTTACTA AGTATACTAT TTATTTTTTC 600 

TAGTGAATAG GGGCAGATTT GGCGATGAAG TGGAAGGAGA GGTSCTGCA AGGTAATTGC 660 

GGAATTAACA AT CATCAGCG ATTTAATATT TGACTGGAGA CGTCATGGTA ATAAAAAATT 720 

GAT GAG AAA T TGATGGTGAA ACCAGCTGTG AATAsCGaTG cAATGATrsA TAGaATTTAA 7 80 

TTAGAGTCAT TACGCGaAAT GATTAATGAT AATTTGTGGT AAATCAAAGC aTAATTTTGT 84 0 

AC T AT AG AT G AGGATGATAG AGCATATTTA AG AG G G T G AA ATGTTAAAGT GAAACCGTTT 900 

ACGTTTCCGA TTGCCCAAAC AAATTACATC ATTGTATAAT ATGATTTGTT AAATGCATAA 9 60 

CAAGAATGAA AATGTAACAT ACGTAGCAAT TGGTTTCATA AATTGGATGT TAGTGGCGTA 1020 

TTGGTTCATT AGACGTATTA GTAATZAAAT TGTATATATC ATAAGGAGAT GAAT AT G AC A 108 0 

TGACGAGAGT CGTATTAGCA GCAGCATACA GGACACCTAT TGGCGTTTTT GGAGGTGCGT 114 0 

TTAAAGACGT GCCAGCCTAT GATTTAGGTG CGACTTTAAT AGAACATATT ATTAAAGAGA 12 00 

CGGGTTTGAA TCCAAGTGAG ATTGATGAAG TTATCATCGG T AAC G T AC T A EAGCAGGAC 12 60 
AAGGACAAAA TCCAGCACGA ATTGCTGCTA TGAAAGGTGG CTTGCCAGAm ACAGTACCTG 1320
CATTTACGGT GaATAAAGTA TGTGGTTCTG GGTTAAAGTC GATTCAATTA GCATATCAAT 1380 
CTATTGTGAC TGGTGAAAAT GACATCGTGC TAGCTGGCGG TATGGAGAAT ATGTCTCAAT 14 4 0 

CACCAATGCT TGTCAACAAC AGTCGCTTTG GTTTTAAAAT GGGACATCAA TCAATGGTTG 1500 
ATAGCATGGT ATATGATGGT T T AAC AG AT G TATTTAATCA ATATCATATG GGTATTACTG 15 60 

CTGAAAATTT AGTAGAGCAA TATGGTATTT CAAGAGAAGA ACAAGATACA TTTGCTGTAA 1620 

AC T C AC AAC A AAAAGCAGTA CGTGCACAGC AA&TGGTGA ATTTGATAGT GAAATAGTTC 168 0 
CAGTATCGAT TCCTCAACGT AAAGGTGAAC CAATCGTAGT CACTAAGGAT GAAGGTGTAC 17 4 0 

GTGAAAATGT ATCAGTCGAA AAATTAAGTC GATTAAGACC AGCTTTCAAA AAAGACGGTA 18 00 

CAGTTACAGC AGGTAATGCA TCAGGAATCA ATGATGGTGC TGCGATGATG TTAGTCATE 1860 
CAGAAGACAA AGCTAAAGAA TTAAATATCG AACCATTGGC AGTGCTTGAT GGCTTTGGAA 1920 
GTCATGGTGT AGATCCTTCT ATTATGGGTA TTGCACCAGT TGGCGCTGTA GAAAAGGCTT 1980 
TGAAACGTAG TAAAAAAGAA TTAAGCGATA TTGATGTATT TGAATTAAAT GAAGCATTTG 2040 

CAGCACAATC ATTAGCTGTT GATCgTGAAT TAAAATTACC TCCTGAAAAG GTGAATGTTA 2100 
AAGGTGGCGC TATTGCATTA GGACATCCTA TTGGTGCATC TGGTGCTAGA GTATTAGTGA 2160 
CATTATTGCA TCAACTGAAT GATGAAGTTG AAACTGGTTT AAC AT CAT TG TGTATTGGTG 2 22 0 

GCGGTCnAAC TATCGCTGCA GTTGTATCAA AGTATAAATAATAAGAAAAC AGGTTATCAC 2280 
AACAGTATTA ATtACATGTT GGCATAACCT GTTTTTATTT GTTTATGGAT TTATTGGGTA 234 0 

ATATTAGTCA TTTGATGGTT TAATTGCAAA TGCTCTAACA GGGAACCCAG GTGCATCTTT 24 00 

TGGTTTAGGG CTGATAGCGT AAATGATGGC GCCACGAGTT GGTAATTGAT CTAAATTAGT 2460
TAATAACTCG ACTTGGTATT TATCCTGACC AAGAATATAA CGTTCGCCAA CTAAATCACC 2520 
ATTTTTTACA ACGTCCACAG ATGCATCGGT ATCGAATGTT TCATGACCAA CAGCTTCAAC 258 0 

ACGACGTTCT TCAATTAAGT ACTTCAAAGC ATCTAATCCC CAACCCGGTG CATGTTGTTG 2 64 0 

TCCGTTCGCA TCTTTGTTTT Ci&ACTTTTC AATATTAGGC CAACGTTTTG ACCAATCGGT 2 7 00 

ACGAAGTGCA ACAAAAGTGC CAGGTTCAAT AGTACCATGC TCTTTTTCCC ATGCTTCTAT 27 60 

ATGCGCACGT GTTACGATGA AATCATTGTT GTTCGCTACT TCTGTTGAAA AGTCTAATAC 2820 

AATTAACGGC AATACCAATT CTTTTAAATC AATGTCTTCT AAATAAOTT TATTCTCGAC 2880 

AAAGTGAATT GGTGCATCAA TGTGAGTACC ATATTGCGTT ACAATATTCC AACGTTGCAC 2 94 0 

ATAGAAACCA TGATCTTTAA CCGTGAATAA AGTTGAAACT TCGCCTTTTT CAAACTCACT 3000 

AAAACGTGGT ATTTCCGGAT CAAATGTATG CGTTAAATCA ACCCAAGTTG CTTGTTTTAA 30 60 
AGTATTTAAT TGTTGCCATA AAGGATATTG TGTCATAAAA TCACCCGTTT TTAGTTTATT 3120

ATATGATAAA TGCTGCGATT ATTCTTGGCG TTTAGCTTTA ACAGCATTCA CAAGCACAGT 3180 

CAATGCATCT TTAACTTCTT CTTCTTTTCG CGTTTTTAAA CCACAGTCAG GGTTTACCCA 324 0 

GAATAATGAG CGGTCGATTT G T T G TAG T G A AC GAT T GAT T GCTGTAGTAA TTTCTTCTTT 3300 

TGTTGGAATA CGTGGACTAT GAAT AT C AT A TACACCTAGA CCAATACCTA AATCATAATT 3360 

AATATCTTCA AAGTCTTTAA TTAAATCACC ATGGCTACGA GATGTTTCAA TTGAAATAAC 34 2 0 

ATCAGCATCT AAGTCATGAA TAGCATGAAT GATTTGACCG AATTGAGAAT AACASTATG 34 80 

TGTATGGATT TGAGTTTCAT CACGAACTGA AGACGTTGCA AGTTTAAATG ATAAAACAGC 3540 

ATCTTTAAGA TATTGTTCGT GATATTCAGA GCGTAATGGT AAGCCTTCAC GTAATGCAGG 3600 

TTCGTCAACT TGGATAACTT TGATTCCTGC AGCTTCAAGT GCTAATACTT CTTCGTTGAT 3 6 60 

TGCTAAAGCA ATTTGATCTT GAACGACTTT ACGTGGTAAA TCAACACGTT C AAAT GAC C A 3720 

GTTTAGAATT GTTACAGGTC CAGTTAACAT ACCTTTAACT GGTTTATCTG TTAAGCTTTG 3780 

TGCATAAACT GTTTCATCAA CAGTTAAAGG CGCTGTCCAT TTTACATCAC CATAAATGAT 38 4 0 

TGGTGGTTTT ACGGCACGTG AACCATATGA TTGCACCAA CCGAATTTAG TTACTAAGAA 3900 

ACCTTGTAAT TTTTCTCCGA AGAATTCAAC CATGTCATTA CGTTCAAATT CACCGTGAAC 3 9 60 

TAATACATCT AAGCCAATGT CTTCTTGAAT TTTAATCCAT CGAGCAATTT CATTTTTTAA 4 020 

GAATGTTTCA TATGCTTCGT CTGTAATGCG TTTGTTCTTC CAATCTGCAC GGTATTTTCG 4 08 0 

AACTTCTCGG CTtTGTGGGA ATGATCCAAT AGTTGTTGTT GGTAAATCCG GTAAGTTCAA 414 0 

ACGTTTTTGT TGTTGTTCAA TACGTTGCGC GAATGGTGAT TGTCTTGAAG TACGCACGCT 4200 

TTCGAAATCA TAATCTAAGT TTTTGAATGA TTGATTTTGG AAACGCTCAT AACGTGCTTT 4 2 60 

TAATTTATCA TATTTAACAC TATCGTTTTG ATTAAATAGG CGACGCAATG CATCTAATTC 4 32 0 

GTCTAATTTT TCAGTTGCAA AGCTTAAGCC TTCGCCAACA CTTGTATCTA ATGTTTCATC 4 38 0 

ATCTAAAGAT AC T GGAACAT GTAATAATGA AGATGATGGT TGAATGACAA GTTCATTAGT 444 0 

GTGTGCTAAC AATTTATCGA TTAAGACTTT TTTAGCTTCA ATGCACTTG CCCATACATT 4 500 

ACGACCATCA ATAATTCCAG CGTATAATGT TTTTGATTTA TCAAAATCTC CAGCTTCAAT 4 5 60 

TTGTTTAAGG TTATAGCCAT TAT CAT GG AC AAAGTCTAAA CCTATACCAC CAACAGGTAA 4 620 

AGAACTTAAG AATTTAAGAT GTGCACGTTC AAAGTATGTT T GAAT GAC T A ATTTTTTAGC 4 680 

AACACCAGCT TTTTCGAAAT AGTCATAAGC TTCACGTGTA ATATTTTCAT AGCTTTCGCT 4740 

GTCGTCTGTA ACTAAGATTG GCTCATCAAC TTGAATGTAC TCAGCACCTG CATCAATTAA 4 800 

TGATTCAAAC ACTTCTTTAT AAAGTGGTAA TAACGTTTTA ACTTTTTCTT CAAAAGTTTG 4 8 60 

GTGACCGCCT TTTGATAATT TAACAiAAGT AATCGGACCA AC AAT GAC AG GGTGAGCGTT 4 920 
AACGTTTAAA GATTGGGCAT ATTTAAAGCG ATCTAATAAT ACATTGCGAC TCACTTTAGG 4980

CTCAACATTG TCCCATTCAG GTACGATGTA ATGATAGTTA GTGTTAAACC ATTTTATAAG 504 0 

TGCACTTGCA ACATGGTCTT TATTACCGCG AGCAATATCA AATAATAAAT fiTCATCAAT 5100 

AGTTCTTCCT TGGAAACGTT CAGGGATGAT GTTGAATAAT AATGACGTAT CTAATATATG 5160 

GTCATATAAA GAGAAATCAC CAACTGGGAT GCTATCTAAG T GAT AG TACT TTTGtAATAA 5220 

TAAATTTyCT TTATGTAGAT CAGTTAATGT TTGATCTAAT TCTTCTTTAG AAATCTTCTT 5280 

TGCCCAATAA CTTTCGATGG CTTTTTTCCA TTCTCTTTTT CTACCTAATC TTGGGAATCC 534 0 

TAAGTTTGAT GTTTTAATTG TTGTCATAAT ATTGCCTCCT TGTGAGCAGT AATAGATTTT 54 00 

GAGTATGCTG CAAGTTCTAA TGAATCTTCG ACATTTTGAA ACGGTGTGAT AATGTATAAA 54 60 

CCATTAAAAT ATTCATGAAC AGTATCGATT AAffCCTTTG AAAGCTTAAG ACTTAGTTCT 5520 

CGTGTTTTGG CTTTATCATC TTTAACTGCT TCAAATTGTT GTAAAATTTC ATCTGACATC 5580 

TTGATTCCTG GCACTTCATT ATGCAAAAAG AGTGCGTTTT TGTAACTTGC GATAGGCATA 5 64 0 

ATGCCTATGA AAAATGGTTT GTTCAAGTGC TTAGTGGCAT GGTAAATTTC AAT GAT TUT 5700 

TCTTTGCTGT ACACGGGTTG TGTTATAAAA TAAGACATTC CGCTTTCTAT CTTTTTCTCT 57 60 

AATCTTTTGA CGGCACCATA TAATTTACGA ACATTAGGGT TAAAGGCGCC AgcGATGTTG 58 2 0 

AAGTGTGTAC GTTTCTTCAG CGCATCACCG TCAGTGTTAA TACCTTGATT AAATCTTAGA 5880 

GCGAGTTCAG TTAATCCTTT AGAATTAACA TCATAGACAT TGGTTGCACC TGGTAAGTGA 5 94 0 

CCAACTTTTG AAGGATCACC AGTTATGGCT AATATTTCGT TAACGCCAAT GAGCGATAAT 60 0 0 

CCAAGTAAAT GGGACTGCAA GCCGATTAAG TTTCGGTCTC GACATGTAAT ATGTACGAGT 60 60 

GGTTCAATAT TGTAATATTG CTTAATTAAG CTAGCAGCAGCAATATTGCT AATTCTGACA 6120 

GTTGCCAATG AATTATCTGC GAGTGTTACC GCATCTACAT TAGCTTTATC AAGTTTAGCG 6180 

ATATTTTCAA AAAATCTATC CGTGTCTAAA TGTTTCGGTG TATCCAATTC GATAATAACG 62 4 0 

GTTGGACGTT CTTGAACCTT AGATGTTAAT GATTGTCTAA CTTTATTTTG AGATGGATTG 6300

AAAAGTGCTT TCGTTGGTAT CGGAATCACT TTTTTGTCAT TAACAGGTTT AAGTGTCTGA 63 60 

ATAGATTCTT TAATAAATTT GATGTGCTCT GGCGTTGTAC CACAGCAACC ACC AAT T AAA 64 20 

CGAACACCTT CGCGAATTAG ATTTTGAGCA ACTTGACCGA AATATTGTGC ATTGTCACTA 64 80 

TACTTAAATT CACTATTTTC AAT AT C T AAT AAGCTGGCAT TTGGATAACA AGATAAGAAT 6540 

GCGTGCTCTG GTAATTCAAT ATGTGTGAAA GACTCTTGCA TATGGTGCGG GCCATGATGA 6600 

CAATTGAGTC CCACGATGTT TGCACCACAT TGAACGAGTT GTTTTAATCC TTCATTGATT 66 60 

GCCTGACCAT TAACTAAGTA ATTTGTGTTT GAAGCGGTTA ATTGAGCAT GATTGGAATG 672 0 
TCGTATTTCT TTCTCGTTCG TGAAATGACA TTTGTTAACT CTTCTAGGTC GTAATACGTT 6780

TCGAAAAGTA GCGCGTCAAC GCCTTCTTCA ATTAAGGTGT CTATTTGAAT TTCAGTATGA 68 4 0 

TAAAGAATAG TTTGTAAGCT GATATCCTCT TGTTTGATAC CTCTAAACCC ACCAACTGTG 6900 

CCTAATATAT ACGTATCTTT ATTTGCTGCT TTTTTTGCGA TGCGAACGGC GGCTTGATGT 69 60 

ATTGCTTTAA CTTTATCTTC AAGACCGAAT CGTTTTAACT TTTCAAAATT TGCACCATAA 7020 

GTATTGGTTT GAATGACATC AGCACCGGCT TCAATATATG AACGATGGAT GCGTTCAACT 7080 

TTATCTGGAT GGCTAAGATT ATATGCTTCT GGACAGGTGT CTAATCCTTC AGAGTATAAA 7140 

ATGGTTCCTA TAGCGCCATC AGCTACTAAA ACATTATCTT TCAATTGTGT GAGGAATTGA 7 2 00 

CTCATTGAAT GCCTCCTTTA ATGCGTATTT GATGTCTGCA ATGAGTTCAT CAGGATCTTC 72 60 

GAGACCAACA CTTAATCGGA ATAGACCGAA AGTGATACCA CGTTCTTGTC TCACTCTTC 7320 

AGGTAGTGCA GCGTGAGACA TTGTTGCTGG ATGTGAAAGG ATCGTTTCAA CACCGCCCAG 7380 

AC T C AC T G AA ACGAGTGGTA ATGTCAGTGC ATCGACAAAT TGTTGTGCTT TAGACTCATC 74 40 

AGCTAAACGA AAGCCAATAA CGGCACCGCC ATTTTTAGCT TGTTCTAAAT GAGCAGTAGT 7 500 
GAGTCCCGGA TAATAAACTT CTGAAATTTC ATCTTGCTTT ATTAAAAATG ACACGATTTT 7560

TTGAGCGTTT TCGACAGATT GTTTAAATCT GATTGGAAAA GTTTTTAAAT GTTTAGCAAG 7 620 

TGTCCAGCTA TCCTGAGCAG ATAACATATT GCCTGTACCA TTTTGTATTA AATAAAGAGC 7 680 

GTCACTAATT GCCTCATTAT TAGTTATGAC AGCACCJSCA ATTAAATCGC TATGTCCACT 77 4 0 

TAAAAATTTT GTAGCACTAT GAATGACAAT ATCAGCGCCA AGTAATAAAG GTGATTGACc 7 800 

TAACGGTGTC ATAAATGTAT TGTCCACAGC TACCAGTAGT TCATGCTTTT CGGCTATTTT 7 8 60 

AGAAACAGCT TTGATATCAG TAATTTTAAA ACAGGGATTC GATGGTGTTT CGATATAAAT 7920 

TAATTTTGTG TTTGATTGAA TGGCACCCTC GATTTGTTCG AGCTTTGTAG TATCTACGGT 7 98 0 

TGTAAATTCA ATATTAAATC GAT TCAAAAT TTGCTCAGTG AGGCGAAAAG TACCGCCATA 804 0 

TACATCATCG GGTAAGATGA CATGATCACC AG AT T T G AAA GTCAAAAGTA CTGCTGAAAT 8100 

AGCAGCAATA CCTGATGC2R AAGCAAAAGC GAATTTTCCC TGTTCTAATC GTGCTAACTT 8160 

CTCTTCTAAA AGTTCACGGT TAGGGTTGCC cTTCGTGCAT AATCATATTT AACATCGCCA 8220 

CCAAGACTTG TTTGATGGAA TGTTGAAGAA TCATAGAGTG GTGGGTTAGC TGAATGATAT 82 8 0 

TCCACACCTC TACGCCAATC GAATATCACT TCTGTCTCTT TTGAAGTGT CATACAATCT 834 0 

CTCCAATCTG AGCTTTATCT AATGCTTGGA TGATATCGCG TTCGATGTCT TCATAATTTT 8 4 00 

CAACACCTAG TGATAAGCGG ATTAAATACT CATCAATGCC ACGTTTATCT TTTTCAGCAT 8 4 60 

CTGGCATATC AACATGTGTT TGGGTGTAAG GGAAGGTCAC TAATGTTTCA GTACCTCCTA 8520 

AACTTTCTGC AAAAATGCAA ATGTCTAAAT TTTCTAATAA TTTAGCGACG CTATAGGCCT 8580 
TGTTAAGTCT TAAACTAAGC ATGCCAGTTT GCCCGCTATA TAGTACTTCG TCAATTGCTT 8640

GAAGTGACTG ACATTTTTTA GCAAGTTTTC TAGCGTTTGA TTGCGCACGC TCAATGCGTA 87 00 

AATGCAAAGT TTTAAGTCCA CGTAACAACA AATAACTATC TATTGGTGAA AGTGTTGCGC 87 60 

CAGTCATGTT GTGAAAATCA AACAACTGTT GCGCGAGTGA TTCATCTTTG ACGGTTACGA 8 8 20 

CACCTGCTAG TACATCGTTA TGTCCGCCAA TATATTTCGT GGCTGAATGT AAGACTATAT 8 8 80 

CAGCACCTTC TGCTAGTGGT GTTGAAAGAT AAGGTGTTAA AAAAGTATTG (TGATAATTG 8 94 0 

ACAATAAGCC TTTAGCTTTA CAAAGTTGAT AGTATGGCTT TACATCAATA GCAATCATTT 9000 

GTGGGTTAGA TATTGGTTCA ATGAATAATG CAACTGTTTT ATCAGTGATT TCTTTTTCAA 90 60 

CTTGTTCATA ATCTGTAAAA TCAACGTACT TAAATTTGAT ATCGTATTGT TGCTCGTAAA 9120 

ATTCAAATAA TCTAAATGTG CCACCATATA AATCGAATGA AACTAAAATT T CAT CAT GAG 9180 

GTTTAAATAG ATTACATATT AATTGAATGG CTGACATTCC ACTTGATGTA GCGAATGATG 92 4 0 

CAATACCATG CTCAAGTTTG GCAAAACAGG TTTCAAATGT TGAGCGTGTA GGATTTTTAG 9300 

TACGTGTATA ATCAAAACCT GTCGATTGTC CTiSTTTTGG ATGCTTGTAG GCAGTAGATA 9360 

AATGGATTGG ATTCGCTATA GCACCGGTTG AATCATCGGT TAATGTGATT TGGGCTAACT 94 20 

GTGTATCCTT CATATTAAGA CCCTCCTATA AGAAAAAATA AAAAAAGCTT CCGTCCTTCG 94 80 

TACCCGAATG AATCGGATAA AAAGGACGAA AGCTTATGTT TCGCGGTACC ACCTTTATT 954 0 

GTTATTCCAT CGCTGAAATA ACCTTATTCA GTACGCATTA AAAGTAAATA TGCTTACTGA 9600 

AC AAT TAT C A CAATTAAAGT CAGTAAGTAA GGATATAGTA ATGTGCTATC CCATACTTAT 9 6 60 

TAACAAAAAA TCGTGCGTAA AGAATCCAGT ACGCCATTTA ACATCAATGT TAATACTGTA 97 20 

TCGCTATAAC GGGCGAACCC GTAGACACCT CATATTGGCA TCAACACTCC AAGGCCATTT 97 80 

TCAAACACGC TTTCAAAATC TTCTCTCAGC T AC T AAAG AC TCTCTGTATA AGCAGGGTGT 98 4 0 

GTTTTACTTy CCTCTTTATT GTGTTTACGT TTCATTAAAC TGTTATAAGA TATTAATTAG 9900 

CTTACAGAGT AAAAAAAGAT TTGTCAACAA T T AT T C AGAA AAT T T TG AT T TAAAAGTTAA 9960 

TTTGTTTGTG AAATTGTAAT TGGTATCTTG AAGTTGAAAA ATGAATTATT TTTTAAATAA 10020 

AGTGTGGTTA ATGGTTGTCT GACTCATTTA GAATACATAA AAT AT AT T T A ACTGTTGTTA 10080 

TCAAATAAAA AGTGATGTGA GTGAATTGTC AAAAAGTGAA GATCAACGTA TTACTAAAAC 10140

AAAAGATGAA CAAATTAAGC AAATAGATAT ATCGGATATC AAACCGAATC CGTATCAGCC 10200 

CCGAAAAACT TTCGATGAAA ATCATTTAAA TGATTTGGCA GATTCAATTA AGCAATATGG 102 60 

AATTTTGCAA CCAATTGTGC T T AG AAAAAC AGTTCAAGGT TAT T AC AT T G TAGTTGGTGA 10320 

AAGAAGGTTT AGAGCTTCGA AAVTTGCTGG TCTAAAATAC GTATCAGCGA T TAT C AAAG A 1038 0 
TTTAACAGAT GAAGATATGA TGGAACTGGC GGTCATCGAA AATTTACAAC GAGAAGACTT 10440

AAATGCGATT GAAGAAGCTG AAAGTTATCA ACGTTTGATG ACAGATTTGA AAATTACACA 10500

AC AAGAAG T A GCGAAACGAT TGAGTAAGTC GCGCCCGTAT AT AG CG ASA TGTTGAGGTT 10 5 60 

ATTACATTTG CCGAAAAAGA TTGCTGACAT GGTAAAAGAT GGGCGACTGA CAAGTGCACA 10 620 

TGGACGAACG TTATTGGCAA TTAAAGATGA ACAACAAATG CTTAGGTTAG CGAAACGGGT 10680 

TGTTAAAGAA AAGTGGAGTG TCAGATATTT AGAAAACCAT GTTAATGAAT TAAAAAATGT 10740 

TTCGTCAAAG TCGGAAACAG AC AAAG TAG A TATAACTAAG CCTAAATTTA TAAAGCAGCA 10800 

AG AAC G AC AG TTGCGAGAAC AGTATGGTAC CAAAGTAGAT ATATCAATAA AAAAATCGGT 108 60 

TGGTAAAATC TCATTTGAGT TTGATTCACA AGAAGATTTT GTGAGAATAA TTGAACAATT 10 920 

AAATCGTAGG TATGGTAAAT AGTTACACAA TTTTATATAA TAACTCTTTG TGCAAGTGTA 10980 

AATAAATTGT AATCAGTGAC ATTTGATTCT AGAT 11014 
(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

TCCCCTTATG GAATTTCACA TTCTAGTTTA CATAATATAT AT T AT AGGAA GTTATATGTG 60 

TGTAACGCAA AAgGTACCCT ACATCATAAT CATTATCTAA TATCGTCACA TACTTACTT 120 

ATGCTATAAT CATGGTATTA TATTGTTTGG AGTGATTTGA TGAGATTTGT CTTTGATATT 180 

GATGGTACGC TTTGTTTTGA CGGCCGATTA ATTGACCAGA CTATTATTGA TACATTGTTA 24 0 

CAATTACAAC ATGATGGTCA TGAACTTATA TTTGCATCAG CACGTCCGAT TCGTGATTTG 300 

TTGCCAGTTT TACCATCAGT ATTTCATCAG CACACATTAA TTGGCGCAAA TGGTGCTATG 360 

ATTTCACAGC AATCAAAGAT TTCTGTTATC AAACCAATTC ATACTGATAC ATATCATCAT 4 20 

ATCTTCAAAA TAATTCAAAA GTATGAGTTA GATTATATTA T T GAT GAT G A TTGGAATTAT 4 80 

GCTGCACAAC TTGACGCTGn AGAACGCGAT TTTTJAGCGT TTAGATCCAC ATAAGCTGGC 54 0 

CAGTTGTATT GATGTTGCAA ATATCGACAC GCCAATCAAG AkTATTTTAT TAAATATAGA 600 

CCCGGCACAA ATTACAACTA TAT TAG AC GA GCTAGATAAA TACCATCAAG AATTGGAAAT 6 60 

GATTCACCAT TCAAATGAGT ATAACATTGA TATAACAGCG CAAAATATTA ACAAATATAC 72 0 

TGCATTACAA TATATATTTG AT G C AG AT G T TAAATATATA GCATTTGGTA ATGACCACAA 7 80 

TGATATTGTC ATGTTACAAC ATGCTAGTAG TGGCTATATT ATAGGACCAT CAGAAGCATA 84 0 
CACACACGCA ATATTGAAAC TTGATAAAAT CAAACACATC AATAATAATG CACAAGCTAT 900

TTGCAAAGTC TTAAAATCAT ATAAATAAAA ACACCCCTAT CAAATGATAA TCATTATCAA 960 

TCGATAGGGG CTATTTTAAT AAAATTCGTC CTCGAACATT TCTTCCTCTT CATCTAATCC 1020 

AAATAATTCT GCCATTTCTC CATGTTCAAT TAACATGTTT AAATATGCAT CGCGGAGTTC 10 80 

TTCTTCACTC ATATCATTAA TCATTTCTTT AAGACTATCA SCCACATAT TTCTGCGTAA 114 0 

TTGATAGTCT TCTTCAACTT CGTTTAACAT CATTATATGT TTATTTGCTG CTTCTGGACT 12 00 

AGCTGTAAAG AGTAATGCAA TCATATGTTT ACATATCACT CGTCTTCCAT CAGCATGAGG 12 60 

ACAATTACAT ATGGATTTTC TAGGATGTTC CATATCAATA TAACAACGAT ATACTTTGTT 1320

GCCACTGCCC TTTACTTCAG CCTCATGCTG CGTTTCTGAA AATGATTTTA AGTTAATGAC 1380 

GCATTCACTT TGATAATAAT TAAAGCCTCT TTCTATAGAA CGAATACTTG CAATATCAAG 14 4 0 

TAATCCCATT AATGaTACTC CTTTTTATTA TTATTTTTAA ATAAAGAaAA TAAAATAGAT 1500 

AAGTGTCTAG AT T AAAAT AC TTGVTTTATC TATATTTTAT AACAAGTCTA GAATTATCGC 1560 

ATTCTTAAAT AACTAATATG AAAATGcTTG CACTAATTCt TTTGTATAAG GGTGTCTATC 1620 

AACATTAAAT AATTCCtCTA TTGCAAAATC ATCGACTATC ATGCCATCCT TAAGAACGAT 1680 

AATTCTATTA ACTAAGCGTT GTAACACGGA TAAATCATGA GAAATAACfi T AAAAT G ATT 174 0 

TAAGTTCGTA ATCGTTTGCG CTTTTAATAT ATTGATTACA TTTTGTTCAG C T AT AAC AT C 1800 

TAAATTTGAA GTTATCTCAT CACATATTAA AACGCGAGGC TGTGCTAATA ACGAACGCAT 18 60 

G AC AT T AAAT CTTTGTAATT GTCCGCCACT CACTTCGCTT GGTAATTTAG TCAATAATTG 1920 

CGCGTTTAAC TCAAAAGTAG ATAAATGTTG TAATAATAAT TGATCCTGAG CAGTATTATC 198 0 

AGTTAGACCT CTGTAATAAT ATAACGCTTC TTTTAATGAG GTCTCAATCG TCCAATCAGG 2040 

GTTAAAGCTA GTTAAAGGGT GTTGGAAAAT CGGTAACACA GCATTGTCAC TTAAGTAAAT 2100 

CTCTCCTTTA ACAGGTTTAA ACAAGCCAAG AACCAATGAA GCGAGCGTAC TTTTACCACA 2160 

GCCACTTTCG CCT AAAAT AC C AAC AT T T T C TCCATCAGGT ATAGTAATAT TGATATCTTG 2220 

TAGCACCATC TGCTTTTTAT TATAACCAAA AGTCACATGT TTTAACTCAA TCATAGTAAT 22 8 0 

CCCTCTTTAA TTGTGTTCTA TATTTAATTA GACGTTCAGT ATACGGATGC AAATGDCAT 2340 

ACTTGAAATG ATTAATATTA CCTCGTTCAA TGATTTGACC TTCTTTTAAA ACATAAATGT 2 4 00 

ACTGACAATA TTTCAATACA TGACTTAAGT TATGTGTGAT AATAAATAAT GTTTGACCAT 24 60 

GTTCTAATAC AATATGCTGT AATAAATCCA TCACTTGATT ACCGTTCAAA GCATCCAATG 2520 

ATGCAACTGG TTCGTCTGCA ATGATTAATT TAGGCTCCAA CAT GAG AAC G CTTGCTATGT 2580 

ATACGCGTTC AAGTTGGCCC CCAGAAAGTT GGAAAC TATA TTTATTTAAT ATATCTTTGC 2 64 0 
TTTGTAAATT AACCCACGAC AAAGCCTTAT CAACTTTGGA CAAAGCCTCT TCTTTACTAC 2700

CTTTATAATG CTTACGATAA ATCGCAGTTA ACTGTTT&C TAATTTAGTA TGGTCGTTAA 2 7 60 

AACTTTCTGC ATAATTTTGA GAAATATAGC CAATTGTATG ACCATAATAT TGACTCAATC 2820 

TACTAACATT TTCCCCATCA AATTGGTACG AATCATACGT GCAGCTTAAA TCAAATGGTA 28 8 0 

AATATTCAAG TAAAGCTTTA GCAATCAAAC TTTTTCCAGC GCCGCTCTCT CCAAT CAAGG 2 94 0 

CATTAATCTG TTGACTAAAA ATTTTCAAAT CAATCCCTTT AATAAGAGAT TTCTCACTAG 3000 

TATTCTTTAT TGTTAAATTT TGTATATCAA TGAGACTCAT CATATTCACC CCGTTGTTTC 30 60 

AGCAATCTAT CTCTTAGTGC ATCACCGGTT AAATTAAAAA TTAAAATAGT TATAGCAATG 3120 

ACTGAAGCAG GTGCAATCAA CATAATTGGA TGAGACGAAA TAAAATCACG ACCTTGTTGC 3180 

AACATAGCGC CCCaCTCTGG TGTTGGCGGT TGTGCACCTA ACCCAATAAA T GAT AG T G AA 32 4 0 

CTTATATATA GAAT GATTTT ACCGAAATCA AC G AC CAT C A AAACGATAAT AGCCGGTATA 33 00 

ATTTTAGGTG TTAAATGACG TATTAATATT GTTCTTGTTG GTACffiGAAA TAATTGTGCC 3360 

ATTTTTATAT AAGGCTTATT CATTTCGCTA TTAACTATAC TTCTAGTCAA CCTTGTGTAA 34 20 

TTCATCCATT TTATTAATGT AATTGAGATA ACTAAATTCC AT AAAG AT G G TTGAAAAAAA 34 80 

CTTGcTAAAG CAATCATGAT GATAAATTCT GGAATACTTA GACCAACATC AATAAACCTT 354 0 

AACACTAATC GTTCAATCCA CCCTTTTTTG TATCCGGCAA ATAGACCTAG TGTAACACCT 3600 

ATGACAACGA TAGCTATTAA TGTTAAAACA GTAACAAACA ATGTTGAACG TGCACCGATA 3 6 60 

ATAATTCGGG TAAATAAATC TCTCCCATAA TCATCAGTTC CTAATAAATG CAACCAACTA 37 2 0 

ATAGGTTCAA AAGTTTGTGA TAAATTGACT TTGGTTGCAT TTTCACTACT GACAAAGAAT 37 80 

TGCAGTACAA T T AC CAC AAA AATAAATGCA ACGAATACAA AAAAT AT C AG GTTATTCTTT 38 4 0 

GAAAATATTT TAT G CAT G AC GGTCACTACT TTCTGATATC AATGGTGTAT TGGTTTTGAT 3900 

TTTTGGATTT CCTAATTGTA AACGCTGCTT CGGATCAAGT AATAACGTTA ASAATCAGC 3960 

AATCGTATTG ATAATAACAA CGAAGAAGCC AATAAATAAC ACGCATCCTT GAATAACAGG 4020 

ATAATCTCGA GATTTAATAC TATCCATTAA TAGATAACCA ATACCAGGTA TATCAAATAA 4 080 

ATTTTCAATC ACTACAGTAC CACCTATTAG ACTGCCAAGT GAAATCCCTA GTAATGGGAT 4140 

AATCGGCAAA ATTGTTGGTT TTAGTAAATC AT GAAT T AAA ATATAACGTT CATTCATACC 4 200 

GCGTAATCTT GATGCTTGTA C GAT AT TACT TTGCAATAAC ATCAATAAAT TAGAACGCAC 4 2 60 

TAAACGAATG ATGTATGCAC AC AT AC C T AA AGATAGCGTG ATTACAGGTA ATATAAACTG 4 320 

ACTTAGTATA ACGCTATCTA TATTCATTAA ATT"ETGACA ATAAATAATA AAATAATACC 4 380 

GATAAAGAAC GCTGGTAAAC TAATCGATAG TGTTGAGATC ACTCTAATCA CTTTATCCGT 4 4 40 

CCACTTATGA AATCGTTTGG CTGCTATAAT GCCGAGCGGT ATAGATATGC ATAACGACAC 4 5 00 
TACTAATGTT GAAAATGATA TGAGTAATGT TATGGGTGCA TAGTTGAATA ATATCTGTGT 4560

TACCGGTTCT TTTGATTCAA AACTTTTTCC TAAATTAAAA TGTAATAAAT GATTCATCCA 4 620 

ATGCCACCAC TGTACCAATA AAGAATCATT TAATCCCAAT TTATCTTTGG TTGCATTTAT 4 68 0 

TTGTTCCGTC GACACTTGTG CTACATCAAG ATGTAATATT TTATCAACAG GATTGCCTGG 47 4 0 

TGATAATTTC ATTAAAATGA ATGTAAGTGT AGAAATAACA AATAAAACAA CTATCATTTG 4 8 00 

CATCAGTCTA TACAACATAG ACTTTATTAT GAACATAATA GTCCCCCTCC TTGTGTAAGT 4 8 60 

TACTAACACT TTCTTTTTAC ATGAGAATGG CGCATGTATA TGCAACTTAC ATATTAAGAA 4 92 0 

CTAACGTTCA TTATAGTATT ATCCATAAAG AAATTGAAGT ffiATTTAATT TTTTAACAAA 4 98 0 

ATCATTATAA AATATAATAT TTTGAATCAA GTCAACCATG TAAAATATAA AAAAGTCAAA 504 0 

ACAAAAACAA CTATAGCACT GTATTCCATC TCTTTCGAAA TAATTGTTAC TGCAGTGTAA 5100

CTTAAAAGTC GATGATTTTG TGCATATAGT TGTCGAATAT TATTTTTTAT CTTTACGGCG 5160

AAGTTCAGCG CCCTCATAGC CGTATTTTTC AATTTGCTTT TCTAATTTAC GCGCTTTTCT 522 0 

TTCTTTACGC CAATTTCTAG T AAAAT AC C A TAATAGAAAA CTAATTAATA AACTCATAAT 528 0 

CGCTAAAAAT GCAGCGTATC CTAATAATGG TTGATATTTT ATATCTTGAA AATTTGGAAT 534 0 

AAAAAATGCA AGCACACCTA ATATAACAAA TGTAATTACT GCAGATACAA ACCATTTATT 54 0 0 

TAAAACTAAG CAACAGAATA TTGTTAATAA AATCATTATT AATGTTGTGA TCCATAAATA 54 60 

ATTAGGCATA TCGAATAATG TCATATTCAT TCTCCTTTTA TTTCATTACT TTCCTTGTAT 5 52 0 

ACATTTTATT ATAAATTTTT AAAAACTTAA ACAATAGCAG TCAGTTTCA GCAATATTCT 55 8 0 

ATCTACTAAT AG AAAAAT C A TTGTTCCTTG CGACATGGAA ATCGTAACAT TATCGTTTAG 5 64 0 

GAGACAAAAT TATGTATAAT GAATGTATTA TACCAAAGGA GTGATTATAT GTCTCAAGGT 5700 

TTACCTTTAA GAGAAGATGT TCCTGTTTCA GAAACATGGG ATTTAGTAGA CTTATTTAAA 5 7 60 

GATGATCAAC AATATTATGA AAGTATTGAC GCTCTAGTAC AnCAAGCAAA TCAATTTCAT 5 8 20 

CnTACATATG CAACAACATT AAATTCAATC GAACAAATTA ATACTGCTTT AGCTGAATTA 58 8 0 

G AAAAT AT T T TAATTGCCTT AGATCGCTTA AGTAATTATG CAGAACTACG TTTAAGTGTA 5 94 0 

GATACTAGTA ATATCGAGGC ACAAGTATTG AGCGCTAAAT TATCTACTAC ATACGGTAAA 6000 

ATTGTTAAGC CAATTATCCT TT 6022 
(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 476 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93:

CCATCAATAA TGTATACATG ATTGGCATCA TATTCCCCTT TAATTAGAGA GCTACGTACA 60 

GTTTGTyTTA TTAAAGTAGA ACTAATAAAT AACCATCTCT TATGTGCACA AAC&TTCCC 12 0 

GCAACAATTG ATTCAGTTTT ACCAACCCGT GGCATACCTC TAATGCCAAT CAACTTATGA 180 

CCTTCTTCTT TGAACAATTC AGCTAAAAAG TCTACTAACA AGCCTAAATC TTCACGCTCA 24 0 

AATCGAAAGG TTTTCTTATC TTTTGCATCT TGCTCAATAT ATCTTCCATG TCTTACTGCA 300 

AGACGGTCTC TTAATTCTGG TTTTTTAAGC TTTGTTATTT CAATTTCATT TATACCACGA 3 60 

GCTATTTGCT CAAAACGTTC AACTTTTTCA AGATTGTCTG TTTTAATTAA AAGGCCTCGT 420 

TTACCTTGAT CAACACCATT AATTGTAACA ATACTTATAC CTAACATACC TAATAA 47 6 
(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3633 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

AGAAATACAA CGAAGCATAT AAATATAACC GATJTTTTTT CTAATTGAAT ATTAAGTAAG 60 

TGTATGTACT TTCTGGAAGT AGCACCTAGT rGGATTGTtC CTCCTACAAC AGGCCAAAAA 12 0 

TTTTTATTTT TAACTGGCTT AACAGTGTTC AGTTTTTCAT ACTCTTCTCT ACTAATTTTG 180 

GCGCACCTTT TTGGAATGAA CCAATTAATA AATGGAAAAA AGTATACAAG CCAAGTTCT 24 0 

ATTACATCGA CCATTAAATA CTCATCATCA TACTTAATAA CTCTGTATTT CGGATTTTTA 300 

TTGATAATTT CGGTTTCACA AAGCAATAAT TATCACTTCC TATTAATAAC AAAT T C AC AC 360 

AGCTATATTA TCGTTTCTGT TTCTCAAAAC ACTTGTGCCC TTTTGTATCA AAATTAATAT 4 20 

TTATATGACC T T AAAT AT AT AACATGAATC TTTTTGTCTA TTATTGAAGA CAT ATT TATA 4 80 

AAGAAAAATA GCATTGTCAT AATAACCCAA GCAATAAATA CTATAATATT TTGGATAGAT 54 0 

AAACTAATCA rrACATCTAA GAACATGATT gATAATCCAC CACAGAAAAA ATAAGAAAAT 600 

AGTACAAAGC AAAGATTCTT GAATGATGGA AAAATCATAATTTTTCCATT GCTACTCCGA 6 60 

TCATTATAGA TAGATAACTT TACTTTCTGA TTTAAATATA TATAAAACAC TAGAATACTT 7 20 

AATAATAAAA CCGAACAAAT GATAATAACG CAATTTTTTT CTAAATGAGA ATCAGGTATA 7 80 

TATATTTTAT CTCTAAACAT AGTGCCAAAT AAAAGTATGC TACCTATAGC TGGCCATAAA 840

GCTTTaTTTT TAACTGGTTT GACAATATTT AAATTATCAA AATCTTCTCT GCTGATTTGG 900
ACATATTTTT TTGGTATTAA CCAATTAATA AACGGAAAGA ACAAAACTAA CCAGGTGCTT 960

ACTAAATCAA T CATC AG AT A GTCGTTTTTA TATTTAATAA TTCTATATCT GGGATTTTTG 102 0 

TTTACAACTC TAACCTCGCA AA3CAATATC TCCACTTCCG TCTCGTTGGT TTTATATCTA 1080 

ATACACTTTC AGATACTTTA TAAGTGTTTT GTATTTTAGT AACATACTAT TTTCCTGTTT 114 0 

ATTACTTAAC TTACGAACTA CAATCTAAGT TTAGTAATTT CTATTGCTTT TTAAGTTTGG 1200 

CATAAACCTT TTTATTACTA ATTGAGCCCA TGCTTATTAG AAAGAAAAA ATTGTAATAA 12 60 

TAATCCACAT AATAAATACC AG TAG AT T T T GAGGTTTTAT AGTCATTAGC CAT AT T AAAA 1320 

ATAATATAGA ACAACCTCCT AATAATAGAT ATGTGAAAAC T AT AAAACT T CCATCTTTAA 1380 

AAGTAGGCAC TAATATAACC CTATTTTCAT TATCTAGATT ATCATCATAT ATCTTTAGTT 14 4 0 

TAAGCTTTTT ATTTAAGTAA ATGTAAAATG CTGCAATACC TATAAATCCT ATAAAACATA 1500 

AAGATATTAA AATCTTATTA TCTAATTGAA CTTCAAACGT ATGTACATAT TTCCGTAAAA 15 60 

TAACTACAAA TAAAAACGAA C T AC C AG T AA CTGGCCAGAA AATATTATTT TTATTTTGTT 1620 

TATCAACATT TAAATTTTCA AGTTCCTTCT CACTAAGTTT TGCATACCTT TTGGGAATGA 1680 

ACCAATTAAT AAAAGGAAAA AAGTATACAA GCCAAGTGCT TACTAAATCA ATTAACAAAT 17 4 0 

ACTCATCATT AT AT T GAACG ACTTTATATC TCGGATTTTT ATTAATAACC TTAATATTAA 1800 

AAAGCAAAAC TCACCACGCC CATTTCATTG GAT T TAT AT G ATTGCTAATA ATATTTTAG 18 60 

CTTCACTAAC AGCATTCCCA ACACTATCCA TGGATTTTTC TGTAGTTTTT TTAACAACAT 192 0 

CT AT ACT ATT ATCGATTTTA TGCCCTACCC AGTCTACTTT ATCTTTTAAT CCAAAAATAT 198 0 

TATTTTGATA AATTAAATCT GTTCCTAATG CAAATACTGT ACTCATAGCC AAACCTGCTA 2040 

AAATCACCCA TCCTACTGGA TTACTTCCTA AAACAAAAGT CGCTAATCCA GCTCCAACTG 2100 

CTGTCCCTGC AGATCCAGCT GCAAGCGTgC ATACCATTAT GCGACAACGC CTCTCCAAAT 2160 

GTTTTATCAT CATTTGCAAG GTCATCATAC ATACCTATAC CAAATCCTAC ACCAGCAACA 2220 

CCTTTACCTA GGTATTTTCC GCCTTTTGCA AATTTACTAC CATTTTCTAT AAAC AC AT T A 2280 

CCTGATGTAC GTTTGACTTC CACAAATGAA TTTGGACCTG CTGGGCCTTT CACTCCACCT 2340 

GCTGTATTGa TAAATACACC GAATTTACTT GcATTTATAC CGTCTTGCTC TAAAAGTGTT 2400 

GACGTAATAT CTAATCCTAT ATCTCTTTTA ATACTGTCTT TATTGTCATT TATATATTTC 24 60 

AATATACTTT TCGGGATATC GTCTTCTGGA TGTTCTTTGG CATATGCCTT TATAACAGCA 2520 

AAGTCTGCTT TATTTAAAGT TTCTTTCTCT GCTTTATGTT CAATTTTCCC CATAGCAACT 2580 

TTCAAATATT TTTCATGACT TGCTTTGGCC CAATCAAGTT CTTTACCTGA AGGAATATTA 2 64 0 

AATTGATTTG TTGAAAAGTT CCAAAAATTC TGCGCTTGGG TAAGTCCTTG TTGGACAATT 27 00 
TTTTGAAATT CTTCAACTTC TTTAAATATT TCTGGTGATT TTTGATTAAA CTCACGCAAT 2760

TTGCGTAGCT TCTCTTCTAA TTCATGTTTT TGTTGACCTA ATGTTCGTAT TATTTGTTGG 2820 

TTCGATGAAA TGGCTTGCTG ATTATCGGAA GCATGCTTTT TCAATTGTT ATTCAAATTT 2880

TCATATCGCG TAATTTGTTG ACTTAATGAT CTGATATCTT CTTCAAGCTC TGATTCTTTT 2940

AAAGATATGC TATCAACCTC ACTCGTATAA CGTGACACAA AATTaTCGCA AGCTTGCTTC 3000

GTTAAATCAC TCAATGTTTT CATACTTGTT GATAATGGAA TTAACACCGT ACTAAAAAAT 3060

TGCTTAGCTG ACGTATACGC TTTCCCTTTA AGCGCATCAT CATTAATAAA TTGAGTAATT 3120

GCTTTTTCCA ACGCATCATA ATTTGAATTC ATTGTTTGAC TCAAATTCCC CACACTTGAA 3180 

GCTTGGTTTC GAGATCTGTC TAAATACATG TCAATACTCA TCGGCATGCT CCTTTTTCAA 3240 

AAATATATGA TTTTCAAACT ATTTAIAATC AAATGCTTTT TACATCTACA AAGTTGTAAA 3300

ATTTTAAAAC TCGGCGATGA TTATTTCTTA TGTAAAGGAG TCTAGATGCA GGTAAATTGA 3360

GATAACATGT CGCCTTTTTT CTTATTTTAG CATATGGATA TAATGGTGTC TTTGTATATT 3420

CGCAATTAAT CAATAAAAAT TATCTTTCAA TATTTTAATT TTATTGCGAC ACATCCTTA 3480

ACATTAAATA TATTAATATC TCAAAATATA TTCACTATTA AAATATGTCA TCAGTTGTTA 3540

AAAGTATTTC CTCATCATGC GAAATATCAA AACGTATCTA AAATACGAAT AAGTTTATAC 3600

AATCACACAA CATCATCATT CAAAATTTTA TTG 3633
(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2365 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

TGATACGAAt GCATTACAAT TCATATGCAA CATACAATTC CTTCTACAGC AAATGAAGTG 60 

AAACAAATAG TTGATGTGAC ATCTGTAGCA GAAAATGATA CGCATTAGTC ATAAAATTAA 120

ATGGAAATGT CGATGAAGTG TATCAGCAAT TACAGCGATT AATTAAGAAT GCTAATGTCG 180

AAGAGAGTGA GAATACTGAC AATATTAATA GTCAAGATAC AAGTTATACA CCTCAAGTAA 240

AAGTAACAAC ACCAATTTTA GTGAAAGCAC CAATCGCTGG TCGTCGTATT TTACTTAAAG 300

AAGTAAGAGA TTCAATTTTT AGAGAGAAAA TGGTAGGTGA AGGCTTAGCA ATCAAAGCTC 360

ATGAAGAATC CAAAGTAATC GCACCGTTCA ATGGTTTAAT ATCTATGATT GTACC&CTA 420 

AGCATGCAGT TGGTATTCAA TCAGAAGACG GTGTGGACAT AGTCATTCAT ATTGGCGTGA 480

ATACAGTTGA CTTGGAAGGT AAAGGGTTCA AGTGCTTTGT AAAGCAAAAT GATCATGTTG 540

AAGCAGGGCA AACGTTGTTG CAATTCGACC AGCAATATAT ACAACAACAA GGCTACAATG 600

CTGACGTTAT TGTCGTTATT AGCAACTCTG CCGATTTAGG AAAAGTAGAA CTGACAATGA 6 60 

ATGAAATCAT TACGACTGAA GATGTTATTT TTAAAATATT T AAAAAC TAG GAGTGTGTTG 7 20 

TAATAATGAC AAAATTACCG CAAAATTTCA TGTGGGGTGG CGCTCTTGCC GCAAATCAAT 780 

TTGAAGGTGG ATATGATAAA GGTGGTAAAG GGTTAAGET AATTGATGTT ATGACGAGTG 84 0 

GTGCACATGG CAAAGCACGT C AG AT T AC AG AATCTATAGA TCCCAATCAC TATTATCCAA 900 

ATCATGAAGG TATTGATTTT TAT CATCGTT ATAAGGAAGA TATTGCCTTG T T T AAAG AAA 960 

TGGGATTGAA ATGTTTACGT ACGTCGATTG CGTGGACACG TATCTTTCCG AATGGGGATG 102 0 

AAGATGTGCC AAATGAAGAA GGACTCGCCT TTTATGATCG TATCTTTGAT GAATTAATTG 1080 

CACAAGGTAT TGAACCTGTT GTGACGTTAT CACATTTTGA GATGCCACTT CATTTAGCGA 1140

AACATTATGG TGGATTTAGA AATAGAGAAG TTGTCGATTA TTTTGTGCAT TTTGCGCGTG 1200 

TTGTATTTGA AAGATATAAA GATAAAGTTA CATATTGGAT GACGTTTAAT GAAATTAATA 12 60 

ATCAGATGGA C AC AT C AAAT CCTATCTTTT TATGGACGAA TTCTGGGGTA GCATTGACAG 1320 

AAAATGATAA TCCTGAAGAA GTCyTGTATC AAGTAGCACA TCATGAACTT TTAGCCAGTG 1380 

CyTTAGCAGT TCGTCTTGGT AAAGaGATtA ATCCgAaGTT TAAGffiTGGr ACmATGATTt 14 4 0 

CAmaTGTACC CmTTTATCCa TAwTCGTGTC ATCCGAAAGA TATGATGGAA G C AC AAAT T G 1500 

CGAATCGCTT ACGTTTCTTT TTCCCGGATG TCCAAGTGAG AGGTTATTAT CCAAGCTATG 15 60 

CTAAAAAAAT GTTGGCACGA AAAGGATATG ATGTTGGATG GCAAGAAGGG GACGACAGTA 1620 

TTTTACAGCA GGGCACGGTT GATTATATTG GCTTTAGTTA TTACATGTCT ACGGCTGTAA 168 0 

AACATGATGT T GAT ACT AC A GTTGAAAACA ACATCGTCAA CGGTGGTTTG AATCATTCTG 17 4 0 

TGGAGAATCC GCATATCGCA AC GAG T GAT T GGGGTTGGGC GATTGATCCA GATGGCTTAA 1800 

GATATACATT GAATGTGTTA TATGATCETT ATCAGTTACC ACTTTTTATT GTGGAAAATG 18 60 

GTTTTGGTGC AG T T GAT G AA GTGGTAGATG G AC AT AT T C a TGATGATTAT CGCATTGAAT 192 0 

ATTTAAAAGC ACATATTACA GCAGCGATAG AAGCAGTTGA TCAAGATGGT GTAGATTTAA 198 0 

TCGGTTATAC ACCGTGGGGA AT CAT T GAT A TTGTTTCATT TACAACCGGT GAATGAAGA 2 04 0 

AACGCTATGG TTTAATATAT GTTGATCGAG ATAATGATGG TCATGGCACG ATGGAACGCT 2100 

TGAAAAAAGA TTCGTTCTAT TGGTATCAAC AAGTGATAGC ATCAAATGGA GATAAATTAT 2160 

AAAGGTATAT TATAAGTATT TTAGGGTTAG AGCCCGAGAC AT AAAT T AAT ATAGTAGGAC 2220 

CTACAGTGTT ATAATGGCGG gCCCCCAACA CAAAGAATTT CGAAAAGAAA TTCt AcAGGT 22 8 0 

aATGCaAGtT GGCGGGGcCC AACACAGAGA AATTCGAAAA GAAATTCTAc AGGTAATGCA 2340

AGTTGGGGAA GGACAGAAAT AAATT 2365



(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11050 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

CTGCGATACG ATTTGTTGAA AGTGGGGAAA ACAAAAAAGT TATCATTACC AATTTAGAGC 60 

AGGCATACGA AGCTTTGATT GGTAATAAAG G T AC AC AC AT T C AC AT G T AG CACTTTATCA 120 

CGCGACAAAA CATTAAATAT GTTTCTCCGT TGATTCAAAT GAAAAAGTTG TCTGCTGACA 180 

CTTTGCAAGG TTTGAAGGAG TTTAACTTAT GACAGAAAAC TTTATTTTGG GTAGA&TAA 24 0 

TAAATTAGAA CATGAACTAA AGGCATTAGC AGATTACATT AATATACCAT ATAGTATATT 3 00 

ACAACCATAT CAAAGTGAAT GTTTTGTCAG ACATTATACG AAAGGCCAAG TTATTTATTT 3 60 

TTCGCCACAA GAAAGTAGCA ATATTTACTT TTTAATTGAA GGTAACATTA T T AGAGAAC A 4 20 

TTACAATCAA AATGGAGATG TATATCGTTA TTTTAATAAA GAGCAAGTAT TATTTCCAAT 4 80 

CAGTAACTTA TTTCATCCGA AAGAGGTTAA CGAATTGTGT ACAGCATTAA CCGATTGTAC 540 

AGTTCTTGGA TTGCCTAGAG AATTGATGGC CTTTTTGTGC AAAGCTAATG ATGATATATT 600 

TTTGACACTT TTTGCATTAA TAAATGATAA T GAG C AG ©A CACATGAACT ATAACATGGC 660 

ATTAACAAGT AAATTTGCTA AAGATCGAAT TATCAAATTG ATATGCCATC TATGTCAGAC 7 20 

AGTAGGATAC GATCAAGATG AATTTTATGA AATCAAACAG TTTTTAACTA TTCAACtCAT 7 80 

GAGTGATATG GCTGGGATTT CCCGGGAAAC AGCTGGTCAT AT TAT T CAT G AACTTAAAGA 840 

TGAAAAACTT GTTGTTAAAG ATCATAAAAA TTGGTTAGTA AGCAAACATT TATTCAATGA 900 

TGTATGTGTT TAATATACAA TGTAAAATGA ATAAGTTGAA CATGAGGTCT AACGTACATT 9 60 

TATACGTTAG GCCTTTTTTG CTAGCATGAT GAATAATTTA AAATGTTAGT TAAATTTGAT 1020 

TGTTGAAATT ACAGTAAAAT TTAAGGTGAT GAAAAATTTA GAACTTCTAA GTTTTTGAAA 1080 

AGTAAAAAAT TTGTAATAGT GTAAAAATAG TATATTGATT TTTGCTAGTT AACAGAaAAT 1140 

TTTAAGTTAT ATAAATAGGA AGAAAACAAA TTTTACGTAA TTTTTTTCGA AAAGCAATTG 1200 

ATATAATTCT TATTTCATTA TACAATTTAG ACTAATCTAG AAATCEAAAT GGAGTAATAT 12 60 

TTTTGAAAAA AAGAATTGAT TATTTGTCGA ATAAGCAGAA TAAGTATTCG ATTAGACGTT 1320 

TTACAGTAGG TACCACATCA GTAATAGTAG GGGCAACTAT ACTATTTGGG ATAGGCAATC 1380 

ATCAAGCACA AGCTTCAGAA CAATCGAACG ATACAACGCA ATCTTCGAAA AATAATGCAA 1440

GTGCAGATTC CGAAAAAAAC AATATGATAG AAACACCTCA ATTAAATACA ACGGCTAATG 1500

AT AC AT C T G A TATTAGTGCA AACACAAACA GTGCGAATGT AGATAGCACA ACAAAACCAA 15 60 

TGTCTACACA AACGAGCAAT ACCACTACAA CAGAGCCAGC TTCAACAAAT GAAACACCTC 1620

AACCGACGGC AATTAAAAAT CAAGCASCTG CTGCAAAAAT GCAAGATCAA ACTGTTCCTC 168 0 

AAG AAG C AAA TTCTCAAGTA GATAATAAAA CAACGAATGA TGCTAATAGC ATAGCAACAA 174 0 

ACAGTGAGCT TAAAAATTCT CAAACATTAG ATTTACCACA ATCATCACCA CAAACGATTT 1800 

CCAATGCGCA AGGAACTAGT AAACCAAGTG TTAGAACGAG AGCTGTACGT AETTAGCTG 18 60 

TTGCTGAACC GGTAGTAAAT GCTGCTGATG CTAAAGGTAC AAATGTAAAT GATAAAGTTA 192 0 

CGGCAAGTAA TTTCAAGTTA G AAAAG AC T A CATTTGACCC TAATCAAAGT GGTAACACAT 198 0 

TTATGGCGGC AAATTTTACA GTGACAGATA AAGTGAAATC AGGGGATTAT TTTACAGCGA 204 0 

aGTTACCAGA TAGTTTAACT GGTAATGGAG ACGTGGATTA TTCTAATTCA AATAATACGA 2100 

TGCCAATTGC AG AC AT T AAA AGTACGAATG GCGATGTTGT AGCTAAAGCA ACATATGATA 2160 

TCTTGACTAA GACGTATACA TTTGTCTTTA CAGATTATGT AAATAATAAA GAAAAT AT T A 2220 

ACGGACAATT TTCATTACCT TTATTTACAG ACCt&GCAAA GGCACCTAAA TCAGGAACAT 2280 

ATGATGCGAA TATTAATATT GCGGATGAAA TGTTTAATAA TAAAAT TACT TATAACTATA 234 0 

GTTCGCCAAT TGCAGGAATT GATAAACCAA ATGGCGCGAA CATTTCTTCT CAAATTATTG 2400 

GTGTAGATAC AGCTTCAGGT CAAAACACAT ACAAGCAAAC AGTATTTGTT AACCCTAAGC 2 4 60 

AACGAGTTTT AGGTAATACG TGGGTGTATA TTAAAGGCTA CCAAGATAAA AT C G AAG AAA 2520 

GTAGCGGTAA AGTAAGTGCT ACAGATACAA AACTGAGAAT TTTTGAAGTG AATGATACAT 2 580 

CTAAATTATC AGATAGCTAC TATGCAGATC CAAATGACTC TAACCTTAAA GAAGTAACAG 2 64 0 

ACCAATTTAA AAATAGAATC TATTATGAGC ATCCAAATGT AGCTAGTATT AAATTTGGTG 2 7 00 

ATATTACTAA AACAT AT GT A GTATTAGTAG AAGGGCATTA CGACAATACA GGTAAGAACT 27 60 

T AAAAAC T C A GGTTATTCAA GAAAATGTTG ATCCTGTAAC AAATAGAGAC TACAGTATTT 2 820 

TCGGTTGGAA TAATGAGAAT GTTGTACGTT ATGGTGGTGG AGTGCTGAT GGTGATTCAG 2880 

CAGTAAATCC G AAAGAC CCA ACTCCAGGGC CGCCGGTTGA CCCAGAACCA AGTCCAGACC 2940 

CAGAACCAGA ACCAACGCCA GATCCAGAAC CAAGTCCAGA CCCAGAACCG GAACCAAGCC 3000 

CAGACCCGGA TCCGGATTCG GATTCAGACA GTGACTCAGG CTCAGACAGC GACTCAGGTT 3060

CAGATAGCGA CTCAGAATCA GATAGCGATT CGGATTCAGA CAGTGATTCA GATTCAGACA 312 0 

GCGACTCAGA ATCAGATAGC GACTCAGAAT CAGATAGTGA GTCAGATTCA GACAGTGACT 318 0 

CGGACTCAGA CAGTGATTCA GACTCAGATA GCGATTCAGA CTCAGATAGC GATTCAGACT 3240

CAGACAGCGA TTCAGATTCA GAG&GCGACT CAGATTCAGA CAGCGACTCA GACTCAGATA 3300

GCGACTCAGA CTCAGACAGC GACTCAGATT CAGATAGCGA TTCAGACTCA GACAGCGACT 3360 

CAGACTCAGA CAGCGACTCA G AC T C AG AT A GCGACTCAGA TTCAGATAGC GATTCAGACT 34 20 

CAGACAGCGA CTCAGATTCA GATAGCGATT CGGACTCAGA CAGCGATTE GATTCAGACA 34 8 0 

GCGACTCAGA CTCGGATAGC GAT T C AG AT T CAGATAGCGA TTCGGATTCA GACAGTGATT 354 0 

CAGATTCAGA CAGCGACTCA GACTCGGATA GCGACTCAGA CTCAGACAGC GATTCAGACT 3600 

CAGATAGCGA CTCAGACTCG GATAGCGACT CGGATTCAGA TAGCGACTCA GACTCAGATA 36 60 

GTGACTCCGA TTCAAGAGTT ACACCACCAA ATAATGAACA GAAAGCACCA TCAAATCCTA 37 20 

AAGGTGAAGT AAACCATTCT AATAAGGTAT CAAAACAACA CAAAACTGAT GCTTTACCAG 3780 

AAACAGGAGA TAAGAGCGAA AACACAAATG CAACTTTATT TGGTGCAATG ATGGCATTAT 3840 

TAGGATCATT ACTATTGTTT AGAAAACGCA AGCAAGATCA TAAAGAAAAA GCGTAAATAC 39 00 

TTTTTTAGGC CGAATACATT TGTATTCGGT TTTTTTGTTG AAAATGATTT TAAAGTGAAT 3 9 60 

TGATTAAGCG TAAAATGTTG AT AAAG TAG A ATTAGAAAGG GGTCATGACG TATGGCTTAT 4 020 

ATTTCATTAA ACTATCATTC ACCAACAATT GGTATGCATC AAAATTTGAC AGTCATTTA 4 08 0 

CCGGAAGATC AAAGCTTCTT TAATAGCGAT ACAACTGTTA AAC CAT T AAA AACTTTAATG 414 0 

TTGTTACATG GATTATCAAG TGATGAAACG ACAT AT AT GA GATATACAAG CATAGAAAGG 4200 

TATGCGAATG AACACAAATT AGCTGTGATT ATGCCCAATG TGGATCATAG CGCATATGCT 4 2 60 

AACATGGCAT ATGGTCATAG C TAT TAT GAT TATATTTTGG AAGTGTATGA TTATGTTCAT 4 32 0 

CAAATATTTC CACTTTCCAA AAAGCGTGAT GACAATTTTA TAGCAGGTCA CTCTATGGGA 4 380 

GGATATGGCA CAATTAAATT TGCATTAACA CAAGGGGATA AATTTGCCAA AGCTGTACCA 4 4 40 

TTATCTGCTG TGTTTGAAGC GCAAAATTTA ATGGATCAG AGTGGAATGA TTTTTCAAAA 4 500 

GAGGCCATAA TTGGCAATCT TTCAAGTGTT AAAGGAACTG AACATGATCC GTATTACTTG 4 5 60 

C TAG AC AAAG CTGTAGCTGA AGATAAACAA ATTCCAAAAT TGCTCATTAT GTGTGGTAAA 4 620 

CAAGACTTTT TATATCAAGA CAACTTAGAT TTTATCGATT ATTTATCACG CATAAATGTT 4680 

CCTTATCAAT TTGAAGATGG ACCAGGAGAT CAT GAT T AT G CATATTGGGA TCAAGCGATT 4 74 0 

AAGCGTGCTA TAACATGGAT GGTGAATGAT TAATTATTTC TTGGAAAATA TGTGGCTGCA 4800 

TTAAATACAC AG AG T GAGAG ATACAAACTA TTTACGCACG ACTAACATTT CTAAGTGTTT 4 8 60 

AAATTATTTT TGTATTAATA TGATTGGCGC AATTTGCTGA TACACAAAAA TGTTTCTCGT 4 920 

G AAAC T TAG A TTTAGCTTAT AGTTTTATCA TCATTTGTAT GACTTACATT ATAAATTTTA 4 980 

TTATAATGAG GTTAACGCTT TGAAAGGAGT CATCATCATG TCGACCAATA AAAAC GAT T A 5 04 0 

TGAGCATATG TTGTTTTATT TTGCATATAA AACCTTTATT ACTACGCTG ATGAAATTAT 5100

AGAGAAGTAT GGTATGAGTC GTCAGCATCA TCGTTTTTTG TTTTTTATCA ATAAATTACC 5160

TGGTATTACT ATTAAATCAT TACTAGAAAT ATTAGAAATT TCTAAmCAAG GAT CACATGC 5 22 0 

AACACTTCAA AAATTAAAAG AGCAAGGTCT CAT TAT T G AA AAAGTTTTAG AGACTGATCG 52 8 0 

ACGTGTCAAA AAATTATATT CGACGGATAA AGGCGATCAA CTCATTGCTG AATTGAACAA 534 0 

GGCGCAAGAT GAATTATTGC AAAATATATA TCAACAAGTC GGTTCGGATT GGTATGATGT 5 4 00 

GATGGAAGCA TTGGCTAAAG GgCGACCTGG cTTTGATTTT ATTAAGCATT TGAAAGATGA 54 60 

AAAAGAAAGC TAGCATCAGA AATGTTiAAA ATCTTCGCAT TCTTAAATTT AAAAAATATG 5 520 

TCAAAAAGTG TATAATAAAA ACATATAATT TAATTGAACT CAGTTTCAAC ACATCTTAGA 5580 

AAGGAGTTTG AAT GAT G AAA AAAT TAGCAG TTATTTTAAC ATTAGTTGGC GGTTTATACT 5640 

TCGCATTTAA AAAATACCAA GAACGTGTTA ACCAAGCACC TAACATTGAG T&TAAATTA 57 00 

AACCATAAAA AATTCCCGAA CACCTTGTTA TAGTGCTCGG GAATTTTTTT ATGCTTTACT 57 60 

TGAATATATC AAAT AT TAT T TTTGCGCTTT CTGTATTTTC GATATTACCA C T AAAT GAT T 5820 

CTGATCTAGG TCCGTAAGCG TAgGTATTAA CATCCTCGCC TGTATGTCCA TCGGAAGTCC 58 8 0 

ACCCTGTATA AGATTTATCA TTTACTGGCT TCTGAATAGC GTGTTGTAGG GCTTTTGTTT 594 0 

GCGTTTCTAC TTCTGCGGAT TTTTCGTCTT TTTCTTTTTT AAGTAGTCTT TTTAGCTTTT 6000 

TATTCTCTTT TTTAACCTTT TTCATATCAT CTTGTGAAAA TTCAAATCCA TAACCTTCAT 6060 

TAATAACTTT TTCAGGGTCT TCACCTTTAG CCATTTTTC TGTCATATAT GATCCAGAGT 612 0 

GTTTCATAGA TTTAATCGGT TGAGGATTCC ATTCGTATCC TTTATCTTTA CCAATTGTTA 618 0 

GACCACCAGT AGAGTGATCT GCTGTTGCAA CGACAAGTGT ATCTTTATGT TTCTTAGCAT 62 4 0 

ATTGAATGGC GTCATCGAAT GCTTTTTCAA AACCTTCCAT TTCAGACATA ACGCCTGTAA 6300 

TATCGTTGGA ATGCGCTGAT TTATCTATAG AAGCACCTTC G AC CAT T AAA AAGAATCCTT 63 60 

TTTTATTGCG CTCAAGCTTA CTAAGTGCAC TTTGTTGCAT ATCAGCTAAT GATGGTTCGT 6420 

CTTTAGAAGC ATCTATTGCA AGTGGCATAT TTTTATCTGC AAACAAACCA AGAACTTTAT 64 8 0 

CTTTATCAGA TTTTGATAAC TCCTTACTGT TCGTGGCAAG GTCGTAACCA TCTTTTTTGA 65 4 0 

ATTTTTTATC TAAATTGCCA TTACTTTTAC CGAAATATTT AGCGCCGCCG CCTAATAAAA 6600 

CATCAACTTT ATGCTTTCCG TTGATTTTAT CTTTATAAAA TTGTTTAGCG ATTTCGTTTT 6660 

TATCATCTCT AGAAGTCACG TGTGCAGCAT ATGCTGCTGG ETTGCATCT G T T AAT T C AG 6720 

CTGTTGAAAC AAGACCAGTC GACTTACCTT TTTCTTTTGC ACGTTCAAGC ACCGTCTTTA 67 8 0 

CTTTCTGCTT GTTACTGTCA ACACCGATGG CACCATTATA TGTCTTATGA CCAGAACTAA 68 4 0 

AGGCTGTTCC GCCAGCTGCA GAATCAGTAA TATTCTGTTT TGGGTCATTT GAATATGTAC 6900

GATTTGTGCC TTTTAAATAT GAATCAAAAG CAGTAGGGGT CATTTCTTTA GCATGCGGAT 6960

CATTTTTATA AT AAC GAT AA GCTGTGTTAA ATGATGGACC CATGCCATCG CCAACTAAAA 7 02 0 

AGATAACATT TTTTGGATTT T TAG TAT T AC CAACCGCGAA ACTTTCATCT T TAG AAC T T T 7 08 0 

TATCGGATTG CGCAATTGCA GGTGTGACAG AACTAAAAAC CGTTGACACG ATAATAAGGT 714 0 

TAGCAACTGC AAATTTTGTG GCTTTTTTAA CT GAT AAC AT AAGACATCCT CCTGAGTATA 7200 

TGACTATGTC TTCAGTGTAA AAGAGGAATT TtGAGCAATT ATGTAGTTTT AGTTAnAAAT 7 2 60 

ATGTAAACAG AGTGATTTAG AATAACAAAA aATGAATATA TATGACAAT TGTTATAGAA 7 320 

AGCGTTAGAA TAGAAGCGTG TGAAAATATA GAATTAAATA TAATTTGAGG TGGAAAAATG 7 38 0 

ATACTAGTAA TGTTATCTCC ATTATTAATC ATATTCTTTA TAGTGTTGTC TATTTTAGAA 7440 

GAGCGTAAAC GTACGAAGAA AAAGCAACTC GAGAAAGAAA AAGCAAATAC ACTAAATCAA 7 50 0 

AATACAAATG ACACGGAAAG TTCAAATCAA GAGCCGTCAT TGCAGCAGGA TAAAGAACAA 7 5 60 

AAAGATAACA AAGGATAATT CAATTGAAGG AAGAAGATTA TAGATGAAAA TATTAATTGT 7 62 0 

TGAAGATGAT TTTGTTATAG CAGAGAGTTT AGCATCTGAA CTTAAAAAAT GGAATTACGG 7 68 0 

TGTTATTGTC GTTGAACAAT TTGATGATAT ACTGTCTATC TTTAACCAAA ATCAACCTCA 774 0 

GCTTGTATTG CTAGATATTA ATTTGCCAAC GTTAAATGGT TTTCATTGGT GTCAAGAAAT 7 8 00 

CCGAAAAACA TCTAATGTGC CAATTATATT TATTAGTTCC CGTATTGATA ATATGGACCA 7 8 60 

AATTATGGCA ATACAAATGG GGGGAGATGA TTTTATCGAA AAGCCATTTA ACTTGTATT 7 92 0 

AACGATTGCC AAAATTCAAG CATTATTGAG ACGAACTTAT GACTTGTCAG TAGCTAATGA 7 980 

TTCATTGACG GTTAAAGGTT GTACATTAAT ATTAGATGAA GCAAAAGTCG TGTATCAAGA 8 04 0 

ACAAAACATA CAGCTATCTT TGACTGAATT ACAAATATTA AAGT T AT TAT TTCAAAATGA 8100 

AGaTAAATAT GTAAGTAGrA CTGCTTTAAT TGaAAAATGT TGGGaATCAG AAAACtTCAT 8160 

AGATGATAAC ACATTAGCTG TTAACATGAC GCGCCTGCTG AAAAAATTAA ATACTATTGG 8 22 0 

CGTTAATGAT TTTATCATTA CAAAGAAAAA TGTCGGATAT AAAGTATAGG GTGAATGCAA 828 0 

TGACCTTTCT TAAAAGTATT AC T C AG G AAA TAGCAATJST CATAGTTATT TTTGCTTTGT 8 34 0 

TTGGCTTAAT GTTTTACCTG TATCATTTGC CAT T AGAAGC ATATTTACTA GCACTTGGCG 8 4 00 

TTATTTTATT ATTATTACTC ATATTCATAG GTATTAAATA TTTAAGTTTT GTAAAAACTA 8 4 60 

TAAGCCAACA ACAACAAATT G AAAAC T TAG AAAATGCGTT GTATCAGCTT AAAAATGAAC 8 520 

AAATTGAATA TAAAAATGAT GTAGAGAGCT ACTTTTTAAC ATGGGTACAT CAAATGAAAA 8 58 0 

CACCCATTAC TGCAGCACAA CTGTTACTTG AAAG AG AT G A GCCTAATGTT GTTAATCGTG 8 64 0 

TTCGTCAAGA GGTTATTCAA ATTGaTAACT ATACAAGTTT AGCACTTAGT TATTTAAAGT 8 7 00 

TATTAAATGA AACTTCTGaT ATTTCTGTCA CTAAAATTTC GATTAATAAT ATCATTCGCC 8760

CAATTATTAT GAAATATTCA ATACAGTTTA TTGATCAAAA AACAAAAATC CATTATGAAC 8820

CTTGTCATCA CGAAGTATTA ACTGACGTTA GATGGACCTC TTTAATGATA GAACAATTAA 8880 

TAAATAATGC ACTTAAGTAT GCGAGAGGTA AAGATATATG GATTEATTT GATGAGCAAT 8 94 0 

CCAATCAATT ACACGTAAAA GATAATGGTA TCGGTATTAG TGAAGCGrAC TTGCCTAAAA 9000 

TATTTGATAA GGGCTATTCA GGTTATAATG GCCAGCGCCA AAGTAACTCA AGTGGGaTTG 90 60 

GTTTATTTAT CGTAAAACAA ATTTCAACAC ACACAAACCA TCCTGTTTCA GTCGTATCTA 9120 

AACAAAATGA GGGTACAACA TTTACGATTC AATTTCCAGA TGAATAAAAA CTTTCAATAT 918 0 

TGTAAGTATA C TAG T AAC AT TTTTTTACTA ATTTAAATGT TATTAGTATT TTTTTGTTTT 92 4 0 

AATATAGAAC TAACAAAGAA ATGAGGTGCA TGCCATGTTG CTAGAAGTGn AACATGTAAA 9300 

AAAGGTTTAT GGTAAAGGTT TGAATGOAC GACAGCACTT AATCAAATGA ATT TAT C AG T 93 60 

TGGAGCTGGT GaATTTGTTG CaATTATGGG TGAGTCTGGG tCAGGGAAGT CTACACTACT 9420 

AAATTTAATT GCtTCTTTTG ATGGACTAAC TGAAGGTGAC ATTATTGTGG ATGGCGCACA 94 8 0 

TTTAAATAAT AT G AAAAAT A AAAGTAAAGC ATTGTATCGT CaACAAATGG TQ.GTTTTGT 95 4 0 

TTTTcAAGAT TTTAATCTTT TACCAACAAT GACGAATAAA GAAAATATAA TGATGCCATT 9600 

AATTTTAGCT GGTGCTAAAC GAAAAGATAT AGAACAAAGG GTACATCAGT TGGCAGTACA 9660 

ATTACATTTA GAGGGATTCT TAAACAAGTA TCCTTCTGAA ATCTCTGGGG GTCAGAAGCA 9720 

ACGCATTGCC ATTGCACGTG CAT T AGT T AC TAAGCCGACG ATTTTACTAG CCGATGAACC 97 8 0 

TACAGGTGCA CTTGATTCAA AAAC AT C AAA GGCATTGATG ATGTTATTTC AAGAGATTCA 98 4 0 

TCAATTGGAA CAGACAATTT TAATGGTAAC TCATTCAAAT ATCGATGCGT CTTATGCAGA 9900 

GCGAGTCATT TTTATTAAAG ATGGGCGTCT ATATTATGAA AT AT AT C G T G GTGAAGAAAG 9 9 60 

TCAATTAGCT TTTCAACAAC GAATAACAGA TAGCTTAGCA CTTGTGAATG GAGGAAGTGT 10020 

CAATATATGA AGTTAAGATT GTTATGnACA TAGTGCGACG TCAATTTATT ACGCAGCGAC 1008 0 

TTGTAATCAT TCCATTCATT TTAGCGGTAA GTGTACTATT CATGATTGAA TATACGCTTG 1014 0 

TGTCAATTGG GTTAAATAGC TACATAAAAC AGAAGAATGA CTTCCTAGTA CCATTTATTA 10200 

TCATAGCTAA TTTTTTTATG GCGCTTTTAA CTTTTATTTT TATTTTCTAT GCAAATCACT 102 60 

TTATGATGTC ACAAAGACGA AAAGAGTTTA GCATTTTTAT GACATTGGGC AT GACC AAGA 10 320 

AAAGTATGCG TTTAATTGTA GTGATGGAAA CTATC.TTACA ATTTGTGATA ATTTCAGTCG 10380 

TTAGTATTGC CGGCGGATAC TTACTTGGTG CGATATTTTT CTTGTTTATA CAGAAAATAA 104 4 0 

TGGGCAGTGA AGTTGCGACG TTAAGGTATT ATCCATTTGA CTCTGTAGCG ATGTTTATTA 10 500 

CTTTGATTAT CATTGCTGTA TTAATGGGCA TGCTACTTAT ffTCAACTTG TTTAGTATTA 10560

ATTTTCAACG GCCGATAACT TATCAACATC GTTCCGATTC TAGTGTCATA TCACGATGGT 10620

TGCGTTACGT TTTAATTGTT ATAGGAAGCG CAnACTATAT TTAGGTTACT TTATTGCATT 10680 

ACAACAAGAT ACGACGTTTG GTGCCTTTTT TAAAATATGG ATTGTCATAG GATTAGTTAT 10740

TATCGGTACT TATGCATTTT TTGTAGGTAT AAGTGAAATA ATTATTAGTA TATTGCAGCA 10800 

GGTAT CAAAA GTTTACTATC ATCCACGGTA TTTTTTTGTG GTAGTTGGGA TGCGTGTACG 10860 

TCTTAAAATG AATGCAGTCA GTCTTGCAAC AATCACTTTG CTGTGTACAT TTTTGATTGT 10920 

AACGCTCACA AT G AC AT T AA CAPCCTATCG TGATATGAAT CATACCATTA CGAAATTGAT 10980 

TACGAATGAT TakGATTTGT CATTTAGCGA CAATTCTAAG TCACAAaTAG AACGTCAACA 11040 

AACAATTGAG 11050 



(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 983 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

CGACATAACG AGGCAAGGGT ACATGATACT TTAGCCTCGT TTTTGAfflTG TATTTTTCTG 60

AATATAAGGG CAATAGATGG TATTTTATAw TTTTTTTAAG GTAGTGATTA ACATAGATAT 120

TTATCAACAC TTTAGACAGG AAGAATACGA ATTAATTGAT CAGCTAACGG ATAAATGTGA 180

TCAAGCGGAA CAGCATTATG CACCAGTATT AACGCATTTT TTAGATCCAA GAGGGCAATA 240

TATATTGGAA GTGATTTGTG GCAGTTATGA AGATTTAAAC GTATCTTTTT ATGGTGGACC 300

TAATGCTGAA AGAAAAAGAG CAATCATTTC GCCGAACTAT TATGAACCTA AAGAAAGCGA 360

CTTTGAATTA ACTTTAATGG AAATAGATTA TCCTGAAAAA TTCGTCACTT TAAAACATCA 420

ACATATTTTA GGGACATTAA TGTCTTTACE TATCGAACGC GAACAAGTTG GAGATATAAT 480

TGTGaATGAA CGAATTCAAT TTGTTTTGAC AAGTAGATTG GAATCATTTA TTATGTTAGA 540

ATTACAACGT ATTAAAGGCG CATCAGTTAA ACTTTATACT ATTCCAGTAA CAGATATGAT 600

ACAATCTAAT GAGAATTGGA AAAATGAAAG TGCaCAGTTA GTTCTTTAAG GTTSATGTT 660

GTTATTAAAG AAATGATACG TAAATCACGT ACGATTGCGA AACAACTAAT CGAAAAAAAA 720

CGTGTTAAAG TGAATCACAC TATTGTTGAT TCAGCAGATT TTCAATTACA AGCAAATGAT 780

TTAATATCCA TCCAAGGTTT TGGTAGAGCA CACATTACTG ACTTAGGTGG TAAAACTAAA 840

AAAGATAAAA CGCACATTAC CTATAGAACA TTATTCAAAT AGTAATGATT TAAGGAGGAT 900

AACAAATGCC TTTTACACCA AATGAaATTA AGAATAAAGA GTTTTCACGT GTaAAGAATG 960

GTTTTAGAAC CTACTGnAGT TGG 983



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10322 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

TTTTGCAAAG CTTATTTTAT GTCAAACAGA TiGTCAATGT GAAACAAAGG TTAGTACATA 60 

TAATCATCCA GACTTTATGT ATATATCAAC AACTGAGAAT GCAATTAAGA AAGAACAAGT 12 0 

TGAACAACTT GTGCGTCATA TGAATCAACT TCCTATAGAA AGCACAAATA AAGTGTACAT 18 0 

CATTGAAGAC TTTGAAAAGT TAACTGTTCA AGGGGAAAAC AGTATCTTGA AATTTCTdA 24 0 

AGAACCACCG GACAATACGA TTGCTATTTT ATTGTCTACA AAACCTGAGC AAATTTTAGA 300 

CACAATCCAT TCAAGGTGTC AGCATGTATA TTTCAAGCCT ATTGATAAAG AAAAGTTTAT 3 60 

AAATAGATTA GTTGAACAAA ACATGTCTAA GCCAGTAGCT GAAATGATTA GTACTTATAC 420 

TACGCAAATA GATAATGCAA TGGCTTTAAA TGAAGAATTT GATTTATTAG CAT T AAGGAA 4 80 

AT C AGITATA CGTTGGTGTG AATTGTTGCT TACTAATAAG CCAATGGCAC TTATAGGTAT 54 0 

TATTGATTTA TTGAAACAGG CTAAAAATAA AAAACTGCAA TCTTTAACTA TtGCAGCTGT 600 

GAATGGTTTC TTCGAAGATA TCATACATAC AAAGGTAAAT GTAGAGGATA AACAAATATA 660 

TAGTGATTTA AAAAATGATA TTGATCAATA TGCGCAAAAG TTGTCGTTTA ATCAATTAAT 720 

TTTGATGTTT GATCAACTGA CGGAAGCACA TAAGAAATTG AmTCAAAATG TAAATCCAAC 780 

GCTTGTATTT GAACAAATCG TAATTAAGGG TGTGAGTTAG ATGCCAAATG TAATAGGTGT 84 0 

TCAGTTTCAA AAAGCGGGAA AATTAGAATA T TAT AC AC C T AAT GAT AT AC AAGTAGATAT 900 

AGAAGACTGG GTAGTTGTCG AATCTAAAAG AGGCATAGAG ATAGGTATTG TTAAAAATCC 9 60 

ATTAATGGAT ATTGCTGAAG AGGATGTTGT GTTACCTCTT AAAAAT AT T A TTCGCATTGC 1020 

T GAT G AC AAA GATATTGATA MTTTAATTG TAATGAACGA GATGCTGAAA ATGCATTAAT 1080 

ACTATGTAAA GACATTGTAA GAGAACAAGG TTTGGACATG CGTTTAGTCA AT T GC G AAT A 114 0 

TACATTAGAT AAATCGAAAG TTATTTTTAA TTTTACGGCG GATGATCGTA TTGATTTTAG 1200 

AAAAT TAG T A AAAATATTAG CGCAAC AT T T AAAAACACGT ATCGAGTGA GACAAATTGG 12 60 

TGTAAGGGAT GAAGCCAAAT TGCTTGGCGG TATCGGACCT TGTGGTAGGT CGTTATGTTG 1320 

TTCTACATTT TTAGGGGATT TTGAACCAGT ATCGATTAAG ATGGCTAAGG ATCAAAATTT 1380

ATCATTAAAT CCAACTAAAA TTTCTGGTGC ATGTGGTCGT TTGATGTGTT GTTTAAAATA 1440

TGAAAATGAC TATTATGAGG AAGTACGTGC ACAATTACCT GATATTGGTG AAGCAATTGA 1500 

AACGCCTGAT GGTAACGGGA AAGTAGTTGC TTTAAATATA TTAGACATTT CTATGCAGGT 15 60 

GAAGCTTGAG GGACATGAAC AGCCACTTGA ATATAAATTA GAAGAAATAG AAACTATGCA 162 0 

TTAAGGAGGC ATTATTACAT TTGGATCGG^ ATGAAATATT TGAAAAAATA ATGCGTTTAG 168 0 

AAATGAATGT CAATCAACTT T C AAAG G AAA CTTCAGAATT AAAGGCACTT GCAGTTGAAT 174 0 

TAGTAGAAGA AAATGTAGCG CTTCAACTTG AAAATGATAA TTTGAAAAAG GTGTTGGGCA 1800 

AT GAT G AAC C AACTACTATT GATACTGCGA ATTCAAAACC AGCAAAAGCT GTGAAAAGC 18 60 

CATTACCAAG TAAAGATAAT TTGGCTATAT TGTATGGAGA AGGATTTCAT ATTTGTAAAG 1920 

GCGAATTATT TGGAAAACAT CGACATGGTG AAGATTGTCT GTTCTGTTTA GAAGTTTTAA 1980 

GTGATTAATC AAGCACACTC AAATAGTGTT ATAATTATAA ATGAATATGG TTTGGATAAG 2 04 0 

T C T G AG AC AA TGCATGTTTC AGGCTTTAAT TGTGTATAAA GTTTTGGTGA TTGCATAAGA 2100 

GATGGCGGTA CTAAATGTTA TTATTAAGTG TGCACGCAgT ATCaTTAGTT ATAAAATGTA 2160 

GCTGTTAAAA GTCAAAAATA CATCGAATGT AGTTAGGCAT ATAATATAAA AAGAGTTTTC 2220 

AATTACTCAA TAGAAAAAGG TTGTCTTCAT AGGAGTAAA AATGTTAAAA GAGAATGAAC 22 8 0 

GATTTGATCA AC T AAT C AAA GAAGATTTTA GTATTATTCA AAATGATGAT GTTTTTTCAT 2 34 0 

TTTCAACGGA TGCTTTGTTG TTAGGGCATT T T AC AAAAC C TAGAACAAAA GATATTGTGT 24 00 

TGGACTTATG TTCAGGCAAT GGGGTGATAC CCTTGTTATT GTTTGCGAAA CATCCACGAC 24 60 

ATATAGAAGG TGTTGAGATT CAAAAAACAC TTGTCGATAT GGCGCGACGC ACATTTCAAT 2520 

TCAATGATGT TGATGAATAT TTAACAATGC ATCACATGGA TTTGAAAAAC GT T AC T AAAG 2580 

TATTTAAACC TTCACAATAT ACTTTAGTAA CGTGTAATCC GCCTTATTTT AAAGAGAATC 2 64 0 

AGC AAC AC C A ACATCAAAAA GAAGCACATA AGATAGCGAG ACATGAGATT ATGTGTACAC 27 00 

TTGAAGATTG CATGATTGCA GCCCGTCATT TATTAAAAGA AGGTGGCAGG CTAAACATGG 27 60 

TACATCGTGC AGAGAGACTA ATGGATGTCT TGTTTGAAAT GAGAAAAGTG AATAT TGAAC 2820 

CTAAGAAAGT CGTTTTTATA TATAGTAAAG TAGGGAAATC AGfiCAAACG ATAGTAGTAG 2880 

AAGGTCGAAA AGGTGGAAAT CAAGGT T TAG AAATCATGCC CCCATTTTAT ATTTATAATG 2 94 0 

AAGATGGTAA TTATAGCGAA GAAATGAAGG AAGT AT AT T A TGGATAGTCA TTTTGTATAT 3000 

ATTGTAAAAT GTAGTGATGG AAGTTTATAT ACAGGATACG CTAAAGACGT TAATGCACGT 3060 

GTTGAAAAAC ATAACCGAGG TCAAGGAGCC AAAT AT AC GA AAGTAAGACG TCCGGTGCAT 312 0 

TTAGTTTATC AAGAAATGTA T GAG AC AAAG TCTGAAGCAT TGAAGCGTGA ATATGAAATT 3180 

AAAACTTATA CCAGACAAAA GAAATTGCGA TTAATTAAGG AGCGATAGTA TGGCTGTATT 324 0 






ATATTTAGTG GGCACACCAA TTGGTAATTT AGCAGATATT ACTTATAGAG CAGTTGATGT 3300 

ATTGAAACGT GTTGATATGA TTGCTTGTGA AGACACTAGA GTAACTAGTA AACTGTGTAA 3 3 60 

T CAT TAT GAT ATTCCAACTC CATTAAAGTC AT AT C AC G AA CATAACAAGG AT AAG C AG AC 34 20 

TGCTTTTATC ATTGAACAGT TAGAATTAGG TCTTGACGTT GCGCTCGTAT CTGATGCTGG 3480

ATTGCCCTTA ATTAGTGATC CTGGATACGA ATTAGTAGTG GCAGCCaGAG AAGCTAATAT 354 0 

TAAAGTAGAG ACTGTGCCTG GACCTAATGC TGGGCTGACG GCTTTGATGG CTAGTGGATT 3600 

ACCTTCATAT GTATATACAT TTTTAGGATT TTTGCCACGA AAAGAGAAAG AAAAAAGTGC 3660 

TGTATTAGAG CAACGTATGC ATGAAAATAG CACATTAATT ATATACGAAT CACCGCATCG 3720 

TGTGACAGAT ACATTAAAAA CAATTGCAAA GAT AG AT G C A ACACGACAAG TATCACTAGG 37 8 0 

GCGTGAATTA AC T AAG AAG T TCGAACAAAT TGTAACTGAT GATGTAACAC AATTACAAGC 3840 

AT T GAT T C AG CAAGGCGATG TACCATTGAA AGSCGAATTC GTTATCTTAA TTGAAGGTGC 3 900 

TAAAGCGAAC AAT GAGAT AT CGTGGTTTGA TGATTTATCT ATCAATGAGC ATGTTGATCA 3 9 60 

TTATATTCAA ACTTCACAGA TGAAACCAAA ACAAGCTATT AAAAAAGTTG CTGAAGAACG 4 020 

AC AAC T T AAA AC G AAT G AAG TATATAATAT TTATCATCAA ATAAGTTAAT CACTTTAOG 4 08 0 

ATTaTATGAA ATTTTAAACG ATTTTATAAA CGCAAGCTGT AATTTTAAAT GGTAAGTTAT 414 0 

CATTTTGCAT TGATACTGAT AAAATGATGT TGACTATGAT AAAAAAATGA TGACATCGAC 4 2 00 

GTTTTTTAAT GTAAAATAAA TACATTGAAA GTAATAAATA CCTTAACATT GAATAAGATG 4 2 60 

AAAAT GAGAT G AC CAGAT AA ATGTTCGCGT CCGTTGAAAT GCATAGAAAT CTTAGATATT 4 320 

ATTTGAAGTG AGACATTACG AGGAGGAACA GTTATGGCTA AAG AAAC AT T T T AT AT AACA 4 38 0 

ACCCCAATAT ACTATCCTAG TGGGAATTTA CATATAGGAC AT G CAT AT T C TACAGTGGCT 444 0 

GGAGATGTTA TTGCAAGATA T AAG AG AAT G CAAGGATATG ATGTTCGCTA TTTGACTGGA 4 5 00 

ACGGATGAAC ACGGT CAAAA AATTCAAGAA AAAGCTCAAA AAGCTGGTAA GACAGAAATT 4 5 60 

GAATATTTGG AT GAGAT GAT TGCTGGAATT AAAC AAT TGT GGGCTAAGCT TGAAATTTCA 4 62 0 

AATGATGATT TTATCAGAAC AACTGAAGAA CGTCATAAAC ATGTCGTTGA GCAAGTGTTT 4680

GAACGTTTAT TAAAGCAAGG TGATATCTAT TTAGGTGAAT ATGAAGGTTG GTATTCTGTT 47 4 0 

CCGGATGAAA CAT AC TAT AC AGAG T C AC AA TTAGTAGACC CACAATACGA AAACGGTAAA 4 800 

ATTATTGGTG GCAAAAGTCC AGATTCTGGA CACGAAGTTG AAC TAG T T AA AG AAG AAAG T 4 8 60 

TATTTCTTTA AT AT TAG T AA ATATACAGAC CGTTTATTAG AGTTCTATGA C C AAAAT CCA 4920 

GATTTTATAC AACCACCATC AAGAAAAAAT GAAATGATTA ACAACTTCAT TAAACCAGGA 4 980 

CTTGCTGATT TAGCTGTTTC TCGTACATCA TTTAACTGGG GTGTCCATGT TCCGTCTAAT 5040

CCAAAACATG TTGTTTATGT TTGGATTGAT GCGTTAGTTA ACTATATTC AGCATTAGGC 5100

TAT T TAT C AG ATGATGAGTC ACTATTTAAC AAATACTGGC CAGCAGATAT TCATTTAATG 5160 

GCTAAGGAAA TTGTGCGATT CCACTCAATT ATTTGGCCTA TTTTATTGAT G G CAT TAG AC 5220 

TTACCGTTAC CTAAAAAAGT CTTTGCACAT GGTTGGATTT TGATGAAAGA TGGAAAAATG 528 0 

AGTAAATCTA AAGGTAATGT CGTAGACCCT AATATTTTAA TTGATCGCTA TGGTTTAGAT 534 0 

GCTACACGTT ATTATCTAAT GCGTGAATTA CCATTTGGTT CAGATGGCGT ATTTACACCT 54 00 

GAAGCATTTG TTGAGCGTAC AAATTTCGAT CTAGCAAATG ACTTAGGTAA CTTAGTAAAC 54 60 

CGTACGATTT CTATGGTTAA TAAGTACTTT GATGGCGAAT TAC CAGCGTA TCAAGGTCCA 5520 

CTTCATGAAT TAGATGAAGA AATGGAAGCT ATGGCTTTAG AAACAGTGAA AAGCTACACT 55 8 0 

GAAAGCATGG AAAGTTTGCA ATTTTCTGTG GCATTATCTA CGGTATGGAA GTTTATTAGT 5 64 0 

AGAACGAATA AGTATATTGA CGAAACAACG CCTTGGGTAT TAGCTAAGGA CGAffGCCAA 57 00 

AAAGAT AT G T TAGGCAATGT AATGGCTCAC TTAGTTGAAA ATATTCGTTA TGCAGCTGTA 57 60 

TTATTACGTC CATTCTTAAC ACATGCGCCG AAAGAGATTT TTGAACAATT GAACATTAAC 5820 

AATCCTCAAT TTATGGAATT TAGTAGTTTA GAGCAAT AT G GTGTGCTTAA TGAGTCAATT 58 80 

ATGGTTACTG GGCAACCTAA ACCTATTTTC CCAAGATTGG ATAGCGAcGG AnAATTGCAT 594 0 

ATATCAAAGA ATCAATGCAA CCGCCTGCTA CTAAAGAGGA AAAAGAAGAG ATTCCTAGCA 6000 

AACCTCAAAT TGATAT TAAA G AC T T T GAT A AAGTTGAAAT TAAGGCAGCA ACGATTATTG 60 60 

ATGCTGAACA TGTTAAGAAG TCAGATAAGC TTTTA&AAT T C AAG TAG AC TTAGATTCTG 612 0 

AACAAAGACA AATTGTATCA GGAATTGCCA AATTCTATAC ACCAGATGAT ATTATTGGTA 618 0 

AAAAAGTAGC AGTTGTTACT AACCTGAAAC CAGCTAAATT AAT GGGACAA AAATCTGAAG 62 4 0 

GTATGATATT ATCTGCTGAA AAAGATGGTG TATTAACCTT AGTAAGTTTA CCAAGTGCAA 6300 

TTCCAAATGG TGCAGTGATT AAATAACTGT ATTTTTAAAA ATTAGGAGAG AT AAT TAT GT 63 60 

TAATCGATAC ACATGTCCAT TTAAATGATG AGCAATACGA TGATGATTTG AG T G AAG T G A 64 20 

TTACACGTGc TAGAGAAGCA GGTGTTGATC GTATGTTTGT AGTTGGTTTT AACAAATCGA 64 80 

CAATTGAACG CGCGATGAAA TTAATCGATG AGTATGATTT TTTATATGGC ATTATCGGTT 654 0 

GGCATCCAGT TGACGCAATT GATTTTACAG AAG AAC AC T T GGAATGGATT GAATCTTTAG 6600 

CTCAGCATCC AAAAGTGATT GGTATTGGTG AAATGGGATT AGATTATCAC TGGGATAAAT 66 60 

CTCCTGCAGA TGTTCAAAAG GAAGTTTTTA GAAAGCAAAT TGCTTTAGCT AAGCGTTTGA 6720 

AGTTACCAAT TATCATTCAT AACCGTGAAG CAACTCAAGA CTGTATCGAT ATCTTATTGG 67 8 0 

AGGAGCATGC TGAAGAGGTA GGCGGGATTA TGCATAGCTT TAGTGGTTCT CCAGAAATTG 68 4 0 

CAGATATTGT AACTAATAAG CTGAATTTTT ATATTTCATT AGGTGGACCT GTGACATTTA 6900

AAAATGCTAA ACAGCCTAAA GAAGTTGCTA AGCATGTGTC AATGGAGCGT TTGCTAGTTG 6960

AAACCGATGC ACCGTATCTT TCGCCACATC CGTATAGAGG GAAGCGAAAT GAACCGGCGA 7 020 

GAGTAACTTT AGTAGCTGAA CAAATTGCTG AATTAAAAGG CTTATCTTAT GAAGAAGTGT 7080 

GCGAACAAAC AACTAAAAAT GCAGflGAAAT TGTTTAATTT AAATTCATAA AGTTAAAAGT 714 0 

G AGAAAG AT C ACCGCCATAA ATGTAAACGA TGCTATATTC GTTTAATATG CTATGGTTCT 7200 

TTCTCACTTT TTTAAATTAA AATATCGTGC ATGTGGAATA CGTGCGATAG AGATGGTTAG 72 60 

AGCTTTGAAA TTAAGAATTG TAGGAAGGCG TTTTAAATGA AAAT C AAT G AGT T TAT AG T T 7 32 0 

GTAGAAGGAC GAGATGATAC TGAGCGTGTT AAACGAGCTG TTGAATGTGA TACGATTGAA 7380 

ACGAATGGTA GTGCCATCAA CGAACAAACT TTAGAAGTAA TTAGAAATGC TCAACAAAGT 74 4 0 

CGAGGCGTTA TTGTATTAAC AGATCCAGAT TTCCCAGGAG ATAAAATTAG AAGTACAATT 7500 

ACT G AACAT G TCAAAGGTGT TAAACATGCG TATATTGATA GAGAAAAAGC TAAAAATAAA 7 5 60 

AAAGGGAAAA TTGGTGTTGA ACATGCCGAC T T AAT T GAT A TTAAAGAAGC GTTAATGCAT 7 62 0 

GTTAGTTCAC CCTTTGATGA AGCTTATGAA TCAATTGATA AATCTGTGCT AATAGAGTTG 7 68 0 

GGGTTAATTG TTGGGAAAGA TGCAAGGCGC C GAG AG AAA TTTTAAGTAG AAAATTGCGA 77 4 0 

ATCGGCCATT CCAATGGTAA GCAGTTATTG AAAAAGTTAA ATGCATTTGG TTATACCGAA 7800 

GCGGATGTAA GGCAAGCTTT AGAAGATGAA TGAGGAAGTG AAAATGTTGG ATAATAAAGA 7 8 60 

TATTGCAACA CCATCAAGAA CGCGAGCGTT GTTAGATAAA TATGGCTTTA ATTTTAA&A 7 92 0 

AAGTTTAGGA CAGAACTTTT TGATAGATGT G AAT AT CAT T AATAATATCA TTGATGCAAG 7 980 

TGATATTGAT GCACAAACTG GGGTGATTGA AATTGGTCCA GGCATGGGGT CAT T G AC AG A 80 4 0 

ACAATTGGCC AGACATGCTA AAAGAGTATT GGCATTTGAA ATTGATCAAC GTTTAATACC 8100 

TGTAT TAAAT GAT AC ACT AT CACCTTATGA TAATGTGACG GTGATTAATG AAGATATTTT 8160 

AAAAGCGAAT ATTAAAGAAG CTGTTGAAAA TCATTTACAA GATTGTGAAA AAATAATGGT 82 20 

TGTTGCAAAC CTGCCGTACT AT AT T ACG AC GCCAATTTTA TTAAATTTGA TGCAACAAGA 8280 

TATACCAATT GATGGCTACG TGGTGATGAT GCAAAAAGAA GTGGGCGAAC GCTTAAATGC 8340

TGAAGTAGGT T C AAAAG CAT ATGGTTCGTT ATCAATTGTC G T AC AAT AC T ATACAGAGAC 8 4 00 

TAGTAAAGTA TTAACGGTAC CTAAATCTGT ATTTATGCCA CCACCTAATG TTGATTCAAT 8 4 60 

AGTTGTAAAA CTGATGCAGA GAACTGAACC GTTAGTAACA GTAGATAACG AGGAAGCATT 8520

CTTTAAGTTA GCAAAAGCAG CATTTGCACA AAGAAGAAAG ACAATTAACA ATAACTATCA 8580 

AAATTATTTT AAAGATGGTA AACAACACAA AGAAGTGATT TTACAATGGT TGGAACAAGC 8 64 0 

AGGTATTGAT CCAAGACGTC GCGGTGAAAC GCTATCTATT CAAGATTTTG CTAAATTGTA 8700

TGAAGAAAAG AAAAAATTCC CTCAATTAGA AAATTAAATG ATTGACAAAG CAAAGCACTA 8760

TTGTTAAAAT TTAAATTTTG TTTGACGAAA ACGTTGCAAA TATGGTATTA TGTAACTTGT 8820 

AGCGAGGTGG AGCAATATGC CAAAATCAAT TTTGGACATC AAAAATTCTA TTGATTGTCA 8880 

TGTAGGAAAT CGTATTGTAC TGAAaGCCAA TGGAGGCCGT AAGAaAEAA TAAAACGTTC 8 94 0 

TGGAATTTTA AAAGAAACAT ATCCGTCAGT TTTCATTGTT GAGTTAGATC AAGACAAACA 9000 

CAACTTTGAG AGAGTATCTT ATACATACAC TGATGTGTTA ACTGaAAATG TTCAAGTTTC 90 60 

ATTTGAAGAG GATAATCATC ACGAATCAAT TGCACACTAA ATAAGACATA TAGAGATGTT 9120 

AGACGTTTCT TAGTATAAGA AGTAAATATT ATGATAATTA TTTGAGTGTT GGGcATTATG 9180 

TTCAATACTC TTTTTATTTA CAAAATGTTT AACACTGATG TTTCGCTTAT AGATTTTTCA 9240

GTAAATGGAT AATTGTATTT ATAAACACAA ATACAAGTAA ATACTAAGTA ATTAGATGGA 93 00 

G AAAAT TACT TTTTTATTAA AAAAACACTA AAAAACAAAT TAAAATGTCA AATATTAATT 9360 

CTCTTTATGT T AAAAT CAT C ATATTAAGAT AACGAAAAGA GGGCGGAAAA TGATATATGA 94 20 

AACGGCACCA GCCAAAATTA ATTTTACGCT CGATACACTT TTTAAAAGAA ATGATGGCTA 94 80 

TCATGAGATT GAAATGATAA TGACAACAGT TGATTTAAAT GATCGTTTAA CTTTCATAA 954 0 

AAGAAAAGAT CGAAAGATAG TTGTTGAGAT TGAACATAAT TATGTGCCTT CTAATCATAA 9600 

AAATCTCGCA TATCGTGCAG CGCAACTATT TATTGAGCAA TATCAACTAA AGCAAGGTGT 96 60 

AACAATTTCT AT C G AT AAAG AAATACCTGT TTCTGCTGGC TTAGCTGGAG GTTCGGCTGA 97 2 0 

TGCAGCAGCA ACGTTAAGAG GATTGAATCG ACTTTTTGAT ATAGGGGCGA GTTTGGAAGA 97 8 0 

ATTGGCTCTA CTAGGCAGTA AAATCGGGAC AGATATTCCG TTTTGTATTT ATAATAAAAC 98 4 0 

TGCACTATGT ACTGGAAGAG GAGAGAAAAT CGAGTTTTTA AATAAACCAC CTTCAGCTTG 9900 

GGTGATTCTT GCTAAACCAA ACTTAGGCAT ATCATffiCCA GATATATTTA AGTTGATTAA 9960 

TTTAGATAAG CGTTACGACG TACATACGAA AATGTGTTAT GAGGCCTTAG AAAAT C GAGA 10020 

TTATCAACAA TTATGTCAAA GTTTGTCTAA TCGATTAGAG CCAATTTCTG TTTCAAAACA 10080 

CCCACAAATC GATAAATTAA AAAAT AATAT GTTGAAAAGT GGTGCAGATG GTGCGTTAAT 1014 0 

GAGTGGAAGC GGACCTACTG TGTATGGGCT AGCACGAAAA GAAAGCCAAG CAAAAAATAT 102 00 

TTATAATGCA GTTAACGGTT GTTGTAATGA AGTGTACTTA GTTAGAC TAT TAGGATAGAA 102 60 

GGGTTGAAAA GATGAGATAT AAACGAAGCG AGAGAATTGT TTTTATGACG CAATATTTGA 10320 

TG 10322 
(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5614 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

GAT T G AT T AA ATGTTTTAAT CCACTTCAAT GCCTTCGATA AACTCTACAA TCGCGCTATT 60 

CATATAATTA TTCGATTTCA TTTGTTCAGC ATATGTCTCA TTAAATCCAG ACATAACTTT 120 

TTTAAAwGCG AAAATTGAAA TTGGTATCGT TACT AATAAG GCACTAGCCA TACGCCAATC 180 

AATGAGCATT ATGTATAAAA AGATAGCAGC TGACAAAAGT AAGTTTCCTA TAACTTCAGG 24 0 

AATCATATGT GCTAAAGGTA ATTCTATTGT TTCAACCTTA TCGACAAATA TATTTTTTAA 300 

TTCACCTATT TTCTTAGATT CCaCTACGCC TAAAGGGAGA CGCATTAATT TTTGAGCTAA 360

TTTTTTACGA ATTTCAGATA AAAT T T CAT A TGCCGTAATA TGTGATAGCA TCGTTGACGC 4 20 

TCCAAAACAA CACACTTGTG AAATATAAGC GAT T AAAGCA ATAAAGATAT AAACCATAAT 4 80 

CGAATTAATC GTATATGTAT TGTTAATCAT CATTAAAATA ATTTTAAATA CTGCCCAATA 54 0 

AGGAACTAAT CCAGAAAAGA CACTGATGAT AGACAACAAA ATTGATAACA TAATTTTCCA 600 

TTTACTATCA GAAGCAAATC CGCTGAGTAT CTTTATCCAA TTTGTCTGAT TGTTCATCGT 660 

ATATGTAACT CCT kTCAATT AATAATCTAA ATTAAGCCGC TTATATTATT TATTTCACTG 72 0 

GATGATATAC ATAATATAAA TTTGTTATTT GTTAAAAATT AATACTTKT ACAAGTACAT 780 

CATATATTAG TTGATAACGA TTATCAATGT CGCGTGGATT TGTGACACAT TTCTTTTAAA 84 0 

AATTCACAAG GTTATGGGGC AGAAATGATA AAGAGCCACT AATGATTTAT TATGTAGTGG 900 

TTCTGGGAGT GGGACAGAAA TGATATTTTC ACAAAATTTA TTTCGTCGTC CCACCCCAAC 960 

TTGCATTGTC T C TAG AAAT T GGGAATCCAA TTTCTCTTTG TTGGGTCCCT GAATATAGCC 102 0 

TTGTAGAGTC TAGTACATTG ATTTGTATCC CAATGTCCCT ATAATTGATT ATTCGCTTTA 1080 

TCTAATGATC CTATGACTCA ACTATTAAAT CATTTTTCGA AATACTTAAT TCTAATATAA 114 0 

TTAAATTCAT TTATTGTAAT ATTGCAAAAA TACATTGCAC ACCTTGTTCA TCAATGCTAT 1200 

AATTAATTAC ATAATAAATT GAACATCTAA ATACACCAAA TCCCCTCACT ACTGCCATAG 12 60 

TGAGGGGATT TATTTAGGTG TTGGTTATTT GTCACCTTTT TTATTGTTGC GCGTTCGTAA 1320 

CCAATGTGCA AAAAACGCAA CAAGACAGCC GCTTATAGCT GAAGTCATGA TGTTATTAA 1380 

T AAAT T G AAC ATCCGTCATA CACCTCCTCT CTGCGTTAAA GTAACGCCCG AGATGTTAGG 14 4 0 

CGACCAT CAT ATTATATCAT TTATTTATTA TATTTCACGC AATATTAAGG CTTAAGTAAA 150 0 

GTTTTTTTTA GTGGTTTACG CTACTTTAAT TGCTATCTTT TAAAATCCAT TTAGATAATA 1560 

TAAATGTGAT GGGTATCGTA ATAATTAAAC CAGCAAATGG TGCAATTTCT GCTGGCAAAT 162 0 
TTAGCCAGGA TACAAATACA TATAATAAAA CTGTTTGTAA GCTTACGTTG ACAATCTGCG 1680

TAATTGGAAA ACTAATGAAT TTTCTCCAAG TAGGTTTTAC CCTGTAAACA AAATAACAAT 17 4 0 

TCAAATAATA TGAAATCACA AAAGCGACTA GAAATCGGT AATATGACTA ATCATATATT 18 00 

CAATGTGTAA TAATTTTAAC AGCAATAAAT AGACAACATA ATAATTTAAC GTATTAATGC 18 60 

CGCCAACAAT GATAAATTTT AAAATTTCAG CATGCGTTTG TGTTAGTTTC ATATGTGTAc 192 0 

TCCTCAACAT CAAAATATAT GCATAACTAC GTTCTCGAAC ATACTCGAAT ATGCGAGCCA 1980 

ATCCGCTTCA CTTCAAATAT GCTTATTTCA ATCTTTATAC CCTTTCACAG CAAATTTAGT 204 0 

CTCTTTCCCC TCATCCTTAT AC GC CAT TAT AATGTAACTG ATTTATCGCG TGACTCATTA 2100 

GCACTATAGA GATTACTTTA GTTCACTAGT AATTTTATAT ACAATAAGAG C G AC AAC AG T 2160 

AATGAGAGGA TGTCTACTAT GCAATTACAA AAAATTGTCA TCGCTCCTGA CTCATTTAAG 222 0 

GAAAGTATGA CCGCACAGCA AGTTGGCAAT ATTATAAAAC AGGCTTTTAC TAATGTTTAT 2280 

GGGAATACCC TTCATTATGA TATCATTCCG ATGGCTGATG GTGGTGAAGG TACCACAGAT 2 34 0 

GCTTTAATGC ATGCAACAGG TGCCACTAAG TATACAGTCA TCGTAATGA CCCTTTAATG 24 00 

CGACCTATTG AAGCATGTTA TGCACGCGCA GACGAACAAC AAATTGCAAT TATTGAAATG 24 60 

GCGGCAGCGT CAGGTTTGGA TTTATTAGAA AAAGAGGAAC GTAATCCTTT ATACACATCA 2520 

TCATATGGTA CCGGTGAACT AATTAAAGAT GCATTAAATC ATGGTGCTAA GACCATTATT 2580 

TTAGGGATTG GTGGCAGTGC AACAAATGAT GGTGGTACAG GTATGCTAAG TGCACTAGGC 2 64 0 

GTAAAGTTTA CTGATGTAAA CGGGGACTTA TTACAAATGA ATGGTGCTAA TCTTGCTCAC 2700 

ATTGCACAAA TCGATATAAC CAATCTAGAT TCGCGATTAA AAGAGGTGAC CTTTAAAGTG 27 60 

GCCTGTGATG TTTCAAATCC TTTAT1SGGT GAAAATGGTG CTACCTATAT TTATGGTCCT 2820 

CAAAAAGGCG CTGATGCAAA GATGATACCA AAGTTGGATT TCGCAATGTC GCAT TATCAT 2 88 0 

GATAAGATAA AAATGTGCAC AGGAAAGTCC GTTAATCAAA TACCAGGTTC TGGTGCAGCT 2 94 0 

GGCGGTATGG GCGCAGCATT ATTAGCGTTT TGTGAGACAA C T T T AAC AAA flGTATTGAT 3000 

GTCGTCTTTG ACATTACAGA TTTTCATCAA AGAATTAAAG ATGCAGACCT CGTTATTACT 3060 

GGAGAAGGAC GCATGGATTA TCAGACC AT C TTTGGTAAAA CACCCGTAGG CGTTGCGTTA 3120 

GCTGCAAAAC AATATCATAT TCCTGTCATC GCGATTTGTG GCAGTCTAGG CGAAAATTAT 318 0 

CAACATGTTT ACGATTTCGG TATTGATAGT GCCTATTCTA TAATCTCTTC ACCTAGCACT 32 4 0 

TTAGAAGATG TCCTACAAAA TAGCGAACAA AATTTATTAA ACACTGCAAC TGACATTGCT 3300 

CGTATTCTGA AATTACAATA ATGTCAAAGT AAATCATCAG CTTTATTATT TGCAGTTAAA 3360 

ACTTGAATGA GGTGAAACCC AT GAAAAG AA CTCATAAATA CCGTGATTCA TATCAATACG 34 20 

ACAATCAAAA CCAAAATCAT CGTCGTCAAT CTGAAGACGC ATCGTATAGA CAACAATATG 34 8 0 
CTAAAGGCGA TCCTGAAGAA CACCCGGAAC GATACTATAA TGGTAGAGAT TATCGAAGAG 3540

AACAAATTCT TGAAGAAGAA AACGAGAAAT CCCGCCGTTC AAAAAAATGG TTATATATK 3600 

TTATTGCCAT TCTCTTAATT ATTGTCGCTA TTTTTGTCAC ACGCGCCTTA CTTAACAATG 3 6 60 

ATAGCGATAA AGTTAGTAAT GACCCTAAAG TCTCTCAAAA TTATAAAAAA CAAGTTGAAA 3720 

ATCAAGACGG CCAAATTAAC CAGCAAGTAG ATAATGCTAA AGAAAATATT AAAAACAACC 37 80 

AAAAAACTGA T G AC AT TAT T AAAAATTTAC AAAATCAAAT CGACAACTTG AAGCAGCAAG 38 4 0 

AACAAAACAA AGCTGATTCT AAGCTAACTC AATTTTATCA AGACCAAATC AACAAATTGA 3900 

CAGAGGCAAA TAATGCACTT AAAAACAATG CAAGCCAAGG TAAAATTGAA AGCATGTTAA 3 960 

ATGATATTAA TACAAAATTC GACAGT AT T A AATCTAAATT AGAAAGCTTA TTTAAAGATG 4 02 0 

ACAATGGTGG CGCTAATTAA TTATTACACC TGCTTTGATG ATAAACATTA ATTCCCTATA 4 080 

CTTTATCTGT ATCACTACGT TATTCGTGAT GATGCATTAA GAGTATAGGG ATTTTTTATA 4140 

TAAACTTGTA TTCTAACTAC ATACAAATAC ACACAAAACG TATATAATTT ATATAATTAT 4200

ATCATTTAAA GGAGTCTGAT CATGCAATCA ACGAAAACCA AAACGAAGCA TTTTTCATTT 4260 

TTATTGCTAA TTACGTTAGG CGTCATGACC GCTTTTGGCC CACTAACTAT AGATATGTAC 4 320 

G T AC CAT CAT TACCTAAAGT GCAAGGTGAT TTTGGTTCTA C T AC AT C AG A AATTCAATTA 4 38 0 

ACATTATCAT TCACAATGAT TGSTCTTGCA CTAGGCCAAT TTATCTTTGG ACCTTTATCC 4 44 0 

GATGCTTTTG GTCGCAAACG GATTGCTGTA TCCATTTTGA TCATTTTCAT TTTGGTATCA 4 500 

GGTTTGTCTA TGTTTGTTGA TCAATTGCCA TTATTCTTAA CTTTACGATT TATTCAAGGT 4560 

TTAACTGGTG GTGGCGTCAT CGTGATTGCA AAAGCCTCTG CTGGTGAfflA ATTTAGTGGC 4 620 

AACGCACTCG CTAAATTTTT AGCATCTTTA ATGGTAGTTA ATGGCATCAT CACTATTCTT 4 680 

GCACCATTAG CCGGTGGATT AGCTTTATCC GTAGCAACAT GGCGTTCTAT TTTCACAATT 4740 

TTAACTATTG TGGCACTCAT CATTTTAATT GGCGTCGCTT CTCAATTACC TAAAACATCT 4800 

AAAGATGAAT TAAAGCAGGT GAATTTTAGT AGCGTCATTA AAGATTTTGG AAGTCTTTTG 4 8 60 

AAAAAACCAG CATTTATTAT TCCAATGCTA TTACAAGGwT TAACTTATGT AATGCTATTT 4 920 

AGTTATTCAT CTGCATCGCC ATTTATTACT CAAAAATTGT ATAATATGAC ACCCCAACAA 4 980 

TTTAGTATCA TGTTTGCTGT TAACGGTGTA GGTTTAATCA TTGTCAGTCA AGTCGTTGCT 5 04 0 

TTATTAGTAG AAAAAT T AC A TCGCCACATA TTATTAATCA TTTTAACTAT TATACAAGTG 5100 

GTAGGTGTTG CTTTAATTAT CCTGACACTT ACATTCCATT TACCACTTTG GGTCTTACTC 5160 

ATCGCATTCT TCTTAAATGT GTGTCCTGTG ACGTCAATTG GACCGCTTGG TTTCSAATG 5220 

GCTATGGAAG AACGAACAGG TGGCAGTGGT AACGCATCAA GTTTACTTGG CTTATTCCAA 5280 
TTTATCTTAG GTGGCGCTGT TGCACCATTA GTTGGCTTAA AAGGCGAATT TAATACATCA 5340

CCAT AT AT G A TTATTATCTT CAT T ACAGCC ATTCTATTAG T C AG T C T AC A AATCATTTAC 54 00 

TTTAAAATGA TTAAAAAGCA ACATGTCGCA TAACACTTCA ACATAATTAG AACCCTAGCA 54 60 

AAGAT AT C T A TCTTTGTCAG GGTTCTTCTT TATGAATTAT GAGATCGAAT CTTCAACTAA 55 20 

AATTACGCCT TCATAGCAAG GACATTTCTA TTCAATCACC CTTTAACAGG CATCCAAATT 55 80 

TcTGTAATAT ATTTTTCACT TGTAGTATCA CCAT 5614
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9179 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

CATGGTAAAT TTGTTGAAGC AAATAATAAT GAAATTGCTA AAGCAATGAC TATTTCACAT 60 

AAAGACAATG ATATGAAGTA TATGGATATC ACAGAaAAAG TGCCAATGTC GGAATCTGAA 120 

GTTAACCAAT TGCTAAAAGG TAAGGGGATT TTAGAAAATC GAGGGAAAGT TTTTCTAAA 18 0 

GCTCAAGAAA AATATGAGGT TAATGTCATT TATCTTGTTA GCCATGCATT AG T AGAAAC A 24 0 

GGTAACGGCA AATCAGAATT AGCAAAAGGC ATTAAAGATG GGAAAAAACG CTATTACAAC 300 

TTTTTTGGTA TAGGAGCATT CGATAGTAGT GCTGTTCGTA GTGGGAAAAG TTATGCTGAA 3 60 

AAGGAACAAT GGACATCACC AGATAAGGCG ATTATTGGTG GTGCAAAGTT CATTCGTAAT 4 20 

GAATATTTTG AAAACAATCA ACTGAATTTA TATCAAATGC GATGGAATCC AGAAAATCCT 4 80 

GCGCAACATC AATATGCGAG TGACATTCGC TGGGCAGATA AAATTGCCAA ATTAATGGAT 54 0 

AAATCCTATA AGCAGTTTGG TATAAAGAAA GATGATATTAGACAAACATA TTATAAATAA 600 

GACATCGGTG CTTAAAGGAG CTGGAACAAT TTATTGTTTC GAGCTCCTTT AGCGCATTCT 660 

GAGTGTGTTA GTTAAATGGA TTTTAACCTA ACAAAAAACG CTATATAGCA TCAAATATGC 720 

TATATCCCAC ATCATTGTTA CAAATGTACA TGATGTAAAT GAATATTGCT GTCTAAATGT 78 0 

GCATGTAATA TACAATGGTG CAGATAATAC ACTTAAGTCC TTAAAAATGA AACGTTAgTT 840 

CCAAGAGTCA TTTTTAAACA ATAGTGCATG TGATAAAATA GAAAAGAATG AAAAATATAG 900 

AGGTGACAAT AT GAAG AT AG CAATTATAGG TGCAGGCATC GGTGGATTAA CAGCTGCTGC 9 60 

ATTATTACAA GAACAAGGTC ATACTATTAA AGTCTTTGAA AAAAAT G AG T CAGTTAAAGA 1020 

AATTGGCGCT GGGATTGGTA TCGGAGATAA TGTGCTTAAA AAACTAGGTA ATCATGACTT 108 0 

AGCTAAAGGT ATTAAAAATG CTGGGCAAAT CTTATCTACA ATGACAGTGT TAGATGACAA 114 0 
AGATCGCCTG TTAACTACTG TTAAATTAAA AAGTAATACA TTGAATGGA CGTTACCACG 1200

CCAAACATTA AT TG AC ATT A TTAAATCTTA TGTAAAAGAT GACGCAATAT TTACAAATCA 12 60 

TGAAGTCACG CATATAGATA AT GAG AC AG A TAAAGTTACC ATACATTTCG CGGAACAAGA 1320 

AAGTGAAGCA TTTGATTTAT GTATTGGTGC TGATGGAATT CATTCTAAAG TGAGACAATC 1380 

TGTAAATGCT GACAGTAAAG TAT T AT AT C A AGGGTATACA TGCTTTAGAG GTTTAATTGA 14 4 0 

TGATATTGAT TTAAAGCATC CGGaTTGTGC AAAAGAATAC TGGGGaAGAA AAG G a AG AG T 1500 

AGGTATTGTT CCGTTATTAA ATAATCAAGC ATATTGGTTC ATTACAATTA ACTCGAAGGA 1560 

AAACAATCAT AAATATAGTT CGTTTGGTffl. ACCTCATTTG CAAGCATACT TTAATCACTA 162 0 

TCCAAATGAA GTTAGAGAGA T C T T AG AC AA ACAAAGTGAA ACAGGTATCT TATTGCATAA 168 0 

TATTTATGAT TTGAAACCAC TCAAATCTTT TGTTTATGGT CGTACTATTT TACTAGGAGA 17 4 0 

TGCAGCACAT GCGACAACGC CTAATATGGG GCAAGGTGCT GGACAAGCAA TGGAGATGC 1800 

TATCGTATTA GTAAATTGTT TTAACGCATA CGACTTTGAA AAAGC AT T AC AGCGTTATGA 18 60 

TAAAATACGT GTCAAACATA CTGCAAAAGT AATTAAGCGT TCTAGAAAAA TCGGTAAAAT 192 0 

TGCCCAATAT CGTAGTCGTT TATTTGTTGC AGTTAGAAAT CGTATTATGA AAATGATGCC 1980 

AAATGCATTA GCAGCTGGAC AAACTAAATT CTTATATAAA TCGAAAGAAA AATAATACAA 2 04 0 

CAATATGAAA ACCCCCGTAT GTTGAAACGA GAGCTCAACA TATGGGGGTT CTTGTTTTTA 2100 

TAATGTTATT ATAATAAATT CAATTATTAG T T AAC G AC AA ATTGTGGTTT CTCACCTTGA 2160 

ACGGCACTAA TTGCAGCATT AGCAACAATT TTAGACATCA TGTCACGTGC TTCAAATGTA 2220 

GCATTACCAA TATGCGGTGT TAATACTACA TTATTAAGTG ATTTTAAGTC ATCGGTAATA 2280 

TCTGGTTCAA ATT CAT AT AC ATCAAGTGCA GCACCTTCAA TTTCATTATC TTTCAATGCT 2 34 0 

TGCACTAGTG CTTGTTCGTG CACGATTGGA CCACGAGAGG CATTGATTAA ATACGCCGTA 24 00 

GATTTCATCA TTTTAAATTG TTCTGTATCA ATTAAATGAT GCATTTTAGG AT T AT AAGCA 2 4 60 

GCGTTGATAG TGATAAAATC TGCATTCTTT AATAGTGTAT CTAAATCTAC ATATTTTGCA 2520 

CCGATTTCTC GTTCTTTTTC TTCTTTGCGA TTAGGTCCAG TGTATAGCAC ATCCATGTCA 2580 

AATGCTCTTG CACGACGAGC TACTGCACTA CCAATTTCAC CTAAACCGAT AATGCCGATT 2 64 0 

GTTTTCCCAG ATACTTCTCT ACCTCTGAAA AATAAAGGTG CCCATCCATC AAATCCAGTT 2700 

GTACGTGATA ATTGGTCCCC TTCAACAATA CGACGCGCTA CTGCAAGTAC TAATCCAATT 27 60 

GTTAAATCAG CAGTCGCGTT TGTTGATGCT TTAGGTGTGT TTGAACATC TATACTTTTT 2820 

TCTCGGGCAT ACTCGATATC AAT AT TAT T A AAACCAGCGC CATAGTTGGC AATGATTTTT 28 80 

AAGTCTTTAC CAGCATCGAT AACATCTTTA TCAACGTTTG TAGATAATAA ACTAATTAAG 2 94 0 
GCAGTCGCGT TTTTAACACC TTTAATTAAA GTGTCTTTAT CGACTAATCC TTTACCTTCA 3000

TACATTTCAA CTTCAAAATG TTCTTGTAAA AGTTTTAAAC CTACTTCTGG TATtGCACCA 30 60 

gCAACATAAm CTTTTtCCAT AAAAG A t C AC TCCTTTTATC TTAGTATAGT AGAAGATTAG 3120 

ACAGTATACA ACTATGTCAT GATGTCTTGT GTATCAATGA TGTAAGCGCG TACTTTTGAT 318 0 

GGAGGCGATA TAACTTAGGC AC T G TAGAAC TATGAATATT GTAATGTGGA AAAACTGGAT 32 4 0 

CAATTAAATT AG AT AAC G T A GTTTTAAAGT TAATAGTATT AGAAAAAATT AATATTTTGA 33 00 

ATATGGGAGG AAATATAAAT AAGTAGGTGG CAACGAAAAA TAGCAAAAAA AGAGCTTCTC 33 60 

CTATAAAGGA AAGCTCAAAG TTTTTTGATG ACATATGTAC T AGAAT T AAGT T T C AAG AC A 3420 

ATATGTATCA TCGTGTTTAT ATTAAATATG GATGTAGTTG TAGTTACCTG CTTCACTTGC 34 8 0 

AGAAATAGTT CTAGAACTTA CTGAGAAAGG TCCGCCACTA TAATTCATTT CTGAAATTGT 354 0 

AACTGAACCA TCACTGTTTA CACTTTCTAC ATATGCAACG TGACCAAATG GTCCTTCAGA 3 600 

AGATTGTAGG ATAGCGCCTT TAGAAGGTGT ATGATTTACT GTGAATCCAG CACCTTGTGC 36 60 

AGCAGCAGCC CAATTATTAG CATTTCCCCA AGTAGAACCG ATTTCTCCGC CAACTTTATC 3720 

ATATACATAC CAAGTACATT GTCCTGCAGT GTATAAGTTA CCAGAATGTG AAATTGATGA 37 80 

TGTAGTTGTC GTAGTTGTCG TAGTCGTTGT AGTTGAGTC GTGTTGTAGT TATAGTTGTT 384 0 

GTAATTTGTA TAATTTTCAG CAGCATCTGC ATGATGTGCT TGACCTACTA ATGCTGTGCC 3900 

GATTCCTGCT GTTAACGTAG TTGCTGTTAC TAATTTTTTC ATGAATAAAG TCCTCCAAAG 3 9 60 

TTCTATATCT TTTTTTATAA ATAAAACGTA GCGACTGTTT TATTCTCACA TCTCGAATC 4 020 

ATGACAATAG TTACTTTAAC AAAATtAATG cTTCTTGTGG GGAATGTTAT TGATTTGTAA 4080 

AAGAATAAAA AAACTTTGAC TAATTTTGTA ATAAAAATTA GTCAAAGTTA CAATGAGATT 4140 

AACAGATAAT TAATAGGAAA TATTTATTTG TAATATGTTT AAATAAATCG AATTGTTAAA 4 2 00 

GGTATTATAT ATTCTTGGCC ATTATAATAT T T G AC AC AC G CAATAATTGT GAATACAAAA 4 2 60 

GATAATATTG AGAAAGCGAA TATGGATAAA ATACCGATAA ACGTAATGAT GAAACCTATA 4 320 

ATAATAATGA AATCAATATC TGTAGCAATT AGGAAAACGC CTATTAAAGT GATAACGACT 4 380 

AAAACGATAG ACCAAATAAT ATAAGAAATC GTATAGTTAAGATAATTTTT TCCAGCACGA 4 440 

TCAACTAGTT TCGATTCATC TTTTTTCAAT AACCATATTA TCAGTGGACC AATAATAGAT 4 500 
GTGAATAAAC TTAATAAATA GATAAGCATC GCCATAATGT TCTCATCATT GGATTTGCGA 4560

TTCGGTTGAT GATTTGTTAC GTCGTTCATT TCAGTTGTCA TATTAGACAC TCCTTTGAAA 4620

ATTGTAATAT TATCTTTAAC TATAACAAAA TATAATCAAA AATAAACATG TTTATTAAAC 4 680 

AATTATTAAA AATAAAAATA ATTGGTGGAC GTCGGCGTTT AAATAGGTTA ATTTAAGGTT 474 0 

AT AT AT AC TT AACATTTATA ATGATGCGTA ATGAATTCGC ATCATTTTTA TATTGTCTTA 4 800 
CGTATAATTT GTTTTTAATT TTAACCAAAG ATAGAAAGAG GGTTGTTTAT GAAAATAGCA 4860

ATTGTAGGAT CAGGAAATGG CGCAGTTACG GCAGCAGTAG ATATGGTGAG CAAAGGCCAC 4 92 0 

GATGTTAAAT TATATTGTCG TAATCAATCT ATAAGTAAGT TTCAAAACGC AATCGAAAAG 4 98 0 

GGCGGATTTG ATTTTAATAA TGAAGGTGAT GAACGTTTCG TAAAATCCAC TGATATTAGT 504 0 

GATGATATGG AATATGTTTT AAAAGATGCT GAAATTGTTC AAGTGATTAT TCCATCTTCA 5100 

TACATAGAGT ATTATGCTGA TGTAATGGCA GAGCATGTAA CTGATAATCA GTTGATATTC 5160 

TTCAACATGG CTGCAGCAAT GGGGTCAATT CGTTTTATGA ATGTTTTAGA AGATAGACAT 5220 

ATTGAAACAA AACCACAACT AGCGGAAgcT AATACGTTGA CGTATGGTAC GCGTGTCGAT 52 8 0 

TTTGAAAATG CAGCAGTTGA TTTATCTCTA AATGTACGTC GTATCTTCTT T T CAAC AT AT 534 0 

GATAGAAGCT GTCTAAATGA TTGTTATGAC AAAGTTTCAA GTATTTATGA TCATTTAGTA 5400 

AAAGAGGAAA GCTTAATTAA AACAAATCTT GAAAATGGTA ATCCTGAAGT GCATCCAGGA 54 60 

CCAACATTAT TGAATGTCGG TCGCATTGAT TATGCTGGCG AGTTCGCTTT ATATAAAGAA 552 0 

GGAAT T AC T A AACATACAGT TAGATTACTT CATGCAATCG AATTAGAACG TTTGAATTTA 5 58 0 

GGCCGTAGAT TAGGTTTTGA ATTATCAACA GCTAAAGAAT CACGTATTGA ACG3TGTTAT 564 0 

TTAGAACGTG ATAAAGAAGA TGAACCATTA AATCGTTTGT TTAATACAAG CCCAGTATTT 5700 

TCACAAATTC C AGGAC C AAA T CAT G T AGAA AGCAGATATT TAACTGAAGA TATTGCATAT 5 7 60 

GGTTTAGTAC TATGGTCAAG CTTAGGTCGT GTTATTGATG TACCGACACC AAATATAGAT 5 82 0 

GCAGTAATTG TAATTGCATC AACCATTTTA GAGAGAGACT TCTTTGAGGA AGGCTTAACA 588 0 

GTTGAAGAAA TTGGTTTAGA TAAGCTTGAT TTAGAAAAAT ATTTAAAATA AATGATGGCT 5 94 0 

TGAAGATAGA AAAGGATATA GCATTATGCA AAAGCAATAA ATT G AAG AAA AGAGGTTTCT 6000 

CAT CAAT AAG CGnAGGGGAC GATAGATGAT GAAAASLAAA CCCACCTTTT TAGAATCAAT 60 60 

TTCGACAATG ATTGTAATGG TTATTGTTGT TGTAACAGGC TTTGTGTTTT TTGATATTCC 6120 

AATTCAAGTA TTATTAATTA TTGCCTCAGC ATATGCCACA TGGATTGCAA AACGTGTAGG 618 0 

CTTAACATGG CAAGATTTAG AAAAAGGCAT TGCAGAACGT TTAAATACTG CAATGCCTGC 624 0 

AATTTTAATT ATACTAGCGG TAGGAATTAT AGTAGGCAGT TGGATGTTTT CTGGCACAGT 6300 

GCCAGCCTTG ATTTATTATG GCTTAGATTT ATTGAATCCA AGCTATTTTT TAATATCAGC 6360 

CTTTTTTATA AGTGCTGTTA CATCTGTAGC AACTGGTACA GCATGGGGCT CTGCATCAAC 6420 

TGCAGGGATT GCACTTATTT CTATTGGTAA TCAATTGGGG ATTCCTCCAG GGATGGCAGC 64 8 0 

GGGTGCTATT ATAGCAGGGG CTGTGTTTGG CGATAAAATG TCACCATTAT CAGATACAAC 654 0 

TAATTTAGCG GCGCTTGTTA CTAAAGTTAA TATATTTAAA CATATACATT CGATGATGTG 6 600 
GACGACGATA CCTGCATCAA TCATAGGTTT ATTAGTATGG TT2LTTGCTG GATTTCAATT 6660

TAAAGGGCAT TCAAATGATA AAC AG AT T C A AACTTTGTTA TCAGAGCTTG CACAGATTTA 67 2 0 

TCAAATTAAC ATATGGGTCT GGGTTCCCTT AATTGTGATC ATTGTTTGTT TGCTATTTAA 6780 

AATGGCTACA GTGCCAGCTA TGCTAATATC AAGCTTTTCT GCCATTATAG TGGGGACTTT 68 4 0 

TAATCATCAT TTCAAAATGA CAGATGGTTT CAAAGCAACA TTTAGTGGTT TTAACGAATC 6900 

AATGATACAT CAGTCTCATA TTTCATCCAG TGTGAAAAGC T T G T TAG AAC AGGGTGGTAT 6960 

GATGAGTATG ACCCAAATAT TAGTAACGAT ATTTTGCGGA TATGCATTTG CAGGTATTGT 7 02 0 

AGAAAAAGCA GGATGTTTAG AAGTCTTATT AACTACTATT TCTAAAGGCA TCCATTCTGT 7 08 0 

AGGAAGTTTA ATATGTATTA CTGTTATTTG TTGTATTGCG CTTGTATTCG CTGCAGGTGT 714 0 

TGCTTCGATT GTAATTATTA TGGTCGGTGT GTTAATGAAA GATTTGTTCG AAAAATACCA 7200 

AGTCTCTCGT TCCGTTTTAT CAAGAACACT TGAAGACTCA AGTACGATGGTCTTACCTTT 72 60 

AATACCATGG GGAACATCAG GTATTTACTA TACGAATCAA CTTCATGTCT CTGTTGAAGA 7 320 

ATTTTTCATA TGGACAGTAC CATGTTATTT ATGCGCAATT ATAGCAATTA TCTATGGTTT 7380 

TACAGGGATA GGTATTAAAA AGTCATCGAA TTCACGTTTA ACTTAATGTG AGCGTGGAAT 74 40 

ATATATAATA TGTTGAAACA CTTTAATCAT TTATAATTGT AGCGGTTATA AT T T GAAAAG 7500 

GTTTTAACTT AGAATAAATA TCCTCTATGC ATATACTGAA TATGTTTTGT AGCGGAACAT 7 5 60 

GTTGATATAT GTAATGTAAG TTTTATGTCA TGATTTGTAA TGACTAAATT AATTGAGAAT 7 62 0 

TTGAAGGCAA GTATATTTGT AAGTACTTTA ACTAAAAATT TATCAATGTA TAGCCGATTT 7 68 0 

GACATGCCTA AATTTGGGTG TGTCAATGGC TGTATGTTGT TTATTCTTTA TTACAGAGTG 7 74 0 

AATCGGATTG GTGAAAATCG AAATTTTGAG ATTTTTACCA ATTCGATTTT TTTCATAGAA 7800 

ATTAAAAAAG CCAACAAGGC TCTTGAAACC TTGTTGGCGT AAACATAGCC ATCACTAKT 7 8 60 

AGTGAATGAA GTTATAACCA GCAGCTTGGC TAGCTGAGAT TGTACGTGAA GTTACAACAC 7 92 0 

CTGGGCCATA ACCATAGTTC ATTTCTGAAA CTCTTACTGA ACCATTGCTG T T AAC AC T T T 7 98 0 

CAACGTATGC AACGTGACCG TATGCACCTT GAGTTGTTTG CATAATTGCA CCAGCTTTTG 804 0 

GTGTATTGTT CACTGTGTAA CCAGCTCTTG CAGCTGCGTT AGCCCAGTTA CTTGCATTGC 8100 

CCCAAGTTGA ACCGATTTTA CCACCTACAC GAT CAAAT AC GTAGTATGTA CAT T G AC C AG 8160 

AAGTGTATAA GTTACGTCCT GAAGTATAAC CACTTGAGAT TGAACGGCCA TTTGATGATG 8220 

GAGCCATAGT TGTAGTTACT TGAACATTGT TGCTTGAAGT GCTGTAGCTT GCACCTAAAC 8280 

CACCAGTACG GTAGCTGTTT GTGTTGTAAC TATTATAGTT ATTGTAGTTA TATGATTGAT 8340 

TATTATTTGA GTAGTTGTTG TAACGGCTGT AGTTATTGTA GCTATAACCG TTGTTGTAAT 84 00 

TGTTATAGTT ATTGTAACCA TTGTAGTAGT AATAGCTGTA GTAGCCATTA TCTTGGTTTA 8460
ATTGACTTGG ATGCCAGTTA CCTTTCCATG TGTAATGGTA GTTACCTTGT GCATCAATAG 8520

TGTAAGTATA GCTATATGAT GTTGGGTCGT TTGGATTATA ACCGTAGTTA TCTTGCTCAG 858 0 

AAGCATGAGC TTGATTTCCT GATGCAATTG CGATTGTAGC GAATCCTGCA GTTGCGATAG 8 64 0 

TAGCTGTAGC GATTTTCTTC ATTTTAAAAA TATCCTCCTA AAAATTTTAA ATCTAAAATA 8700 

TTTTCGTAAT GTCCGTGTGA CAAAATTAAT GTTATAAGTT ATCTCTCGTA AT T AAACGAC 87 60 

AAGAAAGACT ATAACAGAAA TTAGCGTCCT TGTGTGCTTT GTTAACGTTT TGTAATTTTT 8820 

TGCTAATATC TTGACACAAT AGAATTTTAA AAGTATAGAA ATTTGCSTT TGCAAAACTT 8 880 

ATAACTACGG CATTCTTTGT GAAAACTGAA TGTTTCGAAA ATAAGTCTGT TACAAATTTG 8 94 0 

TAATATTACT GAAAATTCTA AATGTATATT TTGTGCATAA TAT AG G AC T T TTAATCAGAA 9000 

AATGCATAAA AATTCGACAC GAACGATCAA GTTTCTATGA TTATGAAAGC TTTGGTATTT 9060 

GGATGAAAAT GTATATTTAA TGGATAAAAT ATCCTAATTT AGCATAAAAA AATGTTTTAA 9120 

TAAAAGTATT ATTTGATATA ATCGATTTAT GTTTTGTTAC TGCTAAAAAA CATGTGGCG 9179 
(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1868 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

CCTTCAGCCA TTTGACTTCG ACATGAGTTG CCTGTACATA TAAAATAAAT TGTTTTTTTA 60 

GTCATAACAA TCTCCTAATT AATTAAAATA TGATAAGTGT TAGATACAAC CCTATGAGGG 120 

TTATAAATAG TACTGGAATT GTAATGATGA TAC CAGTTTT AAAGTATGTG CCCCAAGAAA 180 

TCTTAACATC TTTTTGtGTT AAGACGTGTA ACCACAGTAA TGTAGCTAAA GAGCCTATCG 24 0 

GTGTAATTTT TGGACCTAAA TCAGAACCGA TAACATTCGC ATAAATTAGGCCTTCTTTTA 300 

ACATGCCATG GACATTTGAT TGACCAATAG CAATCGCATC TATTAAAACT GTAGGCATAT 3 60 

TATTCATTAT T GAT GAT AAA AACGCTGAAA TGAAGCCCAT TCCCAAAATA GTGCTAAATA 4 20 

GACCGTAATT GGAAATATAT TCTAATATTT TAGCCAATAT TAAAGTAATG CCAGCATTTC 4 80 

TTAAGCCGAA TACGACGATA TAC AT AC C AA TTGAAAATAA TACTATATTC CAAGGTGCGC 54 0 

CCTTAATGAC TTGCTTAATA TTTACAGCAT TTGATTTACG AGCCAACATT AGAAAAATAA 600 

AAGCAATGAT TCCAGTGAAA ATTGATACCG GAATTTTAGT AAATTTACTG ATTAGATAGC 660 

CGAAAAGTAA TATAACTAGA ACAATCCaTG Z&ATTTTAAA TAGCTTTAAA TCATTAATGG 7 20 
CATCTTTAGG ATGCTTTATA TTATTATCAT CAAACGTTTT AGGTATCGCT TTTCTAAAAT 780

ATAACCACAA TACTATAATA CTTGCTAAAA GCGAGAATAA ATTAGGTATA ATCATTCTAC 84 0 

TAAAATATCG AACGAATCCT ACATGAAAAT AATCAGCAGA TATAATATTC ACTAGATCC 900 

TCACGATTAA AGGTAAAGAA GTTGTGTCAG CTATAAAACC ACTCGCAATA AT n AAAG G G A 960 

ATATGGCCCG CTTACTAAAA CCTATATTTT TAACCATCGC TAATACAATA GGCGTTAAGA 102 0 

TTAAcGTGCG CCATCATTTG CGAAAAATGC AGCAACAATG GCACCCAATA ATATGATATA 1080 

AACGAACATT TTTAAACCAT TGCCTTTTGA AGCATGAAGC ATGTGAATAG CTGACCATTC 114 0 

GAATAATCCA ACTTTATCTA AT AT T AAT G A AATAAGAATG AC T G AG AC AA AAGTCAAAGT 12 00 

AGCATTCCAA ACAATACCTG TTACTTCGAA AACATCGGAA AAACTTACAA C AC C AG T AAT 12 60 

AACAGCAATA AAGGCACCAG TTAGTGCGGT AATGCCAAH TCTAGGCCTT TAGGTTGCCA 1320 

TAATACAAAT AAT AAAGT T A C T AG AAAAAT GAGTGTCGCT AAAGTTGTCA TCATTAGCAT 1380 

TCACCAGTCT TAAGGTTATG ACAAATACAT CGTTGGTTAG AGGTATGAAC CTTAGACAAG 14 4 0 

TTATTAATTA CGGACTCAAA AAT AT TAT G A TTgAGCTGGT ATAAATGTTT ATTTCCGATT 1500 

TTTCGTGTCG TAACTAAGTT GGTTTTTACT AATGCTTTCA TATGrTAGCT AAGTGTAGGT 1560 

TGAGAGAATT GAAAATGTGC TAACAAATCA CAAGCGCATA ACTCTCCACA AGAAAGTAAA 162 0 

TCTAGTATTT CTAATCTGCT TGAATCTGAT AAAACTTTTA AAAATGTTGC TAGTTCTTTA 168 0 

TACGTCATAA CATACCTCCT AGACGTTAAA TAGATTATCA T C T AT AT AGA TGAATGTCTA 17 4 0 

TGTTCCTTTG GTATATTACA CGATATGACT ATGTAATTTA AATTTGGTTT TAGTATTAAA 18 00 

AGGGTATTAA AGATAAATTA TAGATATTGA TTTTGCAAAA TATACTCTTT GTTCTGCATT 18 60 

GAAAAAGG 1868
(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15249 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

ATTTATGAAA TCCATAGCnA TAAACATTAT TCTTGCATCG GCTATACAAA CAGTTACCGC 60 

AAGCAAATTT GTATATCAAC CTGGAATTGT GTTCACGTCA ATGGCaAATG CCGATGATGT 120 

GTTATCAGGC GATAGTTATT TTATGGCTGA ATTAAAATCT ATTAAGCGTA TTGTTGAAAT 180

TCCAGATAAT CAAAAAATAT ACTGCTTTAT AGATGAAATT TTTAAAGGTA CCAACACAAC 24 0 

TGAACGAATT GCCGCTTCAG AATCAGTACT ATCATTTTTA CAT G AAAAAT CTAACTTTAG 300 
AGTTATTGCA GCAACACATG ATATTGAGTT AGCTGAACTC TTAAAACAAC GTTATGAAAA 360

TTACCATTTC AATGAGGTAA TAGAAAATAA TAACATACAT TTTGATTACA AAATTAAGCC 4 20 

TGGCAAAGCA AATACACGTA ATGCCATCGA ATTATTAAAA ATCACTTCAT TTCCAGCAAA 4 80 

AATATATGAA CGAGCAAAAG ATAATGTCCC GAAAATTTAG CATTTAACTT TAAACATAAA 54 0 

AACGTCAGCT ATCACATGAC AGAAGACTAT GAACAGTTTC AATAATGTC ATAGTAATCA 600 

TGTTAATAAC TGACGTTTAT TTTATTCTGC AGAATACTCT TCTAAATCTA TATTGCTGTG 660 

CCCATTTAAT GCTAAATCAG CAAATCGACC TTGCTGATAC AAATAGTGGC CGGCAACGCC 720 

TATCATTGCA GCATTATCTG TGCATAATTT AGGACTTGGG ATAGTTAATT GAATGTCATT 780 

GACTTTGCAT TGATCCGCTA AAGATTGACG TAATCCTTTA TTACTCGCCA CGCCACCAGC 84 0 

AACAATTAAT CGCTGAACAC CATATTCTTT ACAAGCTTGA ATAGCTTTAA ACGTGAGCAC 900 

CTCTACAACA CTGTTTTGAA AGCTCGTTGC TACGTTAGCT TCAATGATTG GaATATTTTT 9 60 

TTGTCGTTGA TTGTGAAGTT GATTGATTAC GGCACTTTTC AACCCACTAA AACTAAAATC 1020 

ATAACTATCT TTATCCAACC AAA C AC G AG G GAATGAATAA GTATCTTCAC CTTCAGCAGC 1080 

CAACCGATCA ACTTGTGGAC CACCTGGATA ATTTAAACCA ATTGTTCGTG CCACTTTATC 114 0 

ATAAGCCTCA CCTACTGCGT CATCTCGTGT TTCACCAATG ACTTCAAATG ATAA&GATC 120 0 

CTTCATATAA ACTAATTCAG TAT GTCCACC TGAAACAATA AGTGCAATTA GCGGGAATGT 12 60 

TAATGGCTCT TCTATGTGAT TAGCATATAT ATGTCCTGCA AT AT GAT G AA CAGGAATAAG 1320 

TGGCTnATCG TAAGCAAATG CCAATGCTTT GGCTGCATTA ACACCTATTA GTAACGCACC 138 0 

AATTAGTCCA GGGCCTTCTG TAACCGCTAT GGCATCAATA TCTTCTATTG ATACATCGGC 14 4 0 

ATCCCCTAGA GCCTCGTTTA TTGTTGCTGT TATACCTTCA ACGTGATGTC TACTTGCCAC 1500 

TTCGGGAACG ACACCGCCAA ATCGTTTATG ACTTTCAATC TGACTTAAAA CTGTATTTGA 15 60 

TAAAATATCT CTGCCATTTT TTATAACACT AACGCT'ETT TCATCACAAC TTGTTTcAAC 162 0 

AGCTAGTATT AATATATCTT TAGTCATTTA AATTCACCCA CATAACCATT GCGTCCTCAC 168 0 

CTTCACCATA ATAATTTTTA CGTTTACCAC CAT AT T GAAA TCCTAAATTT TCATATACAT 17 4 0 

GTTGTGCCAC TTTATTATTA ACTCTTACTT CTAAACTCAT C AC AT C AC AA GTGTGACTTG 1800 

CATAGTTTAT TCCGTATTTT AAAAG CAT T T GACCTAAACC ATAGCCTCTA TAATTATCAT 18 60 

CGATTGCAAC TGTTGTAATT TGAGCTTGAT CGATAACAAT CCATAAACCT AAATAACCAA 1920 

TAATTTGTTG TTCAAATTCt AAGACAAAAT ATTTCGCAAA GTTATTTTGC TCTATTTCAT 1980 

GATAAAATGC GTCAATTGTC CAAGAACTGT CATTGAAACT CCGACGCTCA AGATCAAAGA 20 4 0 

CTTGTGGCAC ATCTTCTTTA GTCATCTCTC TAATGTTTAA TTGTTCTTTT GACTGTTGAT 2100 
CCAATTTCGT TCCGCCTCAG CTAATTTATG GTATTTAGGA GTAAATGTAT GTACGTCTGA 2160

AGGTTTATCT AGCAATTGAT AC AT G AC T G A TGCATTTGGT AGcSCGCAA TCACTTCACC 2 22 0 

TTGTAATTCA TCTTGTAATT TTACAGTATC TTTCCCAATA TAAATAAATG GTTGGTTTAA 2280 

ATCTTCTAAA AAAGCTCGCA ATGCCTCTAT CGACATATAT TGATCTTCTA AAATAGTCAC 234 0 

TAATTGACCA TTTTGCCACT GGAATATGCC TGTATAAACT GCTTGTCGTC TTGCATCAAA 2 4 00 

CACAGGAACC AATAATTTAT CAGTATGATC GATTGTTGCT GCCAATGCCT TTAATGATGA 2 4 60 

AACACCATAT AATTTAACAT CTAACGCATA CGCTAATGTT TTAGCAACAG TAACACCGAT 2520 

ACGTAAGCCA GTATATGAAC CAGGACCTTC AGCAACAATA ATCGCATCTA ATTGCTGTTT 2580 

AGCTATTTTG CTCTGCTCAA ATAGCEACT AATCGCAGGC ATTAATTGCA CTGAATGATT 2 64 0 

TTGTTTAGAA TCCGTAGTTA TTTCAGCTAA AACTTCATCG TTTTGCATCA ATGCTACTGA 2700 

TAATGGTTGA TTCGATGTAT CAATGAGCAG CGAATTCATG GATAATTGCC TCCTTAATTT 27 60 

GTTCATAATG TTCTCCTTGC GCGAACAACT CAATTTGTCT TGTATTTTCA BTATTGTTG 2820 

AAATGTTAAT AGATAAATGC GTCGCTGGAA GTAAATCTTT TATAAATTGA CTCCATTCAA 288 0 

TAACAGTAAT TGCCTGATCT TCGAAAAATT CATCAAATCC TAAATCTTCA TCAGAATCTT 2 94 0 

CTAAGCGATA ACAATCCATA TGATGCAATT TTAAATTTTT ACCCCTATAT GATTTAATGA 3000 

TGTTAAATGT CGGGGAATTA ATCGTACGTC TTACACCAAG AGCTTTTCCT ATAAATTGCG 30 60 

TTAACGTTGT TTTACCTGCT CCTAAATCTC CGTTAAGTAA AATCAAATCA CCACTTTTCA 3120 

ATTGCTCAAC TAAAAATATA GCAAATTGAT TCATTTCATC TAAATTATTT ATCTTTATCA 318 0 

ATGTTGATTC TCCTATATTA TGCTTTTCAT TCAAAAAAT GATTATCCAT TGTTCAATCG 32 4 0 

TATCTAACTT TATATTTAAC CTTTATATTG TAACAAATTT CAACTTAAAT TTCTTATCTT 3300 

TGAAACAGAT TATCTATTCA AAGTTAATTG TAAGAAAATT TAAAATATTT GTTGACATAC 33 60 

TAAAGCAGAT ATAGTAAATT AAATTTATCA AATTTTTAGA CAATTCTAAC TATTAAAG3' 34 2 0 

ATATATACCA TTCACGGAAG GAGTATAATA AAATGCTTAA TCAATATACT GAAC AT CAAC 3480 

CGACAACTTC AAATATTATT ATTTTATTAT ACTCTTTAGG ACTCGAACGT TAgTAAATAT 3540 

TTACTAAACG CTTTAAGTCC TATTTCTGTT TGAATGGGAC TTGTAAACGT CCCAATAATA 3600 

TTGGGACGTT TTTTTATGTT TTATCTTTCA ATTACTTATT TT TAT TACT A TAAAACATGA 3 6 60 

TTAATCATTA AAATTTACGG GGGAATTTAC TATGCGAaCG AgcATGATCA AAAAAG GAGA 37 2 0 

TCACCAAGCA CCAGCAAGAA GTCTTTTACA TGCCACGGGC GCGCTAAAAA GTCCAACTGA 37 80 

TATGAACAAA CCATTTGTAG CTATTTGTAA CTCTTATATTGATATTGTTC CTGGACATGT 38 4 0 

TCACTTGAGA GAGCTTGCAG ATATAGCTAA AGAAGCAATT AGAGAAGCCG GTGCCATTCC 3 900 

ATTTGAATTC AATACAATTG GTGTTGATGA TGGAATAGCT ATGGGACATA TCGGAATGCG 3960 
ATATTCTCTA CCATCACGTG AAATTATTGC AGATGCAGCT GAAACTGTAA TTAACGCTCA 4020
TTGGTTTGAC GGCGTATTTT ACATTCCTAA TTGTGACAAG ATTACACCCG GTATGATTTT 4 080 

AGCAGCCATG AG G AC AAAC G TACCAGCTAT CTTTTGCTCT GGTGGACCAA TGAAAGCTGG 414 0 

CTTATCTGCA CATGGAAAAG CATTAACACT TTCATCAATG TTTGAAGCAG TCGGCGCATT 4 2 00 

TAAAGAAGGA TCGATTTCTA AAGAAGAATT T T TAG AT AT G GAACAAAATG CCTGCCCTAC 4 2 60 
TTGTGGTTCA TGTGCTGGGA TGTTTACTGC AAATTCAATG AACTGTTTGA TGGAAGTTTT 4 320 

AGGTCTAGCA TTACCATACA ACGGTACTGC ACTTGCAGTC AGTGATCAGC GACGAGAAAT 4 380 

GATTCGCCAA GCAGCTTTTA AAT TAG T T G A AAATATTAAA AATGATTffiA AACCACGTGA 4 440 
TATCGTTACT CGCGAAgCAA TTGATGATGC ATTTGCACTT GATATGGCTA TGGGTGGTTC 4 500 

AACAAACACG GTACTGCATA CGTTAGCCAT TGCCAATGAA GCTGGTATTG ATTATGACTT 4 5 60 

AGAGCGCATT AATGCTATTG CCAAACGCAC GCCATATTTA TCAAAAATAG CACCTAGTTC 4 62 0 

ATCGTATTCA ATGCATGATG TGCATGAAGC TGGTGGCGTC CCAGCAATTA TTAATGAATT 4680
GAT GAAGAAA GATGGCACGT TACACCCAGA TAGAATCACA GTTACTGGCA AAACGTTACG 4 74 0 

TGAAAATAAC GAAGGCAAAG AAATTAAGAA CTTTGATGTC ATTCACCCTC TTGATGCACC 4 8 00 

AT AT GATGCA CAAGGCGGTT TATCTATCTT ATTTGGTAAT ATCGCCCCTA AAGGCGCAGT 4860 
TATTAAAGTT GGCGGCGTTG ATCCATCTAT CAAAACATTT ACTGGGAAAG CAATTTGTTT 4 920 

CAATTCGCAT GATGAAGCTG TTGAAGCAAT AGACAATCGT ACCGTTCGTG CAGGCCACGT 4 980 

CGTTGTCATT AG AT AT G AAG GACCTAAAGG TGGACCAGGT ATGCCTGAAA TGTT&CACC 504 0 
TACTTCCTCT ATTGTTGGTC GCGGCTTAGG TAAAGATGTT GCATTAATTA CTGATGGGCG 5100 
TTTTTCCGGT GCCACAAGAG GTATTGCAGT TGGTCATATT TCCCCTGAAG CTGCATCTGG 5160 
TGGACCAATT GCCTTAATTG AAG AT G G T G A T GAG AT TACT ATTGATTTAA CAAATCGTAC 5220 

ATTAAACGTA AA.CCAGCCTG AAGATGTTCT AGCGCGTCGC CGAGAATCTT TAACACCATT 5280 
TAAAGCGAAA G T AAAAAC AG GTTATCTAGC TCGTTATACT GCCCTAGTAA CTAGCGC AAA 534 0 

TACAGGTGGC GTCATGCAAG TCCCTGAGAA TTTAATTTAA TTTATTTTTA TATTGGAGAT 54 00 

GGTTAAAATG TCTAAAACTC AACATGAAGT AAACCA&AT ATTGACCCTT TAAAAATGGC 5 4 60 
TGAATCACTT GAACCTGAAC AACTAAATGA AAAAACTTTA AATGATATGC GTTCAGGATC 5520 
AGAAGTGCTA GTAGAAGCTC TACTTAAAGA AAATGTGGAT TATTTATTCG GTTATCCTGG 5580 

TGGTGCCGTA CTACCTTTAT ATGACACGTT TTATGATGGT AAAATCAAAC ATATTTTAGC 564 0 

AAG AC AC G AA CAAGGTGCTG TTCATGCTGC AG AAGG T T AT GCACGTGTAT CTGGTAAamT 5 7 00 

GGCGTCGTTG TAG T T AC AAG CGGTCCaGGT GCAACTAATG TAATGACAGG TATTACGGAT 57 60 
GCACATTGCG ACTCTTTACC TCTAGTTGTA TTCACTGGAC AAGTTGCTAC ACCAGGCATT 5820

GGTAAAGATG CATTCCAAG^ AGCGGATATT CTATCTATGA CTTCACCAAT TACAAAACAA 5880 

AATTATCAAG TGAAACGTGT TGAAGATATC CCTAAAATCG TACACGAAGC TTTCCATGTA 5 94 0 

GCTAATTCTG GACGCAAAGG TCCTGTAGTG ATTGATTTTC CAAAAGATAT GGGTGTTTTA 6000 

GCTACAAATG TGGATTTATG CGACGAAATC AATATTCCAG GTTffiGAAGT TGTTACAGAA 60 60 

CCAGAAAATA AAG AC AT T G A CACTTTCATC TCACTTTTAA AAGAAGCGAA AAAGCCTGTC 6120 

GTATTAGCCG GCGCAGGTAT TAATCAATCA AAATCAAATC AATTATTAAC ACAGTTTGTT 618 0 

AATAAACATC AAATCCCAAC AG TT AC TACT TTACTAGGTC TAGGTGCTGT ACCTTATGAG 624 0 

GATACACTAT TTTTAGGTAT GGGAGGAATG CATGGTTCTT ATGCTAGTAA CATGGCATTA 6300 

ACTGAGTGTG ATTTACTCAT TAATTTAGGT AGCCGCTTCG ATGATAGATT AG C AAG C AAA 6360 

CCTGATGCCT TTGCACCTAA CGCCAAAATT GTACATGTAG ATATTGATCC TTCAGAAATC 64 2 0 

AATAAAGTTA TTCATGTAGA TTTAGGTATT ATTGCAGACT GTAAAAGATT TTTAGAATGT 64 80 

TTAAATGATA AAAATGTTGA G AC T AT AG AA CACAGTGACT GGGTTAAACA TTGTCAAAAT 654 0 

AATAAGCAGA AACACCCATT TAAACTTGGT GAAGAAG AT C AAGTATTTTG TAAGCCACAA 6 60 0 

CAAACAATCG AATATATCGG CAAAATTACA AATGGTGAAG CAATTGTTAC fflCAGACGTG 6 6 60 

GGACAACATC AAATGTGGGC AGCTCAATTT TATCCATTTA AAAATCACGG ACAATGGGTT 672 0 

ACAAGCGGTG GTTTAGGAAC AATGGGATTC GGTATTCCTT CGTCAATTGG TGCCAAATTA 67 8 0 

GCTAATCCTG AT AAAAC AG T CGTATGTTTC GTCGGTGACG GTGGTTTCCA AATGACAAAC 68 4 0 

CAAGAAATGG CACTTTTACC CGAATATGGT TTAGATGTCA AAATCGTACT AATCAATAAT 6900 

GGAAC AT TAG GTATGGTTAA ACAATGGCAA GATAAGTTCT TTAATCAACG CTTCTCACAC 6 960 

TCAGTATTTA ATGGTCAACC TGATTTTATG AAAATGGCAG AAGCATATGG CGTCAAAGGT 7 02 0 

TTCTTAATCG ATAAGCCAGA ACAACTGGAA GAA1AATTAG ATGCAGCGTT TGCTTATCAA 7 080 

GGACCAGCTT TAATTGAGGT TCGTATTTCC CCTACTGAAG CTGTAACCCC AATGGTTCCG 714 0 

AGTGGCAAAT CAAATCATGA AATGGAGGGC TTATAATGAC AAGAATTCTT AAAT T AC AAG 7 20 0 

TTGCGGATCA AGTCAGCACG CTAAATCGAA TTACAAGTGC TTTTGTTCGC CTACAATAT 7 2 60 

ATATCGATAC ATTACATGTt ACACATTCTG AACAACCTGG GATTTCTAAC ATGGAAATTC 7 320 

AAGTCGATAT TCAAGATGAT ACATCACTTC ATATATTAAT TAAAAAATTA AAACAACAAA 7 38 0 

TTAATGTTTT AACGGTTGAA TGCTACGACC TTGTTGATAA CGAAGCTTAA TTTTAAGACA 744 0 

AAGGCAATGA TGCGCTAATT AGTTATAGAT ATATCATAGG CTGCTAGTTA ACATCTGCCA 7500 

CT ATT AC AAA GTTATATTTC AGAATTTTCG AAACACAAAA TATTTAATTA TTTGGAGGAA 7 5 60 

TTTATTATGA CAACAGTTTA TTATGATCAA GATGTAAAAA CGGACGCTTT ACAAGGCAAA 7 620 
AAAATTGCAG TAGTAGGTTA TGGATCACAA GGTCACGCGC ATGCACAAAA CTTAAAAGAC 7680 

AATGGATATG ATGTAGTCAT CGGCATTCGC CCAGGTCGTT CTTTTGACAA AGCTAAAGAA 7740 

GATGGATTTG ATGTGTTCCC TGTTGCAGAA GCAGTTAAGC AAGCTGATGT AATTATGGTG 7 800 

CTATTACCTG ATGAAATTCA AGGTGATGTA TACAAAAACG AAATTGAACC AAATTTAGAA 7860 

AAACATAATG CGCTTGCATT TGCTCATGGC TTTAACATTC ATTTTGGTGT TATTCAACCA 7 920 

CCAGCTGATG TTGATGTATT TTTAGTAGCT CCTAAAGGAC CGGGTCATTT AGTTAGACGT 7 980 

ACATTTGTTG AAGGTTCTGC TGTACCATCA CTATTTGGTA TTCAACAAGG CGCTTCAGGT 8040 

CAAGCACGTA ATATTGCTTT APGTTATGCA AAAGGTATTG GTGCAaCTCG TGCAGGTGTT 8100 

ATTGAAACAA CATTTAAAGA AGAAACTGAG ACAGATTTAT TTGGTGAACA AGCAGTACTT 8160 

TGCGGTGGTG TATCGAAATT AATTCAAAGT GGCTTTGAAA CATTAGTAGA AGCGGGTTAT 8220 

CAACCAGAAT TAGCTTATTT TGAAGTATTA CATGAAATGA AATTAATGT TGATTTGATG 8280 

TATGAAGGCG GTATGGAAAA TGTACGTTAC TCAATTTCAA ATACTGCTGA ATTTGGTGAC 834 0 

TATGTTTCAG GACCACGTGT TAT C AC AC C A GATGTTAAAG AAAATATGAA AGCTGTATTA 8 4 00 

ACTGATATCC AAAATGGTAA CTTCAGTAAT CGCTTTATCG AAGACAATAA AAATGGATTC 8 4 60 

AAAGAATTTT ATAAATTACG CGAAGAACAA CATGGTCATC AAATTGAAAA AGTTGGTCGT 8 52 0 

GAATTACGCG AAATGATGCC TTTTATTAAA TCTAAAAGCA TTGAAAAATA AGATAGACCT 8 58 0 

AC AAT GAGGA GTTGTTAAAT ATGAGTAGTC ATATTCAAAT TTTTGATACG ACACTAAGAG 8 64 0 

ACGGTGaACA AACACCAGGA GTGAATTTTA CTTTTGATGA ACGCTTGCGT ATTGCATTGC 87 00 

AATTAGAAAA ATGGGGTGTA GATGTTATTG AAGCTGGATT TCCTGCTTCA AGTACAGGTA 87 60 

GCTTTAAATC TGTTCAAGCA AT T GCAC AAA CATTAACAAC AACGGCTGTA TGTGGTTTAG 8820 

CTAGATGTAA AAAATCTGAC ATCGATGCTG TATATGAAGC AACAAAAGAT GCAGGAAgC 8880 

CGGTcGTGCA TGTTTTTATA GCAACATCAC CTATTCATCT TGAACATAAA CTTAAAATGT 8 94 0 

CTCAAGAAGA CGTTTTAGCA TCTATTAAAG AACATGTCAC ATACGCGAAA CAATTATTTG 9000 

ACGTTGTTCA ATTTTCACCT GAAGATGCAA CGCGTACTGA ATTACCATTC T TAG T GAAAT 90 60 

GTGTACAAAC TGCCGTTGAC GCTGGAGCTA CAGTTATTAA TATTCCTGAT ACAGTCGGCT 9120 

ACAGTTACCA TGATGAATAT GCACATATTT TCAAAACCTT AAC AG AAT C T GTAACATCTT 918 0 

C AAAT GAAAT TATTTATAGT GCTCATTGCC ATGACGATTT AGGAATGGCT GTTTCAAATA 924 0 

GTTTAGCTGC AATTGAAGGC GGTGCGAGAC GAATTGAGG CACTGTAAAT GGTATTGGTG 9300 

AACGAGCAGG TAATGCAGCA CTTGAAGAAG TCGCGCTTGC ACTATACGTT CGAAATGATC 93 60 

ATTATGGTGC TCAAACTGCT CTTAATCTCG AAGAAACTAA AAAAAC AT C G GATTTAATTT 94 2 0 






CAAGATATGC AGGTATTCGA GTGCCTAGAA ATAAAGCAAT TGTTGGCCAA AATGCATTTA 94 80 

GTCATGAATC AGGTATTCAC CAAGATGGCG TATTAAAACA TCGTGAAACA TATGAAATTA 95 4 0 

TGACACCTCA ACTTGTTGGT GTAAGCACGA CTGAACTTCC AT T AGGAAAA TTATCTGGTA 9 600 

AACACGCCTT CTCAGAGAAG TTAAAAGCAT TAGGTTATGA CATTGATAAA GAAGCGCAAA 9 6 60 

TAG AT T T AT T TAAACAATTC AAGGCCATTG CGGACAAAAA GAAATCTGTT TCAGATAGAG 9720 

ATATTCATGC GAT TAT T C AA GGTTCTGAGC AT GAG CATC A AGCACTTTAT AAATTGGAAA 97 80 

CACTGCAACT ACAATATGTC TCTAGCGGCC TTCAAAGTGC TGTTGTTGTT GTTAAAGATA 98 4 0 

AAGAGGGTCA TATTTACCAG GATTCAAGTA TTGGTACTGG TTCATCGTA GCAATTTACA 9 90 0 

ATGCAGTTGA TCGTATTTTC CAGAAAGAAA CAGAATTAAT TGATTATCGT ATTAATTCTG 9960 

TCACTGAAGG TACTGATGCC CAAGCAGAAG TACATGTAAA TTTATTGATT GAAGGTAAGA 100 20 

CTGTCAATGG CTTTGGTATT GATCATGATA TTTTACAAGC CTCTTGTAAA GCATACGTAG 10080 

AAGCACATGC TAAATTTGCA GCTGAAAATG TTGAGAAGGT AGGTAATTAA TTATGACTTA 1014 0 

TAACATTGTT GCCCTACCTG GTGATGGAAT CGGTCCAGAA ATTTTGAACG GATCTCTATC 102 00 

ATTGCTTGAA ATTATAAGTA ATAAATATAA CTTTAATTAT CAAATAGAGC ACCACGAATT 102 60 

TGGTGGTGCC TCTATTGATA CATTCG5CGA GCCTTTAACT GAGAAAACCT TAAATGCGTG 10320 

TAAAAGAGCA GATGCTATTT TACTGGGTGC AATCGGTGGA CCTAAATGGA CAGATCCTAA 10380 

CAATCGACCA GAACAAGGAT TATTAAAATT GCGTAAATCC TTAAATTTAT TTGTAAATAT 104 40 

ACGCCCCACT ACCGTTGTCA AAGGCGCTAG TTCTTTATCA CCTTTAAAGG AGAACGCGT 10500 

TGAAGGCACA GATTTAGTTA TAGTCCGTGA ATTGACAAGT GGTATTTATT TTGGAGAACC 105 60 

TAG AC AT T T T AATAATCACG AGGCCTTAGA TTCTCTTACT TATACAAGAG AAGAAATAGA 10 620 

ACGCATTGTT CACGTAGCAT TTAAATTGGC CGCTTCAAGA CGAGGAAAAC TAACATCAGT 10 680 

T G AT AAAGAA AATGTATTAG CTTCTAGTAA ATTGTGGCGC AAAGTCGTAA ATGAAGTAAG 10740 

TCAATTATAT CCAGAAGTAA CAGTAAATCA CTTATTTGTT GATGCTTGTA GTATGCATTT 108 00 

AATCACAAAT CCAAAACAAT TTGACGTCAT CGTATGTGAA AACTTATTTG GCGATATTTT 108 60 

AAGTGATGAA GCTTCAGTGA TTCCTGGTTC ACTGGTTTA TCACCTTCTG CTAGTTTTAG 10 920 

TAACGATGGT CCAAGATTGT ATGAGCCTAT TCATGGATCA GCACCAGATA TTGCAGGTAA 10 980 

AAACGTTGCC AATCCATTTG GAATGATTCT ATCTTTAGCG ATGTGTTTAC GTGAAAGCTT 1104 0 

AAATCAACCA GATGCTGCAG ATGAATTAGA ACAACATATT TATAGCATGA TTGAACATK 11100 

GCAAACGACA GCAGATTTAG GCGGCAAATT GAATACTACT GATATTTTCG AAATTCTATC 11160 

TCAAAAATTG AATCACTAAG GGGGAGATGT AAATGGGTCA AACATTATTT GACAAGGTGT 112 20 

GGAACAGACA TGTGTTATAC GGGAAATTGG GCGAACCGCA AC TAT TAT AC ATTGATTTAC 112 8 0 
ACCTTATACA TGAAGTTACT TCTCCTCAAG CATTTGAAGG ACTTAGGCTT CAAAACAGAA 11340 

AATTAAGACG CCCAGATTTA ACATTTGCAA C AC T C GAT C A CAATGTTCCT ACTATTGATA 114 00 

TATTCAATAT TAAAGATGAA ATTGCAAACA AACAAATCAC AACATTACAA AAAAACGCCA 114 60 

TAGATTTTGG GGTGCATATT TTTGATATGG GTTCTGATGA ACAAGGTATT GTTCACATGG 11520 

TAGGACCTGA GACAGGACTT ACACAGCCTG GCAAGACAAT CGTTTGTGGT GACTCTCACA 11580 

CAGCAACACA TGGAGCATTT GGTGCTATTG CATTTGGAAT TGGAACAAGT GAAGTTGAAC 11640 

ATGTTTTCGC AACTCAAACG CTATGGCAAA CAAAACCCAA AAACTTAAAA ATCGATATTA 11700 

ATGGTACCTT ACCAACAGGC GTCTATGCTA AGGACATTAT TCTGCATTTA ATTAAAACGT 11760 

ATGGTGTTGA CTTTGGTACA GGCTATGCTT TGGAATTTAC TGGCGAAACA AT T AAAAACC 1182 0 

TTTCAATGGA TGGTCGAATG ACTATTTGTA ACATGGCTAT CGAAGGTGGT GCCAAATACG 1188 0 

GCATAATCCA ACCTGATGAT ATAACATTTG AATATGTTAA AGGGAGACCA TTTGCCGATA 11940 

ACTtCGCTAA ATCAGTTGAT AAGTGGCGTG AgCTATATTC TGATGACGAC GCGATATTTG 12000 

ATCGTGTAAT TGAACTTGAT GTTTCAACAT TAGAACCACA AGTGACATGG GGAACTAATC 12 0 60 

CTGAAATGGG TGTTAATTTC AG T GAAC CAT TCCCTGAAAT CAATGATJEC AACGATCAAC 12120 

GTGCGTATGA TTATATGGGG TTAGAACCAG GTCAAAAAGC TGAAGACATC GACTTAGGGT 1218 0 

ATGTTTTTCT CGGTTCATGT ACAAATGCTA GACTATCAGA TTTGATTGAA GCTAGTCATA 1224 0 

TTGTTAAAGG AAATAAAGTT CATCCAAATA TTACAGCTAT TGTCGTACCA GGTTCTCGTA 12 300 

CAGTAAAAAA AGAAGCAGAA AAATTAGGTC TAGATACTAT CTTTAAAAAT GCAGGATTTG 12360 

AATGGCGTGA ACCAGGATGT TCAATGTGTT TAGGCATGAA TCCTGACCAA GTACCTGAGG 12420 

GCGTACATTG TGCATCTACA AGTAATCGAA ACTTTGAAGG ACGACAAGGC AAAGGTGCAA 124 80 

GAACACATTT AGTATCCCcT GCTATGGCAG CAGCAGCAGC TATTCATGGT AAATTTGTGG 12540 

ACGTAAGAAA GGTGGTTGTT TAAATGGCAG CAATCAAACC TATTACAACA TATAAAGGTA 12 600 

AAATAGTCCC TCTCTTCAAC GACAATATCG ATACAGACCA AATCATTCCT AAGGTACACT 12 660 

TAAAGCGTAT TTCAAAAAGT GGCTTTGGTC CATTTGCTTT TGATGAATGG CGGTSTTAC 12720 

CTGATGGTTC AGATAATCCT GATTTCAATC CTAACAAACC ACAATATAAA GGGGCTTCTA 12 78 0 

TTTTAATTAC TGGAGATAAT TTTGGATGTG GTTCAAGTCG TGAACATGCT GCTTGGGCTC 12840 

TTAAGGACTA TGGTTTTCAT AT TAT TAT TG CAGGAAGTTT C AG T GAC AT A TTTTATATGA 12 90 0 

ATTGCACTAA AAATGCGATG TTGCCTATCG TTTTAGAAAA AAGTGCCCGT GAACATCTTG 12960 

CACAATATGT TGAAATTGAG GTCGATTTAC CAAATCAAAC TGTGTCATCA CCAGACAAGC 13020 

GTTTCCATTT TGAAATTGAT GAAACTTGGA AGAATAAACT TGTAAATGGC T TAG AT GAC A 1308 0 
TTGCAATCAC CCTACAATAT GAATCATTAA TAGAAAATA TGAAAAATCa CTTTAAGGGA 13140 

GTTGAATATT ATGACAGTCA AAACAACAGT TTCTACGAAA GATATCGATG AGGCATTTTT 13200 

AAG AC T T AAA GATATTGTCA AAGAAACACC TTTACAATTA GACCATTACT TATCTCAAAA 13260 

GTATGATTGT AAAGTCTATT TAAAACGAGA AG AT T T AC AA TGGGTACGTT CTTTTAAATT 13320 

AAGAGGTGCT TACAACGCTA TTTCTGTTTT AT C AG AT GAA GCTAAAAGTA AAGGTATTAC 13 380 

ATGTGCAAGT GCAGGTAATC ATGCTCAAGG TGTTGCCTAT ACAGCTAAAA AACTTAATTT 13440 

AAACGCTGTT ATCTTTATGC CAGTCACTAC AC C T T T AC AA AAGGTAAATC AAGTAAAGTT 13500 

CTTTGGAAAT AGTAACGTTG AAGTTGTACT CACTGGTGAT ACATTTGATC ACTGTTTAGC 13560 

TGAAGCTTTA ACT T AT AC AA GTGAACATCA AATGAACTTT ATAGATCCAT TCAATAATGT 13 620 

TCATACAATT TCTGGACAAG GTACGCTTGC TAAAGAAATG C T AG AAC AAG CAAAGTCTGA 13 680 

CAATGTTAAC TTTGATTATC TATTTGCCGC AATTGGTGGT GGCGTTTAA TTTCAGGTAT 13740 

TAG T AC T T AC TTTAAAACCT ATTCACCTAC CACGAAAATT ATAGGTGTTG AACCTTCAGG 13800 

TGCAAGTAGT ATGTATGAAT CTGTTGTGGT AAATAATCAG GTAGTCACAT TGCCTAATAT 138 60 

CGATAAATTT GTGGACGGTG CATCTGTAGC TAGAGTTGGC GATATTACAT TTGAAATTGC 13920 

AAAAGAAAAT GTAGATGATT ACGTTCAAGT AG AT G AAGG T GCAGTTTGTT CTACGATTTT 13 980 

AGAT AT GT AT TCAAAACAAG CAATTGTAGC AGAACCTGCT GGCGCATTAA GTGTAAGTGC 14 040 

GCTTGAAAAC TATAAAGATC ATATTAAAGG TAAAACAGTG GTTTGTGTCA TTAGTGGTGG 14100 

TAATAATGAT ATTAATCGAA TGAAAGVAAT TGAAGAACGT TCATTACTAT ACGAAGAAAT 14160 

GAAGCATTAC TTTATCTTAA ATTTCCCTCA ACGTCCAGGT GCATTGAGAG AATTTGTAAA 14220 

TGACGTATTA GGACCTCAAG ACGATATTAC TAAATTTGAA TACTTAAAAA AATCTTCTCA 1428 0 

AAATACAGGT ACT GT CAT TA TTGGTATTCA ACTTAAAGAT CATGATGATT SATACAACT 14 340 

CAAACAACGT GT AAA t CAT T TCGATCCTTC CAATATTTAT ATTAATGAAA AT AAG AT G T T 14 4 00 

ATATTCATTG TTAATTTAAC ACATAGTAAG AAAAACAGTC ATAAATTGAT TTCTAATTGA 14460 

AATCATCTTA TGACTGCTTT T TAT TAT ACT TTACATTTCT CGTTTCGTCA GATTCAAACG 14 520 

TTTTCACTTC GCCAAGCCAT CTTTCTTTGT GTTTGCTTTT aTTTTGACGT TTTAGACATA 14 580 

AAAAA a GAG A CCTTGCGGTC TCAATGCGGC TCATCGCATC CACTTTTTGC CTGGCAACGT 14 640 

TCTACTCTAG CGGAACGTAA GTTCGaCTAC CATCGACGCT AAGGAGCTTA ACTTCTGTGT 14700 

TCGGCATGGG AACAGGTGTG ACCTCCTTGC TA'AGTCACC AGACATATGA ATGTAATTTA 147 60 

T AC AT T C AAA ACT AGAT AG T AAGTAAAAGT GATTTTGCTT CGCAAAACAT TTATTTTGAT 14 820 

TAAGTCTTCG ATCGATTAGT ATTCGTCAGC TCCACATGTC ACCATGCTTC CACCTCGAAC 14 880 

CTATTAACCT CATCATCTTT GAGGGATCTT ATAACCGAAG TTGGGAAATC TCATCTTQSk 14 940 
GGGGGCTTCA TGCTTAGATG CTTTCAGCAC TTATCCCGTC CACACATAGC TACCCAGCTA 15000 

TGCCGTTGGC ACGACAACTG GTACACCAGA GGTATGTCCA TCCCGGTCCT CTCGTACTAA 150 60 

GGACAGCTCC TCTCAAATTT CCTACGCCCA CGACGGATAG GGACCGAACT GTCTCACGAC 15120 

GTTCTGAACC CAGC1CGCGT ACCGCTTTaA TGGGCGAACA GCCCAACCCT TGGGACCGAC 15180 

TACAGCCCCA GGATGcGGAT gagCCCTACA TCGAGGGTGC CAAAACCTCC CCGTTCGGAT 1524 0 

GTGGAACTT 15249 
(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14051 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

GTGGCAATAT TTCTAGTTCT CGTTTTGATA AGATTTAAA AGGATCTGTT GTGTTTGCAG 60 

TGTCCTGATT TGAATTAGAT ACAAATTCAT TCACTAAAGA TGTTGTAAGT TTCATATCTA 12 0 

CATATGTTTC ACCTTTATAT AC AG T T C G AA TAGCTAACAA TAATTGTTCA TCAGGTGCAT 18 0 

TTTTCAATAT GTAACCTTTC GCACC AT T AC GCAACACATG GAACAAATAC TCCTCATCAT 24 0 

CAAACATTGT TAATATTAGT ATTTTAGTTT CAGGAAAACT GTCAGCAATT TTACTCGTAG 300 

CGATAAGACC TGACTCACCT GGTGGcATAC TTAAATCCAT TAGTAACACA TCAGGTTTAt 360 

ATTCCATTAC TTTTTGGTAA GCTTCGACGC CATCTGCAGC CGTTGCAACA ACTTCCATAT 4 20 

CATTTTGATA ATTTAASATC ATAGAGAACC CCGTACGGAC AACAGCGTGA TCATCGGCAA 4 80 

TGACTATTTT CAATTTTATT CCCCCAATGT ATGTTTCAAA TTGGAATGTT CAATGTAACA 54 0 

TTGGTACCCT CACCAATTTT CGTTTCAATA TTGACGCTAC CGCTGACTAA CTCAGCTCGC 600 

TCATTCATTC C AT AT AAAC C GAG T C C AG AA CCTTTAGGCT TfiAACTTGG ATCAAAACCA 660 

TTTCCCGCAT CTATCACTTC TGCTACCAAA TGGCGCCCAG TTTGACGGAT ACCTACATTT 720 

ATTTCATTTA CATCAGCGTA TTTCAACGCA TTTAAAATAG CTTCTTGCAC TACTCGATAA 7 80 

ACAACCGTTT CAATATCACT ATCAAAGCGA GTATTTTTAA TATTTGATGT ATATATGATT 84 0 

TTTATTCCAT AATTTTCTTC AAACTGTTTA AAATATGATT TAAAAGCTGC TTCAAGGCCT 900 

AGATCATCCA AAGAAGCGGG TCTTAATTCA ACCGACATAT TACGTATATC ATCAATTAAT 9 60 

TTAGCGACAA TATATTCAAT ATTTTCTGCG TCTTCCAAAA GCTTAGTTGT ATCTTCTTGA 102 0 

TATTTTAATA ATCTCAATTG AACATCTACA TTGAGCATTT CTTGAATCAC ACTATCATGT 108 0 
AACTCTCTAG AAATTCGCTT TCTTTCATTT TCTTGGGCTG AGATTGTTTT ACGCATCATA 1140 

CGTTGTTGAT GCAATTTCTC TTGCTGTTCA ATTTGTGATG AAACATTTTG AAGCGTAAAT 12 00 

GCATGAATTC CCCTGTCTTG ATCAATCAAC TGATATGTTG CTGTAAATGGCATCACTTTT 12 60 
TGATCTTTCG TCTTCATAAA TACT TGG AAA TTCGTAGCTT GTACTTGCAT CGATTCTAAG 132 0 

AAACAGTCTT TACATGATTG TACATCATAG GCATTTGTAT ACCCTTCACA TCTGTGACAA 138 0 

ATCGCATTCG CCACAGCACT GTAATTATCT TCTTCAGATA ATATATCTTT AGCAGCATCA 14 4 0 

TTCATTGCAA TAATTTTACC GTTATCATCA GCAAAAAC T A TCTTTTCGAT TGAATGCTCA 1500 
TAATATTTTT TCAATAAAGT ATCTAACTGT ATACTGTCCT CATTAATCAT GACTTACACC 15 60 

CTAATTCATC TCATTATTTA TCATCATTGA AAATACCAAA CTTACGTTGA ATATCATCAT 1620 

TATCAAATAT TTTTGGTAAA GGACGAC CAT (TCTTTGACC AAATAATAGT ACGCCATACA 168 0 
CTTGATTCTT ATACCAAAGC GGCACTGCTA AAACTGCTGT TAATGATTCG CTCAATAAAA 17 4 0 

TTGGATAGTC AATCTTTTCT TCAGGCCCTA AAGCTAAACC AACATTGGCT ATTACCATAC 1800 

GCTTTCCTGT TTTCATAACA GTTCCAGCTA ATCCACGACC TTTTCTTAAA ATAATC&TT 18 60 
TAAATCGATT ATTTTTATTA CCTGAAACAT AGTGCCATTT TATTGGAGAT GATGGTTTGT 1920 
TAGATTCATA GAAAGCGATT GCCGCAAAAT CATAACCCTC TTCTTTGCGT ATTTTATCTA 1980 
ATGTCTCTTG AAATCTACGA TCTTCAATTA TTGCTTCTGG TGTCAAATCC TTTCACCTCT 2040 

TATGCTTACA CTTTATTCTT ACGGTAAATA ATATATCTGC GATTTATATA TGTCAAAGGT 210 0 
AC AC T C C AAA CATGCACCAA ACGTGTAAAT GGCCAACAAG CCATAATAGT GAAACCTAAC 2160 
AATATATGCA TTTTAAATGC AATCGGCACA CCACTCATCA ATGACGCATC TGGTTTTAAC 2220 

ATAAATAATT GTCTAAACCA AATTGATAAT GAAGTTCTG' AGTTAAAGTC TGGATGTTGT 2280 
ATATTTGTTA CTAATGTTGC GTAACATCCC AT AAAT AC GA TAAGTAATAA TAAGAAATTT 2340 
ACAAATATAT CCGACGCTGA ACTTAATCTT CGAATACTTT TCGTAGTAAC ACGTCTCGCT 24 00 

GTTAATAAAA ACATCCCTAT CAAAGTTATT ATACCAAAGA TGCTACCAAT ATAAACAGCG 24 60 
CCTATATGAT ATAAATGCTC AGACACACCC ACTGCATCCA TCCATGGTTT CGGTATTAAC 2 52 0 

AATCCAACTA CGTGTCCAAA AAACACTGGA ATAATACCTA AGTGAAATAA TAAACTTCCC 2580 
CACATCAACC TTTTTCTTTC TATTAATTCA CTAGATTTAG CTGTCCAAGA AAAT T TAT CA 264 0 

TAACGATAAC GTGCAATATG ACCTGCGACA AAGACAACTA AACATAAATA CGGAAATATA 2700 
ACCCATAAAA ACTGATTAAG CATGATGTTT CACTCCTTTT GGTGATGTCA AACATAATTT 2 7 60 

CAATGTTTTT CTAAGTGCTT GAATCACATA GGCATATGGA TTGTTATCTT CACCAAGTGC 2 820 

ATTCGCCATC ACATATGTTC CATCCTCAAT AATCATAATG ATTAATGAA TATTCTCTTC 28 8 0 
AGCTCTTGGA TCATTTCGCC ATTCTGCCAC TTGCAAAAAT TGAAGCATCA ACGGTAGATA 2 94 0 
ATCAGAAAGT TCATTATCTA CCATTTCTAG TCCAAACATT TCATATAATA CCTTTAATTT 3000 

AGCTAACATT TGCCCACGTT CTTTTTGCGT ATCAAATTTG TTATACGTCA TATATAATGG 30 60 

TGCTTTTTTC GTAAAATCAA ATGTATCTGT ATAAATCGCT TTGATTTCTG ATAATGAAAA 3120 

TGTCATCATT ACATTTCTAA ATGCTACTAA ATCTTCATAT GCTGGATGAT CTTTCGAAAT 3180 

TGTTTCTTCA AAAGTTTTTG GATGAAAAGT TAATTTTTCT GGAAAACATA ACTGTTGTGC 32 4 0 

CATATATCCA AAACTTTCTT GATATTTTfT AAAATTATCG AAATTAATCA CGGAAAATCC 3300 

CTCCATAGAA ATTCTCATTA TAAATTTCTT GACCAGTTTT CCCTGAACCT ACTGCAACGC 33 60 

CACAGCCTTC ACAGTTATCT CCAAAATGCT CGCCGCCGTA ATTGTATCCT GTACTACCTT 34 20 

GTGCGTGATA CGTATCTAAA TAGGTTTCTT TGTGTGATGT TGGAATAACA AATGATCTT 34 8 0 

CATATTTGGC TAGTCCTAAT AAAC GAT AC A TGTCTTTAGT TTGGCGCTCG GTTATACCTA 354 0 

ATCGCTCTAA TCGAGACGTG TCAAATGGCT GTTGAGTAAC TTGAGATCTC ATATAACTTC 3600 

TCATCATTGC CATACGTTGT AGGGCTCCTT TTACTGGCTC TGTATCTCCT GCAGTGAAAA 3 6 60 

TATTAGCTAA GTATTCAATA GGTAAACGCA TTTCTTCAAT GGCTGGGAAA ATCGCATCTG 37 20 

GATTTTGAGT TGTATTTTTA CCTTCAAAAT AGCTCATAAT TGGGCTAAGT GGTGGGCAAT 37 80 

ACCAAACCAT CGGCATCGTT CTAAATTCAG GATGTAACGG AAATGCAAGT TTATATTCAA 38 4 0 

TTGCTAACTT ATAAATTGGA GAGTTTTGTG CAGCTCAAT CCAATCGTAA CCAATACCAT 3900 

CTTTTTCAGC TTGAGCAATG ACTTCTTCGT CAAATGGGTT TAAGAATATA TCTAATTGTT 3960 

TTTCATATAA ATCTTTCTCG TCTACTGCTG AAGCTGCTTC ATGAACTCGA TCTGCATCAT 4020 

ATAATAAAAC ACCTAAGTAA CGCATACGTC CTGTACAAGT TTCAGAGCAT ACCGTAGGCA 4 08 0 

TACCCGCCTC GATTCTCGGG AAACAGAAAG TACACTTTTC AGCTTTGTTC GTTTTCCAAT 4140 

TGAAGTAAAC TTTCTTATAT GGACAACCTG TCATACAGTA ACGCCATCCA CGACATGCGT 4 2 00 

CTTGGTCAAC TAATACAATG CCATCTTCAT CACGTTTATA CAT AGC ACC T GAAGGACACG 4 2 60 

ATGCAACGCA ACTTGGATTC AAGCAATGTT CACATAAACG TGGTAAATAC ATCATAAAAG 4 32 0 

TTTCGTCAAA TTGGAATTTA ATATCTTCTT CTATTTTTTG GATGTTAGGA TCTTTTGGAC 4380 

CTGTAACATG ACCACCTGCT AAGTCATCTT CCCAGTTAGG TCCCCATTCA ATTTCAATGT 44 40 

TATCCCCCGT AATTTCTGAA TACGCTCTAG CAACTGGCGA AECTTCCCT GATTTCGCAG 4 500 

TTGTTAAATG TTCATAATTA TAGTTCCATG GCTCATAATA ATCTTTAATT AATGGCATAT 4 5 60 

CTGGGTTATA AAAAATTTTA CCTAAAGCAA TTTTTGAAAT TCTACTTCCA GATTTTAATT 4 62 0 

CAAGTTTCCC TTTACGATTT AGTACCCAAC CACCTTTGTA GTGTTCTTGG TCTTCCCAAC 4 68 0 

GTTTCGGATA CCCTACACCT GGCtTCGTTT CTACGTTGTT GAACCACATG TACTCAGCAC 474 0 
CTGGACGATT TGTCCaAGTG TTTTTACATG TCACACTACA CGTATGGCAT CCTATGCATT 4800 

TATCTAAATT TAATACCATC GCAAcTTGCG CTTTAATCTT CAAGCCAATT AACCTCCTTC 4 8 60 

ATCTTTCTAA CTGCTACATA TAAATCCCTT TGGTTCCCAA TTGGTCCATA ATAATTAAAG 4 920 

TGATAACTAA TTTGTGCGTA TCCTCCGACT AGTTGTGTTG GTTTCAAATG GATTCTAGTC 4 980 

GGCGCGTTGT GTGAACCACC ACGTGTATCT GTAATTTCTG ACCCAGGCGT TTGAATATGT 504 0 

TTATCTTGTG CATGATACAT AAACATTGTA CCTTTAGGCA TACGATGCGA AATAACTGCT 5100 

CTTGCCGTTA CAACACCATT ACGGTTATAC ACTTCTAGCC AATCATTATC TTGGATATCG 5160 

TGTTTTTCAG CATCTTCATT T GAT AT C C AA ACCGTTGGAC CACCTCTAAA TAGTGTCAAC 5220 

ATATGCTTAT TATCTTGATA CATTGAGTGT ATATTCCATT TTCCATGAGG CGTTAAATAA 5280 

CGCAgTACCA AAGCATCTGT ACCACCTTTA ATTTTCTTAT CTCTATTCCC AAATACCATT 534 0 

GGCGGCAATG TCGGTTTATA TACTGGTAAG CTCTCCCCAA ATTGTTGGAA AACTTCGTGA 54 00 

TCCACATAAT AACTTTGACG TCCTGTTAAT GTTCTAAAAG GTACTAGACG TTCTATATTC 54 60 

GTTGTAAATG GTGAATATCG TCGACCTTGT TTATTTGAAC CTGGGAATAC TGCTGTCGGT 5520 

ATTACTTCTC GTGGTTGTGA AGTTATATTT AAAAACGAAA TTTTCTCAGC AGCGCGTTCG 5580 

C T AG AAAT AT CTTTTAACGG CATTCCAGTT TGTTCTTCGA GATCTTCATA TGATTTTTGT 5640 

GATAATTTAC CATTCGTAGC AGATGAAATA CTTAGTATTG CATCAGCTAC ATTACGTCT 57 00 

GTATCAATAC GTGGACGATT CGCTCTCACA G AAT CAT CAT TTGTATCACT CCACGTACCT 57 60 

AACATACTTT TTAATTCTTC ATATTGTTCA CTGACACCGA AACTTACACC ATGTGCTCCA 5820 

ACTTTCCCTT TTTCAAGTAC AGGACCAAGC GTGACATATT TGTCGTAAAT TTTAGTGTAG 58 80 

TCGCGTTCTA CAATTGCAAA GTTAGGCATT GTACGTCCAG GTACCGCTTC AATTTCACCC 5940 

TTCGACCAAT CTTTCACTAC GCCGTATGGT GTTGAAATTT CTTGCTTTGT ATCATGACTA 6000 

AGTGGAGTTG TCACAACATC TTTAAACGTT CCAGGTAAAT AGTCTTTTGC CATTTCTGAA 60 60 

AATGCTTTTG CCAACGTTTT ATAAATATCC CAGTCTGAS GCGATTCCCA TAACGGATCA 6120 

ATGGCAGGAT TGAAAGGATG TACATATGGA TGCATATCCG TTGATGATAA ATCATGTTTT 6180 

TCATACCAAG TCGCTGCCGG CAAAACAATG TCAGAATATA ACGGTGTTGC CGTCATTCTG 624 0 

AAGTCTAAAG AG AC C AC T AA ATCTAACTTA CCTGTTGTTT CTTCACGCCA CGTAATTTCT 630 0 

TCTGGCTTTT CATCTTCATT TGGTGTAGCT AATAACCCTG ATTTTGTGCC AAGTAAATGC 6360 

TTCATAAAGT ATTCTTGACC TTTTGCAGAA CTTGAAATTA AGTTTGAACG CCATATAAAT 64 20 

AATGATTTTG GATGATTCTT TTTCAAATCA GGATCTTCTA TTGCAAATTG TGTTTGTTTT 6480 

GATTTCACTT CATCAATTGC ACGTTGCAAA ATCGCTTCAT TTGAATCTAT ACCTTCATCT 654 0 

TTAGCTTCTT CTGCAAACAA CAAACTATTT TTATTAAATT GTGGATATGA TGGTAACCAA 6600 
CCAAGTCTAG CTGCTAAAAC ATTATAATCA GCTGGATGTT GATGCTTTAA CTCCTCTGTT 6660 

TTAGCTAATG GAG AT T T T AA ACGATCTACA TTTGACTCTT CATATTCCA TTGGTCTGTT 672 0 

GCAAAATAGA ACCAACTTGT TCCGTTTTGC AAACGTGGTG GTCCTTGCCA GTCTTTCGCA 67 8 0 

AATGCGACAG TACTCCATCC TTCAATCGGA CGACATTTTT CTTGTCCCAC ATAGTGAGCC 68 4 0 

CAACCGCCAC CATTCACACC TTGACAGCCA CATAACATAA CTAAGTTTAA GATTGAACGA 6900 

TAAATCGTAT CTGAGTTAAA CCAATGGTTA ATACCCGCAC CCATGAT7AAT CATTGAACGC 6 9 60 

CCTTCAGTAT CGATAGCGTT TTGCGCAAAT TCTTTCGCTA CTTGAATGAC AACACTTTGT 7020 

TTTACGCCTG AAATGGCTTC TTGCCAAGCA GGTGTATATT TTGATTCTGC ATCGTCGTAT 7080 

CCTTTTGATT CTAATTTATG AT C AAAACG A CGCACGCCAT ATTGACTTGC CATTAAGTCA 714 0 

AAAATTGTAG CAATACGGAC TTTGTCACCA TTTGCTAAAG TGACTTGTCG AGTTGGAATT 7200 

GGACGATTGA ATATCCCATC TCCATCACTA TCAAAGTATG GGAATTGAAT TGTTTCTAAT 72 60 

TCGTATCCAC CTTCTGTCAT TGATAATGTA GGGTTAATTT T AG AAC CATC TTEGTTTCT 7 32 0 

AGTTTTAAGT TCCACTTCTT ACCTTCTTCC CAACGTTGAC CCATTGTGCC ATTAGGTACT 7380 

ACTAAACTAT CGCTGATTGC AT CAT GAAT A ACTGGCTTCC ATTCGCCTTG CTCTGTTGTT 74 4 0 

TGACCTAAGT CACTCGCTCT TAAAAATCGA CCCGCTTTAT ATCCATTTTC ATCTTCATCC 7 5 00 

AGCATGATAA GAAACGGCAT ATCTGTATAT TGTTTAGCGT AAT T TAT AAA GCGTTCATTA 7 5 60 

GGTTGATTAA CATAATGTTC TTGTAAAATA ACATGCGTCA TTGCTTGTGC AATTGCAGCA 7 62 0 

TCTGAACCAG GATTCGGTGC TAGCCAGTTA TCTGCAAATT TCACATTTTC TGCGTAATCT 7680 

GGTGCTACTG AAATGACTTT TGTACCTTTA TAGCGACTT CAGTCATAAA ATGTGCATCC 7 74 0 

GGAGTACGTG TTAAAGGTAC ATTAGAGCCC CACATAATAA TGTATGATGC GTTATACCAG 7 8 00 

TCACTTGATT CAGGCACATC TGTTTGCTCT CCCCAAATTT GTGGAGAGGC AGGTGGTAAA 7 8 60 

TCTGCATACC AGTCATAAAA ACTAAGCATT TCACCACCAA GCAAATTGAT GAATCGAGCA 7 920 

CCTGCTGCAT AACTAATCAT TGACATCGCT GGAATAGGTG TAAATCCTGC GATTCGATCT 7980 

GGACCATATT TTTTTATTGT ATACAGTAAT TGTGCTGCGA TTATCTCTGT AACGTCTTTC 804 0 

CAATTTGAAC GCACGTGCCC TCCCATACCT CGGGCTTGCT TATATTGTTT GGCTTTGTCT 8100 

TCATTTTCAA CAAT AGPC GC CCATGCAGCA ACGCGATTAC CATTGTTTTC TTCTAATGCT 8160 

TCAGTCCATA AATCCCAGAG TTTTCCACGA ATATATGGAT ATTTGATTCG AAGCGGACTG 8220 

TATTCATACC AAGAGAATGA CGCACCTCGT GGACATCCTC TCGGTTCATA TTCAGGCATA 828 0 

TCCGGACCAC AACTTGGATA GTCAGTTTGT TGATTTTCCC AGTAATCAC ACCATTTTTC 8340 

AC AAAT AC T T TCCAAGAACA TGAGCCTGTA C AG T T AAC AC CATGTGTTGT TCTTACTTCT 8 4 00 
TTATCGTGGC TCCAACGTTC TCTGTACATT TTTTCCCATT CTCTACTTTT ACTTTCTAGG 8460 

ATCGACCAAT TCCCATTAAA TTTTTCTGTT GGCTTAAAGA AATTCAATCC AAATTTTCCC 8520 

ATATTTATAT CCTCCTACGT ATAAAAATAC GATGTGTAGA TGTCGTGTTT TTAAATACTT 8 580 

TAAAATGCCC AAGACTATTG CTTTAATTAG ATTGTACATT TTTTCACAAA CATAAAATAT 8 64 0 

TAGGGAATCA CCTAATTACT TAAGGAATTT CCCTATCAAT AACGGGATTT CATTGAAATA 87 00 

AT AC AC AAT C ATGTATGGTC ATGCTTATTG CCAATCTAAA TCGTTCAAAT TTGGCACAAC 87 60 

GACAAATAAG GCTTCAACAC GAATATATTC TCTCGGTTGA AACCTTACTT ATTCATTTAT 88 20 

TTTTTATAAA T T AG T G AC AT AACACTGTAT TAGCATCTGC ACGATCGGTT GAAATATATG 8880 

TTACATTTTC TTGCTGCTTA ATAAATGCAT CATAGTAATC ATATTGCGAC GAATGATATG 8940 

TGCCATTCGA TGTATCATTT GGGTTTAGCA AACAGCCATA ACCTTCGTCA TATAAATGTT 9000 

CACAGAGCAT AAGGGCGTCA TGTTTAGAAC CACTTACTAC ATAAAATTGC TTCATAGGAT 90 60 

CATATGATTT AGGAGTGTTT T C AG TAT AAT CAACAACTTC CCCTATAATA CATATACCTG 912 0 

GTTTCGCCTC AATTGAATAG TGTTGCAATT TTGAAATAAT ATTACTTAAA CGCCCCTTAA 9180 

CAACAAACTC GTTAAAACAC GATGCTTGAA AGACAATCGC TATCGGGTAA TCAATATCTG 92 4 0 

TGTATTGTTG TATCTGTGTG ATAATTTTCC CTAAACGTTT TACCCCCATA TAAATTGCTA 9300 

ACGTGCCACC ATTCACTAAG G AAT T G AC AT (CACTTCATT TTCTTCTGAA TCTTTAAAGT 9360 

GACCTGTAGA AAATGTCACA CTTTTAGCAA CTGTACGCAT TGTCAAACCT GTCTGCATAG 94 2 0 

TAGCAACTGc tGCGCTCGCT GATGTCACCC CTGGTACAAT TTCAAACGCA ATATGATGTT 94 80 

CATTTAGTAT GTCGACTTCT TCTTGCACAC GACCAAATAT CGCTGGATCG CCACCTIAA 954 0 

GTCTAACAAC CTTGTTATAT CGACGCGCTG CTTCCACGAT ACAGTCATTT ATTTTTTCTT 9600 

GCTGAATATG TTTTGCATAC GGCTTTTTAC CAACATCGAT AATTTCAGTA GTCAAATTCG 9 660 

CAT AT TGT AA AATTAACGGA TTCACTAATC GATCATATAG AAT G AC AT C C gCTTCACGTA 97 2 0 

TTAAACGCTC AGCCTTTTTC GTCAAATAAT TCGGATTACC TGGACCCGCA CCTATCAAGT 97 8 0 

AAACCTTGCC ATATTCCTCT ACAGACATAT ATATACGTTC CCGTCTGTAA CTTCTACCTC 98 4 0 

AT AAAC AT C T ACACAACCTT CATCAGGTTC TTGAACAATA CCTGTATTTA AATCAATTTT 9900 

TTGATCGTGG AGCGGGCAAA ATACATATTC CCCACTCA<T GTCCCTTCAG ACAATGGTCC 9960 

TTGTTTGTGT GGACAGATAT TGTGAATCGC ATGAATTTTG CCACTTTCTG TTAAAAACAA 10020 

CCCTACCTCT TTGCCTTTGA CAATAACCTT TTTTCCAATT AGGGGTGTTA ATTCATCTAT 10080 

AGTTGTCACT TTAATTTTTT CTTTTGTTTC CATGTATTAC ACCTTCTCCA CTTCAAAAAT 10140 

TCTACGTGCT TGAGCATTGC TAGTTATTGC TTCCCAAGGT TCAGCTTCGA CTGCTTTTTT 10200 

AGCATCCATA ATGCGTTCAA ATAGTTCATT TTGTCTTTCT GGGTCAAGTA AGACTTCTTT 102 60 




TACATTTTCA AATCCAAGTC TTCTTAACCA TGGCGCTGTT CTTTCAGCAT ATATACCTGT 10 320 

TTCGCGATAA TATTGCATCA AAGCACCACA TAATTTGATT ACTTCATCTT CTGTTTCTAC 10380 

AGTTGTTAAA AATTCAGCTT TTTCAACTTC TGTACCACCA TTACCACCGA TATAGATTTG 104 40 

GAATCCATTT TCAACTGAGA TAATACCAAA ATCTTTAACA CCTGATTCAA CACAACTTCT 10500 

TGGGCAGCCT GATACACCCA TTTTGAATTT ATGAGGTGTA TCGATGATT CAAATGTTTT 105 60 

TTCTAAACGA ATGCCAAGTC GTGTCGTGTA TTGCGTACCA AAT C G AC AAA ACTCTTTACC 10 620 

AAC AC AG C T T TTAACTGAGC GTGTTTTCTT ACCATAAGCT GATGcTGAAC GCATACCTAG 10 680 

GTCTTCCCAT ATATTTGGTA ATTCTTCTTT TTTAACTCCA TACAAACCAA CACGTTGTGA 10740 

ACCTGTCACT TTAACTAGTG GCACATGATA TTTCTTAGCC ACTTCTCCTA G AC G AAT C AG 108 00 

TTGGTCTGCA TCTGTAACAC CCCCACGCAT TTGAGGTATA AC AG AAAAT G TACCATCATT 108 60 

TTGAATATTC GCATGGTAAC GTTCGTTAGC AAATCTTGAT TCTCTTTCAT CTTCATGATC 10 920 

ATGTGGATAA ACCATGTTTA AAT AAT AO 1 T GATTGCTGGT CGACATTTTG GACATCCACC 10 980 

TTTATTTTTA AAGTTTAAAA CATGTCGAAC TTCTTTAGAT GTTTTTAAAC CTTTCGCTCT 1104 0 

TATTTGCGTT ACTATTTGAT CGCGTGTCAA AT C AG T AC AA CCACATATAC CAGCAGGTTT 11100 

TGCGGCAACA AAGTCATCTC CTAAGGTGTG CTGCAATATT TGAGCAATTT GCGTTTACA 11160 

TTTACCACAT GAATTCCCCG CTTTTGTTTT AGCCGTTACT TCTTCAACTG TTGTAAAGCC 11220 

ATTTTCCGTA ATCGCATTTA CTATAGTACC TTTATCAACA CCAT TACAAC CACAAATTGT 112 8 0 

TTCATCATCA GCCATATCAG CAATTGATAG CGATGCCTCT TCTCCACCTT TAG TAAGCAA 1134 0 

TGATACAAGT GTGTAATCTT CAGTGGATTC ACCTTTTTTC ATCATGTTAT AAAAGCGTGA 114 00 

ACCATCATCG ATATCACCAT ATAGTACTGC ACCAACTACA TTACCGTCTT T T AAAAAG AT 114 60 

TTTTTTATAG TTATTATCAA CACTATTAAA TATTTCAATA CCTTTAATTT CTGCATTTTC 11520 

TACAATTTGA CCAGCACTAT ACAAGTCACA CCCAOAACT TTTAATGACG TAAATGTTGT 11580 

TGATCCCTTG TATCCGTTCG TTTCTTTATT TGTTAAATGA TCAGCTAATA CTTTACCTTG 11640 

TTCATATAGT GGTGCAACGA GTCCATAAAC TTTGCCGTTA TGTTCTGCAC ATTCACCAAC 11700 

TGCATATACA TTGCTATCAC TTGTTTGCAT CACATCATTG ACAACAATAC CACGATTAAC 11760 

ATCTAGACCT GATTCTTTGG CTACTTCTGT GTATGGTCGT ATACCTACTG CCATAACAAC 11820 

TAAGTCTGCC GGAATCTCGC GTCCATCAGC CAATTTAACA CCCTCAACAT CATCTTCTCC 11880 

TAAGATTTCA GTTGTGTTGG CTTGCATTTC AAACTTCATA CCTTGCTTTT CTAGATCTGC 11940 

TTTAAGCATA TTTCCAGCTT TACGGTCTAG TTGCATTTCC AT C AAC C AT T CAGCTAAATG 12000 

TAACACCGTT ACTTCCATAC CTTGATCTAA TAAACCACGT GCACACTCTA AACCTAGTAA 12060 
TCCTCCACCA ATTACAATTG CTTTCTTTTT AGTCTTAGCA ATGTTCATCA TTTGTTCAGT 12120 

ATCATCAATC GTTCTCCAAC CAATTACACT CGGTAATGTC GACCTGGAA CTGGAATGAC 12180 

GAATGCTTTA GAACCTGTCG CAAAAATCAA TTTATCGTAT GATACTTCAA TACCATTTGC 1224 0 

AGTAGTAACT GATTGATTTG CTCTATCTAC TTCAATTACA GGATCATTTG TAATTAACTC 12 300 

GATACCATGT TCCTCATACC ACTCATATGG ATTCATAATT GTTTCTTCAA CTGTCATTTT 12 3 60 

ATTTTGTAAA ATATTTGAAA GCATGATGCG GTTATAGTTT GGATAAGGTT CTTTACCTAT 12420 

TACCGTAATA TCATATAAAT CGTTGGCGCG CTCTAATATT TCTTCGATTG TTCGAATGCC 124 80 

CGCCATACCG TTACCAATCA TTACTAGTTT TTGCTTTGCC ATAAAATATG CCCCTTTACT 1254 0 

CCATAATATT TATTTCAAAA AAAG3TATTA ATTTTTCGTT AGTGCTTTTA TATTTTCATT 12600 

G G AAT CAT T A AGCTTTCTAA TCTATCGTTA ATGATTTGCT TTAAAATTGG GTCGAAGTTA 12 660 

ATTGAAGGTG TGAAGTGTAT ATCTGTATTA ATAAC CATGT CATTCATTTG CTGCTTCACT 12720 

TTGTTAACAA GTCTTCCGTC ATATAAAAAT AATGGTACGA C AAT C AAT T T T T G AT ACC G T 127 8 0 

TTCGAGATGC TTTCTAAATC ATGTGTAAAA CTAATCTCTC CATATAGCGT TCTCGCATAT 1284 0 

GTCGGCTTGC TAATTTGCAA ATTTTGAGCG CATATTTGTA ACTCTTCGTG TGCCTTAGTA 12 900 

AACTTTCCAT TAATATTGCC GTGTGCAACA ACCATAACTC CAACTTGTTG TTCGTCACCT 12960 

GCTAATGCGT CACAAATACG TTGTTCAATT AATCGTCTCA TTAAAGGATG TGTGCCAAGT 13020 

GGCTCGCTTA CTTCTACCTT TATGTCTGGA TACCGTCGTT TCATTTCATG AACGATATTC 13080 

GGTATATCCT TGAGATAATG CATTGCACTA AAGATTAGCA ATGGTACAAT TTTAAAATGG 1314 0 

TCAACCCCAC TTTGAATCaA CGTCGTCaTT iCCGTCTCTA AATCCtGATG CTCACTTTCt 13200 

AAAAACGCAA TATCATAGTG ATGTATATCA TCTTTTACTA ATTCAGAAAT AAATGCTTCT 13260 

AACGCTTGaT TCTGTCGTCC GTGCCTCATG CCATGTGCAA CAATGATATT CCCATTCACA 13320 

TTTACCAACC CTTTCACACG TATTGTATAC CAAATCATTT TGTTTTTGTG AAAAGAACA 13380 

CAT T AT AAT G TAAAATCAGG GAATTCCCTG ATGCCTGTAG TCATGCATAT TCCTTATACA 13440 

TTTTCCCTTT TTGTTAAATC AAAAAAAGCG ACCGATATAT GAATCCCTAC TCAACATTTA 13500 

TTTGAGCAAG CATTAATATA TCGGTCGCTT GTAGTGTATA TTATTATCTT AAAATGGTGG 13560 

TTGGCCTAAT ATTGTTTCGT CAAAGCGCTC GGGTATCAAT ACTTTGCGCA T GAT C AC ACC 13620 

TAAATCGCCA TCATCATTTT CATGTTCGCT GTATATTTCA TAACCTCTTT T T T CAT AAA T 13 68 0 

TTTAAGTAAC CACGGATGCA ATCTTGCAGA TGTACCTAAA GTAACTGCCG CTGACTTTAA 1374 0 

CGTATCTCGC AAAAATGCTT CTTCAACATA AGTAAGTAff TGGCTACCAT AGCCTTTCCC 138 00 

TTCATACTCA GGATTTGTCG CAAACCACCA GACAAAAGGA TAACCCGAAA TACTTTTCAC 138 60 

ACTTCCCCAA GGATATCTAA CCGTAATCGT AGATATAATT TCATCATCAA TTGTCATGAC 13 920 






AAATGTAGTA TTTTTATCTA TATTTTCTTT AACAGCATCT AAATTAGCAT TAATCGAAGs 13980 

CCAATCAATA CCTAGTTCTC TTAGAgGCGT AAATGCTTCA TGCATGAGTT CTTGCAATTT 14040 

TTCTGCATCT T 14051 



(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1885 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

TAATCCTCAA CTTnGATTAT ATGGCTTGGG CGCATATGAA CTGCTTAGTT TAGTGTATGA 60 

CATTCATACA GTTCGCATGA C T AT CAT AC A ACCTCGAATA GATAACTTTT CTACTGAAGA 12 0 

GTTACCAATC TCAAGATTAC TTCAATGGGG AACCGATTTT GTTAAACCCT TAGCCAGACT 180 

TGCTTATAAC GGTGAAGGTG AGTTTAAAGC AGGTAGTCAT TGTAGATTCT GTAAGATAAA 24 0 

GCATTCATGT AG AAC AC G T G CAGAATACAT GCAAAATGTG CTCAAAAGC CACCACATTT 300 

GTTGAGTGAT GAAGAGATTG C AG AAC T T T T ATATAAACTG CCTGATATCA AAAAATGGGC 3 60 

TGATGAAGTA GAGAAATATG CGTTAGAACA AGCGAAAGAG AATGATAAAA CGTATCCAGG 4 20 

TTGGAAGCTA GTCACGGGAC GTTCAAGGAG AGTGATAACT GATACAAAAG CAGTCCGAGA 480 

CAGGTTAGTT GAAGCGGGTT ATAAACCTGA AGATATTACA GAAACCAAGT TACTTAGCAT 54 0 

TACGAATTTA GAAAAATTAA TCGGCAAAAA AGCATTTTCT AAAATTGCAG AAGGCTTTAT 600 

AGAAAAGCCG CAAGGTAAAT TAACACTTGC TACCGAGTCT GATAAACGAC CAGCTATAAA 660 

GCAATCTGCT GAAGATGATT TTCACAAACT ATAAAAATTA AAAAGGACGG TATATAAACA 7 20 

TGAAAGCAAA AGTATTAAAT AAAACTAAAG TGATTACAGG AAAAGTAAGA GCATCATATG 78 0 

CACaTATTTT TGaACCTCAC AGTATGCAAG AAGGGCAAGA AGCAAAGTAT TCAATCAGTT 84 0 

TAATCATTCC T a AAT C AG AT ACAAGTACGA TAAAAGCCAT TGAACAAGC ATAGAAGCTG 900 

CTAAAGAAGA AGGAAAAGTT AGTAAGTTTG GAGGCAAAGT TCCTGCAAAT CTGAAACTTC 960 

CATTACGTGA TGGAGATACT GAAAGAGAAG ATGATGTGAA TTATCAAGAC GCTTATTTTA 1020 

TTAACGCATC AAGCAAACAA GCACCTGGTA TTATTGACCA AAACAAAATT AGATTAACGG 108 0 

ATTCTGGAAC TATTGTAAGT GGTGACTATA TTAGAGCTTC AAT CAAT T T A TTTCCATTCA 114 0 

ACACAAATGG TAATAAGGGT ATCGCAGTTG GAT T G AAC AA CAT T C AAC T T GTAGAAAAAG 1200 

GCGAACCTCT TGGCGGTGCA AGTGCAGCAG AAGATGATTT TGATGAATTA G AC AC T GAT G 12 60 
ATGAGGATTT CTTATAAGTC AATAGGTGGG GTTTTTAGCC CCACTTTAAT TTTAAAGAAA 1320 

TTGAGGTGTC AAGAATTTGA AATTTATGAA TATAGATATT GAAACATACA GCAGTAACGA 138 0 

TATTTCGAAA TGTGGTGCCT ATAAATACAC AGAAGCTGAA GATTTCGAAA TTTTAATTAT 144 0 

AGCTTATTCG ATAGATGGTG GAGCGATTAG TGCGATTGAC AT G AC T AAAG TAGATATGA 1500 

GCCTTTCCAC GCTGATTATG AGACGTTTAA AATTGCTCTA TTTGACCCTG CTGTAAAAAA 1560 

GTATGCATTC AATGCTAATT T C G AAAG AAC TTGTCTTGCT AAACATTTTA ATAAACAGAT 162 0 

GCCACCTGAA GAATGGATTT GCACAATGGT TAATTCAATG CGTATTGGCT TACCTGCTTC 168 0 

GCTTGATAAA GTTGGAGAAG TTTTAAGACT AC AAAG C C AA AAAGATAAAG CAGGTAAAAA 174 0 

TTTAATTCGT TATTTCTCTA TACCTTGTAA ACCAACAAAA GTTAATGGAG GAAGAACrAG 1800 

AAACCTACCT G AAC AT GAT C TTGAAAAAtG GCAACAATTT ATAGATTaCT GTATTCGAGA 18 60 

TGTAGAAGTA GAAATGGCGA TTGCT 1885 
(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2656 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

TAATCCTTAG TTCACTGnCA AATTTCAAAA CACCAGTTCC CTCTATCTGC ATCCATAGAA 60 

ACTGnATGTT TGTGTCAATA ACCGGATTAT ATTGTGATGn TGTTTGTAAC TCGATTAAGT 120 

TATCATCTTT CGAAAAATTA TCTACTACCA TTATTCAACC ACCTTTCCTT CGAATAAAE 180

CCATTTACCA ACkCCACCAG TACCAAAGTT TCTAACTAAA AATTGATGTG CAGACGGGAA 240

GTTATTACGT CTTAATACTT GTGTTGTATT ACCTGGTGTA TTCGATTTTA CTTCTAATAT 300

CCAACCTGCA ATACCTTTAA AGTCTTTAGG AAAATCAGTA AATCGGTTTG ATTCTTCAGT 360

AGTGATATAG AAATCTAAAC CAACGATTTT TAAATCTGAT AATTTTGTAA TACTCTTAGG 420

GATATGTTCC CAATAACCGG CGTTTTGCGG GCAGAAATTC CATGCTCCGT TGTTTTTCTT 480

ATTGAAAATG TCAATGACAC GTTCGAATTT AAGCATATTT CTACCTGTGC TGTTTCTGGt 540

AAGTACTTGT CTTAGAGCAC CATTATAGTG TCCAGGCAGT ACATCCAAGA ACCACCCTGC 600 

ATCTCTAAAC GCTTTCGGTA ACGGGAAATC TAATGCATTT TGTGTGTCTT GaCGTATAGA 660 

TATAGTAATG ACCAACTTCC GTAATATCAC TTAGATATGC TGGGTTCTGT ATTGGTAACG 720 

GTTTAACACG TCCGCCTGAA TCAGTCATTG ATACTTGAGG TGCGATGTTT TTCAAGAATT 780

GGTTTACACC TCTTTGACCG ATAGAATAAA TTGAATGATG TCTGTTGTTA CCAGGTCCAA 840

TAGTTACCCC GATTAGAAGT GCTTTACGTC CTGTTTCTAG ATCGTAATAC ATATCTAGAC 900

CCTCAGCCTC TTGGAAATCT CCTTTAAAGT TGTTATTCAC ACCGCCTATA TCGATGCGAC 960 

GTTTAAATAA CAATTCTTTC GTTTTGATAT CGAAGCCTTG TAAGTAGTTA GGGTTGGCTG 1020 

TATTCGAATC ACCTGTATAC CAATATAAGA TACCTGCATC ATAAGTGATA CCTTGCATAG 1080

GTTGTGTATC TGAAGTGTAT TCCATAGGTA TATCCATTTG ATACAATACT TTGTCTATAC 1140

CTTTATCAAT ATCGTCAGCA CTTCTAACCT CAACAAAGTT CAACGAATC TTAAGTTGTC 1200

TTTCAGTGGG TTTATATTCA CGTCTAAAAA TCATTAAATT TTCTACCGGA TTATAAATCG 1260

CTGACGTATA TCTGTCGTTA AATATATTCG GCATGACATC TTGCATTTCA TTACCATAAG 1320

TTATTTCTCC AGTTCTATAT TGGAAACGTA CAAACTTGTT GTTTTTGTTA CTGTCCAATA 1380

CAGCTGAATA AATCCATAAT TCTCCATCAA TGTATCTATA CGCATTGTGT GTACCGTGAC 1440

CGCCGTTTTT AACAAGCAAT CTATCAATAA ATTGTCCGTT GGGCTTCAAT CTAGATAACA 1500

TGTAATGATT ACCTGGACGA GCTTGCGTCA TATAAATAAT TTTCGTTCTA GGGTCTACCC 1560

AAAATGATTG CATTACTGCA TTTGTATATG GCGATAAATC AGTGATAAAT TCCGGTTCTT 1620

GCTCTTTTGG TTCGAATCGG TATTCTGTCG CTCGATATTC TTTATAGTGT TCATCTACAG 1680

CTTTCTCAAC CTTTTTAGTG AAAACATCTA GTGTTGAATA ATCATGATAC AAACGATCTT 1740

GCAATGTCTT ATGACCATAA CCTGTATTAT CAACGCGCGC GTCTTTTAcT TCGTCTATAC 1800

CGTCGCCGTT ATGACCTAGT ACCATGTTGC TAAATCGACC GTTTAAATAT GTTAAAAAGT 1860

CAGAGACGTT ACTTGTAACA TTTAAATGTT CATACTTTAT TTGTTCTCCA TCATGTGCGA 1920

ATACCTCTTT ATTTCTGTGG TATTCAAGAG AGAAATTAAA ATCCGTCAGC ATGTCTGAAA 1980

TAAGTTTAAA GTTATACTCA TTTTCATCTA CATATCTGTA GTCAAAGACT CTACTTAAAT 2040

CTGTAATTAG TTTATTACTC ATGTTTTCCT CCTTTACTAT CCATAAAACT GATmATAATT 2100 

TTTAATAAGC TCATACATAA TAACTTCATG ACCTCTTTCA TTAGGATGTA ATCCATCAGG 2160 

CATGCTAGAT TTTCTAAATG CTGGATTATA TGGTTTSAA TAATCTGTGT GATAAGCATC 2220

ATATACTGGT ACATCCAATT CACTACAAGC CAATATCTGA GCATTGACAT AATCCTCTAA 2280

AGTTAACCCT AGTTTGTTTT TGTCCGTATC TTTACGGCGT ATCGTTGTAC CACTCATAGG 2340

GCATTGCCTA GTAGCTGTCA TTACAAGTAT TTTTGAAGCT GGATTATTTT TCCTGATAAC 2400

TTCAATTGCA GAACAAAAGG CGCCGTAAAA CGTTTTAGTG TCGGTTTTAT CAGTGCCTAT 2460

CGGTACGCCT GCCCAATAAC CATGTAACCA GTCATCATCT GTACCTTGTA ATATGATTAG 2520

GTCTCCTCTT ATTTGCTCTG CTTGTCTaTA AATGCTGTTT TCTaCCGCTT CTTTACCTAT 2580

TGGAACTGTT GCCATTGTTG CGCCACCTCT TGCAAGGTTG GTCGTTTTAG CTTTTAACTT 2640

CTTGCCTAAC ATTTCT 2656

(2) INFORMATION FOR SEQ ID NO: 106:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4854 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:

AAAATGAGGG TTCTAGCGGA AATTACCAAA AGCGTGGTTC ATACTATGGG CAGCGTAATC 60 

GTATTTCAAA AGAAAAAACA CCTAAATGGT TAGaAAATAG AGATAAACCT AGTGAAGAAG 120

ATTCGGCTAA AGATAATAGC GTAGATGATC AACAATTAGA GCAAGATCGA CAAGCATTTC 180

TAGATAAATT ATCTAAAAAA TGGGAGGAGG ACAGTCAATA ATGAAGCAAT TTAAAAGTAT 240

AATTAACACG TCGCAGGACT TTGAAAAAAG AATAGAAAAG ATAAAnCAGA AGTAATCAAT 300

GACCCAGATG TTAAGCAATT TTTGGAAGCG CATCGAGCTG AATTmACGAA TGCTATGATT 360

GATGAAGACT TAAATGTGTT ACAAGAGTAT AAAGATCAAC AAAAACATTA TGACGGTCAT 420

AAATTTGCTG ATTGTCCAAA TTTCGTAAAG GGGCATGTGC CTGAGTTATA TGTTGATAAT 480

AACCGAATTA AAATACGCTA TTTACAATGC CCATGTAAAA TCAAGTACGA CGAAGAACGC 540

TTTGAAGCTG AGCTAATTAC ATCTCATCAT ATGCAACGAG ATACTTTAAA TGCCAAATTG 600

AAAGATATTT ATATGAATCA TCGAGACCGT CTTGATGTAG CTATGGCAGC AGATGATATT 660

TGTACAGCAA TAACTAATGG GGAACAAGTG AAAGGCCTTT ACCTTTSGG TCCATTTGGG 720

ACAGGTAAAT CTTTTATTCT AGGTGCAATT GCGAATCAGC TCAAATCTAA GAAGGTACGT 780

TCGACAATTA TTTATTTACC GGAATTTATT AGAACATTAA AAGGTGGCTT TAAAGATGGT 840

TCTTTTGAAA AGAAATTACA TCGCGTAAGA GAAGCAAACA TTTTAATGCT TGATGATATT 900

GGGGCTGAAG AAGTGACTCC ATGGGTGAGA GATGAGGTAA TTGGACCTTT GCTACATTAT 960

CGAATGGTTC ATGAATTACC AACATTCTTT AGTTCTAATT TTGACTATAG TGAATTGGAA 1020

CATCATTTAG CGATGACTCG TGATGGTGAA GAGAAGACTA AAGCAGCACG TATTATTGAA 1080

CGTGTCAAAT CTTTGTCAAC ACCATACTTT TTATCAGGAG AAAATTTCAG AAACAATTGA 1140

ATTTTAAAAT GATTGGTGTA TAATGAATAC AAATCTAAAT CGTTTAAATG ATTGAAGACA 1200

AGATGATCTA ATCAATATTA CACAGAAAGC CATTGTTTGA TGAGAATATG GTTAATAAAT 1260

TAGATGATTA CTACTTCATT TATGGTATTT GTAATGAATA CCCGGATCAA GACGTTATC 1320

TTAAGCAGAG ATATTTAATA CATAAATTGT ATTAGATAAC TAGGGTGGTA CCACGACGAT 1380

CTCGTCCCTT GTATAGGGGC GGGATTTTTT GTTTTTTTCA GACATAAATG TTTGTTGGTG 1440

TCATAAATTC CCTGTTTATT GTTAATAGGT TTAATGTTAA AACGATGATT GTTGTTCAAT 1500

TTTTTAACGA GGTCAGATAA AAGTATTTAT AAAGCAAATA GGAGGGTTTA ACATGGAACA 1560 

AATTAATATT CAATTTCCAG ATGGTAATAA AAAGGCGTTT GATAAAGGTA CTACTACTGA 1620 

AGATATAGCA CAATCAATTA GTCCTGGATT ACGTAAAAAA GCTGTTGCCG GCAAATTTAA 1680

CGGGCAACTT GTAGATTTAA CTAAACCGCT TGAAACTGAT GGATCAATTG AAATTGTGAC 1740

ACCAGGTAGT GAAGAagcGT TAGAGGTATT ACGTCATTCT ACTGCACATT TAATGGCACA 1800

CGCGATTAAA AGGTTATATG GTAATGTTAA ATTTGGTGTA GGTCCTGTAA TAGAAGGTGG 1860

ATTCTACTAT GACTTCGACA TTGACCAAAA CATCTCATCT GATGACTTTG AACAAATTGA 1920

AAAAACAATG AAACAAATCG TTAACGAAAA TATGAAAATC GAACGAAAAG TGGTTTCACG 1980

AGATGAAGTG AAAGAGTTAT TCAGCAATGA TGAATACAAA TTAGAATTAA TCGACGCGAT 2040

TCCTGAAGAT GAAAATGTAA CATTATATAG TCAAGGTGAT TTTACTGATT TATGTCGTGG 2100

AGTTCACGTT CCATCAACAG CTAAAATTAA AGAGTTTAAA CTATTATCTA CAGCAGGTGC 2160

ATACTGGCGT GGAGATAGTA ACAACAAAAT GTTACAACGT ATATACGGTA CTGCTTTCTT 2220

TGATAAAAAA GAATTGAAAG CACATTTACA AATGTTAGAA GAGCGTAAAG AACGTGATCA 2280

TCGTAAAATT GGTAAAGAGT TAGAACTATT CACAAATAGC CAffiTAGTTG GTGCTGGTTT 2340

GCCATTATGG TTACCTAACG GTGCAACAAT TAGACGTGAA ATTGAACGTT ACATTGTTGA 2400

TAAAGAAGTT AGCATGGGAT ATGACCACGT TTATACACCA GTACTTGCTA ATGTTGATTT 2460

ATACAAAACA TCTGGTCACT GGGATCACTA TCAAGAAGAT ATGTTCCCAC CAATGCAGTT 2520

AGATGAAACT GAATCTATGG TATTACGTCC AATGAACTGT CCACATCATA TGATGATTTA 2580

TGCGAATAAA CCACATTCAT ATCGTGAATT ACCTATCCGT ATCGCTGAGC TAGGAACGAT 2640

GCATAGATAT GAAGCAAGTG GTGCTGTATC AGGATTACAA CGTGTTCGTG GTATGACTTT 2700

AAATGATTCA CATATCTTTG TTCGATCTGA TCAAATTAAA GAAGAATTCA AACGCGTTGT 2760

AAACATGATT ATTGATGTGT ATAAAGACTT TGGTTTCGAG GATTATAGCT TTAGATTAAG 2820

TTATAGAGAC CCTGAAGATA AAGAAAAGTA CTTTGATGAT GATGATATGT GGAATAAAGC 2880

TGAAAATATG CTTAAAGAGG CAGCGGATGA GCTTGGCTTA TCGTACGAnG AAgCGATTGG 2940

TGAAgCGGCA TTCTATGGTC CGAAACTAGA TGTTCAAGTT AAAACAGCGA TGGGTAAAGA 3000 

AGAGACATTA TCAACAGCAC AACTTGATTT CTTATTACCA GAACGTTTTG ATTTAACTTA 3060 

TATTGGTCAA GATGGTGAAC ATCATCGTCC AGTTGTTATT CATCGTGGTG TTGTATCAAC 3120 

AATGGAACGA TTTGTAGCAT TTTTAACTGA AGAAACTAAA GGTGCATTCC CAACTTGGTT 3180

AGCGCCAAAA CAAGTTCAAA TCATTCCAGT TAACGTTGAT TTACATTATG ATTATGCGCG 3240

CCAATTACAA GATGAATTGA AATCTCAAGG CGTTCGTGTA AGTATTGATG ACCGTAATGA 3300

AAAAATGGGT TATAAAATCA GAGAAGCTCA AiTGCAAAAA ATACCTTATC AAATCGTAGT 3360

TGGGGATAAG GAAGTTGAAA ATAATCAAGT GAATGTGCGT CAATATGGAT CGCAAGACCA 3420

AGAAACAGTT GAAAAAGATG AATTTATCTG GAATCTAGTT GATGAAATTC GTTTGAAAAA 3480

ACATAGATAG ACAGTTGTCG CAATAAAATG CTTTAAAACT TTTATTGCGT ATCAAGTTTT 3540

ACAGGGTTGA TTATGCGTGA TGAATCCTGT ATATTACAAG TTAGTTAAAA TATTAAATTG 3600

AGTTAGAGGT TGCATGTTTA ATTAGTAACT TGTCAGAAGT ATTTATGGTA CATAAGTTGA 3660 

ACAAGTGAAA GGTAAAGATG CCGAAATAGA TATAAACCAT AAATTATATC TATTGGGACA 3720 

GTTTTCGAAT AGGAACTGTA CTGTCACAGA ATGTGATGTG CTACCTTATA TAGATAATTG 3780

CCAAAGTGGT TGCATATCTT AAAGGTATGT AGCCACTTTT TTACTTTTAA TATCACTATG 3840

TTCTGTAAAA AAGGGTATGA AAGTGAATAA AGGTTATTTA TTTCTTGGCC TCTAAAACAT 3900 

GGAAAGGGAG CTTATATGTC AAAAGTTCAA AATGAAAGTA ACAATGTTGT CAAAAGGGGA 3960

CTTAAAGATC GTCATATTTC TATGATTGCG ATTGGGGGTT GTATTGGTAC AGGTTTATTT 4020

GTAACTTCTG GTGGAGCAAT TCATGATGCA GGTGCTTTGG GTGCATTAAT AGGATACGCA 4080

ATTATCGGAA TAATGGTATT TTTCTTAATG ACGTCACTTG GCGAAATGGC TACGTATTTG 4140

CCAGTATCAG GTTCATTTAG TACATATGCT ACAAGATTTG TTGATCCATC TTTAGGGTTT 4200

GCGCTTGGTT GGAACTATTG GTTTAACTGG GTAGTGACTG TAGCAGCAGA TATTACGATT 4260

GCAGCACAAG TCATTCAATA TTGGACACCA TTGCAAGGCA TACCCGCTTG GGCATGGAGT 4320

GCGTTGTTCT TAGTTATAAT TTTTAGTCTG AATTCGTTAT CAGTTCGCGT CTATGGTGAA 4380

AGTGAATACT GGTTGGCATT GATAAAAGTG GTTACAGTTA TTGTTTTCAT TGCAATTGGT 4440

TTATTAACGA TTGTCGGAAT CATGGGTGGT CATGTTGTAG GATTCGAAAT ATTTAATAAA 4500

GGTGAAGGTC CAATTCTTGG TGGCAACTTA GGAGGAAGTT TGTTATEAT TCTAGGTGTA 4560

TTCTTAATCG CTGGTTTCTC ATTCCAAGGT ACTGAGTTAA TTGGTATTAC GGCTGGTGAA 4620

TCAGAAAATC CTGAACGTGC TGTGCCGAAA GCAATTAAAC AAGTATTCTG GAGAATTTTA 4680

TTATTTTACA TTTTAGCCAT TTTTGTTATC GGTATGTTAA TTCCTTATGA TAGTAGTGCA 4740

TTAATGGGGG GTAGTGATAA TGTAGCAACG TCTCCATTCA CATTAGTGTT TAAAAATGCT 4800

GGATTTGCGT TTGCAGCATC ATTTATGAAT GCAGTCATTT TAACGTCTGT GTTA 4854
(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2488 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107:

ATCAAAAATT GATTGTTTTC nATTTTTTGT TTCAGCGCGG GATCTTTTAC GTCTTTTGTG 60 

AAAACGaTTT TATTATTAAC TAC ITT TACT GGATAACTTT TGTATGTCGA GTCAGTAGCA 120 

TTTTTTCTAT CGTTTGTAGT TGTGTCATAT TCACCAgTTA TTTTATGTGT GTTCTTATCT 180 

ACCTTTAACA ACATACGGTC TTCTTTTAAA AGCTCATCTG ATCCAACAAC TGAATAAGAG 240

GATTCTATAT ACCATGTGTC TTGATCATTA TTTTCATAAT GGGGATTATC GTGACCATCA 300

ATTTCATAAA GCGTTTCTAA GTTTTTAATA GGATACGTAC TTAGTACTTT TTTAAGACCA 360

TCTTTCAAAT GAATTTGTTC CCACTTCATT GCCAAAAACA TATCGCCACT GACTACAATT 420

GAAATAATAA TAATTGCTGC TAAGTTTAAC CAGAAAATTT TATGTGCTTT CATACATTCC 480

CACCGTTTCT CAAAATACTT CATTAACACT ATAATAATAT ATTTTGAAAA ATATTTACAT 540

CAGTATTAAA GTGAATATCA AATTTTAAAT TTATGAAAAT AATAGATATT TATAAAAAGC 600

GGAAAAGAGA TACAATAAAA AACTGCATGA CGTTTGAGAC GTCACACAGT GTAACTAAAA 660

ATTTAAAAAG TTGTTGCTAA TTTTTCAGCA TATTAATAC TAGTTGCTTT AATTTCTTCA 720 

GTCTTATGAG GTTCAGCATT GTGTCCTTCA ATAATGATTG TTTCATATGA TGGCACACCT 780

AAGAATGTCA TAATTGTTCT TAAATAACGG TCACCCATTT CAAAATCAGC AGCAGGTCCT 840

TCAGTATAAT ATCCACCACG TGATTGAATG TGTAATACTT TTTTGTCAGT TAGTAAffiCT 900

TGTGGTCCTT CAGCAGAATA TTTAAAAGTT TTACCTGCAA TTGAAATAGC ATCAATATAT 960

GCTTTAACTA CAGGTGGGAA AGAAAGGTTC CACATAGGCG TTACAAATAC ATATTTATCT 1020 

GCACTTAAAA ATTCTTCTAA AATGTCACTC AATCTTGAAA CTTTCATTTG TTCATCATCA 1080 

GTTAACGTTT CGCCATTACT CATTTTTCCC CAACCAGTTA ATACATCTTT GTCAATAACT 1140

GGAATATAAG TTTCArATAA ATCAATATGT TTCACTTCAT CATCAGGATG TTGTTGTTGA 1200

TATGTTTCGA TAAATGCTTT ACCAGCCGCC ATAGAATTTG ATACCAGTTC ATTAAAAGGG 1260

TGTGCTGTAA TATATAATAC TTTTGCCATT TGAAAATTCT CCTCTGkTTC TGTTATTTTC 1320

TTAAGTATAA TTATTATACT CGATATAAAA TTTAATATCA ATCAAAATAT TCAAATTACC 1380

ATCATTTTCT TCATCTATAT nTGGCAGTAC TACTAAAGTA TGAGTGCATT TAATTATGAa 1440

ATAGTTGATT TaGAATAtAT ACTTAATACC CAAAATATAT GAAGGATGGA TGCCACTATG 1500

ACrAAGCGAC CAAAACGTAT TTTGGCAACA ATTATCATTT TTCTTTCACT ATTATTTACG 1560

ATTATTTATA TAGATGACAT TCAAAAATGG TTTAACCAAT ATACCGATAA ATTGACACAA 1620

AATCATAAAG GACAAGGACA CTCAAAATGG GAAGACTTTT TTAGAGGGAG TCGGATTACT 1680

GAGACTTTTG GTAAATATCA ACATTCACCA TTTGATGGTA AGCATTATGG CATTGATTTT 1740

GCATTGCCAA AAGGTACACC AATTAAAGCG CCGACGAATG GTAAAGTAAC ACGTATCTTT 1800

AATAATGAAT TGGGCGGCAA GGTATTACAG ATTGCCGAAG ACAATGGAGA ATATCACCAG 1860

TGGTATCTAC ACTTAGACAA ATATAATGTC AAAGTAGGTG ATCGACBCAA AGCAGGTGAT 1920

ATTATTGCAT ATTCAGGCAA TACAGGTATA CAAACGACAG GCGCACATTT ACATTTTCAA 1980

AGAATGAAGG GTGGCGTAGG TAATGCATAT GCAGAAGATC CAAAACCGTT TATCGATCAG 2040

TTACCTGATG GGGAACGTAG CCTATATGAT TTGTAGTTAT AGAAGGGTGC CCGCAGTCTA 2100 

AAAAATTAAG CAATCATTGT GTGAGTATGA TACTTACATA ATGGTTGCTT TTTTCAATGA 2160 

AAATCGTAAT GCTAAGTCAT ACTTGTTTGA TTTAGATATT ACTTAAAATG TAAGACAAGG 2220

TTGTTAGCAT TGGCAGTGAA ATATCGCACA TAAAAAACAT TATTGTCACA CTAGAAAATA 2280

GTTGTGCACT ATATCAATTT TCTGTATSLA AGTTTAATTC TGACAGTAAT GTAAACGTTT 2340

ACAATTTATG ATTGACATTA ATAATGACTG AATATATGAT TTATGTAAGT ATTTGTGCAA 2400

CGTTTTCACA AAGTGTATTG CACaAyCAAA CTGtAAACaA aGTATGGGGg GCCATAACAT 2460

GGCAGAACTA AGTTAGAGCn TATTAAAA 2488



(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4093 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108:

TTTTCTTTAT TTCAAmCTGT ATATTaATGA TGTCACTTCA TTTGATACGA TTCTTGATAA 60 

CCTATTCAAA ATTCCGCCAA ATAACATAAA TATTATATAA ATGCCGATAC TTTTAATCAT 120

TTTCTACTTT TTCTTCGATA CGGAAACTTG TTTTCGAATT GAACACTTCA CCAGCTTTTA 180 

AAATTGACGG TGCTTTTTCA CCATATAAAT TAATATCATT TGGTAAAAAT TGTGTTTCTA 240 

ATGTAAAGCC AGAATGTGGT TTATAAATAT TAAATGGACT ATCCCACTCA TCAGGCTGGT 300

TAAAAGTAAA GAACACAACA TGAGGCATAT CTGTATCGAC CTCTAACATA AATTCATGAT 360

TTTCAACATA CATTTTATGT TCACCAACTG TAAATGGGTG ATCGAGACCA CCAAAACGTG 420

TCATTTGTTG GCTTAATTGC GCATGTTGTG ATGTAAAAAT ATCACTCAAT TTTATCTTGT 480

TATCTTCAAA CACTTCATGT AAATCTAGAA TATCACCTGT AACAATATTT CGCTCATCTA 540

ATACATACAT ATCTAATTGA TTACTTGAAA TGCGATGATT ATCAACGACA TTATTRCTC 600

GATTCAAATT GAAGTACACA TGATTCGTAG GACTAAACAA TGTGTCTTCT GATGCAACTG 660

CTTCGTATTC AATCGACCAT TGGTGATCCG CATCATAAAT ATGTGTAATC GTCACATCGA 720

TATCACCCGG GAAATGATCA TCAGCTGATT TCAACACCGT CTTAAATATA ACTTTAATTT 780 

GAGCAATTTC ATTTCTAATT TCATAATCAA ATAACTTATT GTCCAAACCA TGACATCCAC 840 

CATGTAAATG ATGTTCACCG TTGTTTTTTT CTAACTGATA TTCTTTACCT TTCAACTTAA 900 

ATTTAGCATT ATCAATTCTA CCGCTATATC TTCCTATAGA AGCACCAAAT TTAAAAGGAT 960

TACTATGATa AAATTCATCC GCTTCAACAA CATTTCC^G AACAATATTA TTATCATGAT 1020 

ATTTCCAAGA CACTACTCTT GCTCCATAAT TCGTAAAAAT AATTTTAGTT TCATCATTAT 1080 

CAATTTTGAT TAAATCTACA CCTTGTCTTT GGTGCTCAAC TTCAACTATC ATTTTTACTT 1140 

CTCCCTTCTA ACCACAAGTG TTCAAGCTCT GCTGGGTAGC AACATTACTA AAACACCTAC 1200

AATACAAATG ATTGCACCGA TAACATCATA TTTATCTGGC ATTTGTTTAT CTACGACCAT 1260

CGCAAAAATC AAACTCATGA TGATAAATAC GCCACCATAT GCTGCATATA CTCTTCCGAA 1320

TGATGGAAAT GATTGAAATG TCGCAATGAC ACCATATAAC ATGAGTATCG CACCGCCTAT 1380

TAGCCCAACA AGTGAAGACT GTCCTTCCCT AAGCCACAGC CAAATCAGGT ATCCCCCACC 1440

TATTTCACAT AAGCCAGCTA ATATAAATAT AAAAATCGGA TATAACATGA AATCACTCCA 1500

TCACACATTT GCTATCAATA ATCTATCGGC TACATATCAT TTGTTTACAT TTCTTCTTAC 1560

TTCACATTCC CATTTTAAAA AGTTCGTTTT CACATTCATA TTGTEACTT TTTTAGACAT 1620 

TATTCTATAG CTAAATATAA AAAAATAAGA GTAACACGCT TTCATCATCA TTTTATATGA 1680

TAAATGTGTG TCACTCTCAT CAATTTTATT TTTTAAATAC ACGTTTCATT GAATTAAATA 1740

AGCCACGTTC AAATGTAAGT ACTGAATCTT TATATGTTTT AATTGCAATC CATATCAAGA 1800

CAGCTACCAT TACAATTGAG ATTAAAGAAC TTAAGATGAC CTCATATATT TGAAGCCCTG 1860

AAGTTTGAGC GCGTACAACT AATTGAAATG GCGCTAAAAA CGGAATATAA CTTGTGATTA 1920

AAGCAAGTTG TCCATCAGGA TTATTTATCG TGAATATCGC GATATAAAAT GCAATCATAC 1980

CAAGTAATGT CAGTGGCATC AAAG AT IS AT TTAAATCTTC TATTCTAGAT GTTAATGATC 2040

CGAGGATGGC TGCAAGTAAT ACATACGCCG TAATTCCAAC AATACTACTT ATAATTCCGA 2100 

CAATAATAAT TTGCCAAGAC AATTGATTCA TTTCCACGTT AAAACCTTGT AGCAAGTCTT 2160 

TTAAGTCAAA GGCAAAAATG CATATAACTG CCATCAATAC AATTAAAATA ACTGAGTCA 2220

CGGCAACTGC TATAACGCCT AAAATTTTAG CAAAAATATG TTGGATTGGT GAAATACTTG 2280

TAATAATCAT TTCAATGACA CGCGATGTTT TCTCACTAGC AATTTCCATA GCTATTTGAG 2340

ATGCATAATT TAAAACAATG AAGAACATTA GAAAGATAAT GCCATmaGcT AAAGCATAGT 2400

TGAAAATCTT TTGTCCTTCT GATACTTTAT CGACTTCATC ATTAGAAATC ACCTTATTAT 2460






CAACTTTACT TTGTGCTTGT AATTTTTGTA AGTCTTCTTT GTTGATATTT AATTCCCCGG 2520 

CTACCATATT TGTTTGAATA GCTGTAAGCA GTGCTTGTAC TTTTTGTGAA TCTTCATGAC 2580 

TTACTCGCTT CTCACTAATG ATTGTCCCTT GTAKGTGCG ATTTTGATTC ACCTTGATAA 2640

TATAAGCTTT ATCAAGTTTA TGTTTTTTTA CTTCTTTTTC AGCATCTTCT ATAGAAACTT 2700

TAGTAAACTT AGCATCACTA TGAAATGTAT TCGCCTGTTG CTTGAAAACC TTATAGATTT 2760

GTTCATTCGG TGCTGCTACA CCAATTTTAT CTGGACCATC ATCAAACATG TTAATAATCT 2820

TATCAATGTT AGATAGGCCA ATCATTAAGG CAGCAATAAT AATCATAAAA ATTACAAATG 2880

ATTTAGCTTT AATTTTTTTG ATATATGTCA AAGTAAATGT CGCCCAAAAC TTATGCATCC 2940

TTGCCACCAA CCTTCTCAAT GAATATATCT TGTAATGATG GTTCTACAAC TTGGAATCGT 3000 

TTAACATAAC CTTGATGTGC CACAACTTGA TAAATATCTT TGGCTACGTC TTCATTCTCA 3060

ATCGTCAACT GAAGACCTTG CTTCATGTTT TCACTATGAA TGATGCCTCT AATGTTTGTT 3120

AAATCTGGTA GTGTTGTTTC TGATTCAATG ACAACTTTCT TGTTACCATT AGATGCACGT 3180

ACATGATTGA TATCACCAGA AACAACAAGT TGACCTTTAT (TAAAATACA AACATCATCA 3240

CATAATTCTT CAACATGCTC CATACGGTGA GAACTATAAA CGATTGTACT GCCCCAATCA 3300

TTTAAGTCTT TAACTGCTTc TTTTAATAAC TCAACATTAA CTGGGTCTAG ACCACTGAAA 3360

GGCTCATCTA ATATTAGTAA TTCTGGTTTA TGTAACATAC TTGCTAACAG CTGAATTTTT 3420

TGTTGATTCC CTTTTGATAG ACTATCAATT CGTTTTTTGC GGTTTTCAGT AATATCAAAA 3480

CGCTCAAGCC AATACGATAT TTGCTGTTGT ATTTCTGTTT TTGACATTCC CTTTAAAGTT 3540 

GCCAAATATT TCAATTCTTC TTCAACTGTC AATTTCCCAT GTAAACCGCG TTCTTCCGGT 3600 

AAATAACCAA TACGATTGTA CATTGTTTTA TCTAGTTTTT TACCGTTATA CGTrrTGTGT 3660

CCTTCAGTTG GTTCACTTAA GCCTAAAATC ATACGAAATG TCGTTGTTTT ACmTGCACCA 3720

TTTCTTCCTA GAAAACCTAA CATTTTACCT GATTCTAACT TTAATGAAAT ATCATTTACT 3780

GCCGTCATCT TGCCAAAACG TTTCGTAACA TGTTCAATTA CAAGTCCCm ACTTTGCCTC 3840

CTAAAAAnAT ATGTATTTAT CTTAATATAA CATTTCCATT CTCTATAAAT GCAATATTTT 3900

TAAAATGAAT TTATTTTTAA AATTTCTGAA ATTGAAAAAT TTAAATAGTG CCATTTTTGC 3960

ATGTTAAGTA TCATTAGCAC TAGATATGTT TTTTCCATGC CTTTATTGCC TTATTTGTAA 4020

AAAAATATCC CArCATCATT GTGCCgAGAT ATTTATATCA TTACAAtTGA TTATCTGTTT 4080

CTTnCCGGTG TTT 4093 
(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17846 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

TGCCAAACTA CCTTTTGACA GTCGTTGCTG TACTTCAGGA TGATCAATCA CATATnTTAC 60 

TTTATCAAAT AGGGCATCTT CATCfiTTTTT AGTAATTAAA TAACCATTGA AATCTGAAGT 120

AATCAGTTCG TTAGGTCCAT ATTTAATATC ATAACTAATA ACTGGAACAC CATGTGCTAA 180

AGATTCAAGT AGCGCTAAAG AGAAACCTTC CATGTTACTT GTTATTAAAC TCAAATAGGC 240

ATCGCTATAT TCTTGGTCTA GATTGCTTAA AAAGCCGCGT AAGTAAACAT GATTTTCCAA 300

TCCATATTTT TGTATCAATT CATTTAATTT TTTACTTTCA GAaCCAAAAC CATACATATG 360

AaGCTCTATT TTTGGGACAT ACGATACTAA GCGTTTAATT AATTCAATTT GTTGATGTAA 420

TTGTTTTTCA GGTGAATAAC GAGCAACGGA AATTAATTTA ACACTGCGCT GATCTAATGT 480

TTGGACTGGT GTATCAATTG TTTCACTATA GCCGACAGGA ATATTAACAA CTGGAATAGT 540

ATGGTTAATA CGTTTTTCAA CATCTAATTT TTGCTGCTCA GTAGAAACGA TAATTGCACG 600

ATATCGAGAT AAATTTTCAA ACATCGCTTT ATATACATTT TTAAATGGCG ATGAATCTAA 660

TGCATCAATA TTTTTAATGT GTGTACTGTG AJSCACAGCT ACTACTGGGA TTGACTCAGG 720

CGTTAAGTTG AAAATAGGTG CTGTGTACAC ATTACGATCA CTGAAAAATA AATCCCCATG 780

TTGATATAGT TGTTTAATGA AAAATGCGCC TAATTCCGTT TCATTATTAA AGAAATATTG 840

TTTGTTAGCA TAGTAAACAA TAATTTTTTG TACTTCTGGT TTGCCATCCT TGTAAGA&A 900

ATACTTTTCT AATTTTGTGT CACCTTCTGG ATTATAGAAA AATTCACATA ATGTTTGTTG 960

TTTATCAACA AGAATCCTAC TACAACTTAA AAAGCCACGC ACATCATAAA AATCACGTTT 1020

TACTTtTCGT CTTTGACTAT CAAAATGATT TACATAATCT AATATACGAT ATTTAGGATC 1080 

TTGAAAATGG GCATACATTA AGAAACGCTC TTGATCATAT ATTCTAAAGT CATGACTATT 1140

TTCAACATGT TTTAAAGTAT AATGACATTC ATCAGTCCAA TACGACAACC AGTCAAATGG 1200

TTCATTGCGT TCTAAATATG TTGCTTCTTG GAAGAAATCA TACATATTAA TATAGTCAGA 1260

ACTAGTAATA TAATTTTGGG CATTTCTATA TAAATATCTA TTCCATGACA GAAATACACA 1320

TTGCGCTGGT CTTCCCATTT CTTTAAATAA ATTTAAACGA TTAATAATTG CTTTCTCTAT 1380

CCCAGTTAAA TTAACACCTA AACTATTACC TACAAAATAA TTCATTTACA ACACCACTTA 1440

TATCTATTTT TTATAATTAT ATCACATAAT ATTTAATTAC TTCTTTTAAC TGGAAGATGT 1500

GTTTATTTAT AAAACAACAA ATTTTGATAT TTATAATGAT AGTAGTTATT CAATCAcTAC 1560

GACCcAATAT ATCATkGTAG AGCTTAGGAT ATTGATTTAT GACTCAGGCA CATCAAATGa 1620

GAgGATTTAT AAArGAGATA TACAACTCTA GAAGGTATAA TAAAAACGCG CAACTAATGT 1680

TACGCGTTTG AATTAATCAT ATGATATTAT TTGCGATACT TTAATTTAGC GAAAgcATCA 1740

TGTTGATGGA TAGACTCTTC ATTACGACAT TCGATATCGA AACCGTCTAA CCAATCAAAT 1800 

TCAACTAAGT CCGCGGCAAT TAAACGAATT AAGTCTTCGA CAAAACGTGG ATTTTCATAT 1860

GCACGCTCTG TCACACGTTT TTCATCAGGA CGTTTTAAAA TAGGGTffiAG AATTGAACTT 1920

GCATTAGCTT CCATTGCATC TAAAATTTTA TTTTTATAGT CATCAACTAT GTCTTGATCT 1980

TTATTAATAT ATGTTTTAAC AGTGACAACA CCACGTTGGT TGTGCGCTGA ATACTCACTT 2040

ATTTCTTTTG AACAAGGGCA TAGCGTTGTG ACAGTTGCTT CAATAGTAAG TTCTTTACGT 2100 

GTAnCTTTAT CACCGTCAAT TGCTAATCCA TAAGTGACAT CGGCATTACC AACTGCTTTA 2160 

ATATTTGTGG TTGGACTATA GCGATCAAAG AACCATTTCC CAGAAACATC AACGCCTGCC 2220

GCATTTTGTT TCATATTCGT TTGTAAAGTG CGTAACACCT GATAAAGTGT ATTAAATTCA 2280

AGTTCAATAC CATTATCATA GTGCTTTTCA ACACTTTCGA TTATACGGCT CATATTAATA 2340

CCTTTTTCGT CTTTTGTTAA ACTTGTTGAA AAACTAAATG TGCCAGCTGT TTGATACTGG 2400

TCAACAAGTA CAGGGTACAC TAAGTTTTTA ATACCAACTT CTTCTATTTC AAATAAAAAA 2460

TCTTTATGTG TACTTTGTAA ATCTGTCATT TCGTTCTTAG TAGTAGGTTT CGTGCTTCA 2520 

ATAGGATCTA CGGAACCAAA GTGTTTCCAA CGACCTTCTC GTGTCGATAA ATCAAATTCA 2580

GTCATTTTTT TCCTCCGTTA AGATTTAAAG TGATATGTCC AATATGGTTC GACTGTTAAA 2640

AAGCTGTGTT GTTTACCATC GATTTCAGGA CTTGCTAATT GTTTTAAAAA TGGACCTGTT 2700

TGAGAAGCAT GTGCTTCAAA TGCCTTAATT TTAAGTTCTT TAAAATCTGT AATATCATTT 2760

TGAATATCAG GTTCTCCAAG AGCTTCGGTT GCATCATTAC TGAACGCAAC TAAAGTTAAA 2820

CGAGGGCGTT CTTCTTTAGG CATGCGTTCA ACCGTTCGAA TTACAGCGTC TGCTGTTGCT 2880

TCGTGATCAG GATGTACTGC ATATCCAGGA TAAAATSAAA TAATCAATGA TGGATTTGTA 2940

TCATCGATTA AAGATTTAAT CATACCATCT ATATGTTCAT AGGGTTCAAA TTCGACAGTT 3000

TTGTCACGTA AACCCATTTT TCTTAAATCA GTAATACCGA TAACTTTACA AGCTTCTTCT 3060 

AGTTCACGCT CACGAATACT TGGTAATGAT TCGCGTGTTG CAAATGGGGG ATTACCTAAA 3120

TTTCTGCCCA TTTGTCCTAG GGTTAAACAT GCATATGTTA CAGGTATGCC TTTTTGGATA 3180

TAATTTGCTA ATGTGCCTGC AGATGAGAAG GTTTCATCAT CAGGATGTGG AAATATTACT 3240

AATACATGTC TTTCGTCAGT CATGTTGATG CCTCCTCTAT AAATTAAATG GTCGCTCACT 3300

AATTTGAAGT GCTGCAGCGA GTTGACCTTC GTAATTAAAA CCTGCAATTA AAAATTCATC 3360

ATGCTCATTG ACCTCAAAAT GCGTTAGACC TTGTACATAA ACCCAACCAC CATTTGATAG 3420

TTTAAGACCA ATGCGATAAG GTTCTTTATT ACCACCTTTT AGTTGTGCAT GCGTATATGT 3480

TATTTGTATG TTTCTTAAAA AAGTACCAGC ATTAAAAACA CGTGATCGA AATGGTTCGC 3540
ATAGGCCCCA TTTGTCGTTT CAACATGCAG ATACACAGGT TTATGTTCAA AAGAAGCAAG 3600 
TAAATCTATA ACTTCTTGTT CTTTAATTGG TTCCAACACG TTCACTCCTT ACACTATCAA 3660 
TGTGTTTATC TTTCTATTTT ACTAAAAACT ATTCGATAAT TGTATACGAT TGCTCAATTA 3720 
TTTATAAATT AATTTTCATG AAGGGTAATT ACTCAGGATT ACGTAATCAT ACAGCATTAG 3780

TTTTTTACTT TTAAAAATCA AAAATTTGTT GGAATTTGAA AAGTGTTAAA CATTAAAAAT 3840

GATGCTATAT TAATGGTGTA TGAATGAATT CATAAGTTTT TAAAATGTAT TAAATTTGTG 3900

GAGGCATGTA AACAATGAAA GTATTAAACT TAGGATCGAA AAAACAAGCA TCATTCTATG 3960

TTGCATGTGA GTTATATAAA GAGATGGCAT TTAATCAGCA CTGTAAACTA GGTTTAGCAA 4020

CTGGTGGTAC AATGACAGAT TTGTATGAAC AACTTGTTAA GTTGTTAAAT AAAAATCAGT 4080

TAAACGTAGA CAATGTATCC ACGTTTAATT TAGACGAATA TGTAGGTTTA ACCGCATCAC 4140

ATCCGCAAAG TTATCACTAT TATATGGATG ACATGCTTTT CAAACAATAT CCTTATTTTA 4200

ATAGAAAGAA CATTCATATT CCAAATGGAG ATGCCGATGA TATGAATGCG GAAGCGTgCA 4260

AAATATAATG ACGTTTTAGA ACAACAAGGT CAACGTGATA TTCAAATTTT AGGTATTGGT 4320

GAAAATGGTC ATATTGGATT TAATGAACCT GGTACGCCGT TTGATAGCGT TACTCATATC 4380

GTTGATTTGA CTGAAaGTAC TATTAAGGCT AATAGTCGAT ATTTTAAAAA CGAaGATGAT 4440

GTTCCAAAGC AAGCCATTTC GATGGGACTT GCTAATATTC TTCAAGCCAA ACGTATCATT 4500

TTACTCGCAT TTGGTGAAAA GAAACGTGCT GCTATTACAC ATTTATTAAA TCAGGAAATT 4560

TCTGTTGATG TTCCAGCCAC ATTACTTCAC AAACACCCGA ATGTTGAGAT ATATTTAGAC 4620

GACGAAGCTT GCCCGAAAAA TGTTGCGAAA ATTCATGTCG ATGAAATGGA TTGATTGCAA 4680

TGTTTAATTA AGAAATGCCT CGGGAAAGGT TCCAATAGAA AGATAAAAAG CATTGGA0G 4740

ATGATTTTTA GTGGAATTAC AATTAGCAAT TGATTTATTA AACAAAGAAG ACGCGGCTGA 4800

GTTAGCAAAT AAAGTAAAAG ATTATGTAGA TATCGTAGAA ATCGGTACGC CAATCATTTA 4860

CAACGaAGGT TTACCAGCAG TTAAACATAT GGCAGACAAC ATTAGTAATG TAAAAGTATT 4920

AGCAGACATG AAAATTATGG ATGCAGCTGA TTATGAAGTT AGCCAAGCAA TTAAATTTGG 4980

CGCGGATGTA ATTACAATAC TAGGTGTTGC AGAAGATGCA TCAATTAAAG CAGCTATTGA 5040

AGAAGCTCAT AAAAATAATA AACAATTACT AGTTGATATG ATTGCTGTTC AAGATTTAGA 5100

AAAACGTGCA AAAGAACTAG ATGAAATGGG TGCTGATTAT ATTGCAGTAC ACACTGGTTA 5160

TGATTTACAA GCAGAAGGGC AATCACCATT AGAAAGTTTA AGAACCGTTA AATCTGTTAT 5220 

TAAAAATTCT AAAGTTGCAG TAGCAGGTGG AATTAAACCA GATACAATTA AAGATATTGT 5280

CGCTGAAAGT CCTGATCTTG TTATTGTTGG TGGCGGAATC GCAAATGCAG ATGATCCAGT 5340

AGAAGCTGCG AAACAATGTC GCGCTGCAAT CGAAGGTAAG TAATATGGCT AAATTTAGTG 5400

ACTATCAATT AATTCTAGAT GAATTAAAGA TGACTTTGTC ACATGTTGAA GCGGATGAGT 5460

TTTCAACTTT TGCATCCAAA ATACTACATG CTGAACATAT ATTTGTAGCT GGCAAAGGAC 5520

GTTCAGGATT CGTGGCGAAT A5TTTTGCAA TGCGCTTAAA TCAGCTCGGC AAACAGGCAC 5580

ATGTTGTTGG AGAATCAACG ACACCTGCGA TTAAGTCGAA TGATGTATTT GTAATTATCT 5640

CTGGTTCAGG TTCCACGGAA CATTTAAGAT TATTAGCAGA CAAAGCAAAA TCAGTAGGTG 5700

CTGACATCGT ATTAATTACT ACAAATAAAG ATTCTGCAAT AGGCAACCTA GCTGGGACGA 5760

ACATCGTTTT GCCTGCAGGT ACAAAATATG ATGAACAAGG CTCGGCACAA CCATTAGGAA 5820 

GTTTGTTTGA ACAAGCATCT CAATTATTTT TAGATAGTGT TGTAATGGGA TTGATGACTG 5880

AAATGAATGT TACGGAACAA ACGATGCAAC AAAATCATGC TAATTTAGAA TAAAATAAAG 5940

ATAGTCGATA ATATGATGCC TAGGCAGAAA TATTATCGAT TATTTTTTTA TTTAAATAAT 6000

AAATTATAGT ATAATATCAA TAATAAACGA ATAGGGGTGT TAATATTGAA GTTTGACAAT 6060

TATATTTTTG ATTTTGATGG TACGTTGGCA GACACGAAAA AATGTGGTGA AGTAGCAACA 6120

CAAAGTGCAT TTAAAGCATG TGGCTTAACG GAACCATCAT CTAAAGAAAT AACGCATTAT 6180

ATGGGAATAC CTATTGAAGA ATCATTTTTA AAATTAGCAG ACCGACCATT AGATGAAGCA 6240

GCATTAGCAA AGTTAATCGA TACATTTAGA CATACATATC AATCTATTGA AAAGGACTAT 6300 

ATTTATGAAT TTGCGGGTAT AACTGAAGCC ATTACAAGTT TGTATAACCA AGGSAAAAA 6360 

CTTTTCGTGG TGTCTAGTAA GAAGAGTGAT GTATTAGAAA GAAATTTATC GGCTATTGGA 6420

TTAAATCACT TGATTACCGA AGCTGTTGGA TCCGATCAAG TAAGTGCATA TAAACCAAAT 6480

CCTGAAGGCA TACACACAAT TGTGCAACGC TACAATTTAA ATAGCCAACA AACGGTGTAT 6540

ATTGGTGATT CAACGTTTGA TGTTGAGATG GCACAACGTG CTGGTATGCA ATCTGCAGCT 6600 

GTCACTTGGG GTGCACATGA TGCAAGGTCA TTACTTCATT CAAATCCGGA TTTTATTATT 6660 

AATGATCCAT CAGAAATTAA TACCGTATTA TAAAACTTGT TAAAACAGAG AATACCATGG 6720 

TTAAAATGCA TATTCATAAA TATTAGATTA TACTTAjAAA TATTTCGCTT TAGATTAGGA 6780

ATTTAAAATA AATATTTATT AAACATTATG AATTTTTAAA GAGTAATGTC TGACTCGTTG 6840

ATAATTTATT TTTGTAAAAA TAAATTAAAG TAATGACAAA GTTATTGAAG TAAATTGAGT 6900 

ATAAACATTT AAATACGATG TCGAAAATGG CGATAGCATA TCACTTACAT GAAGTTGTGT 6960 

GctATCGCTA TTTTTAGTTA TAATTCCAAA AAGTTAATCG TTCGATGATT TAAGAATTAT 7020 

TATTGTTTAA TTCAAATGTA TGAGGGTATA AAATCATTGA ATTTAATTCG ATAAAGCGAA 7080

ATTTTTGAAC AAACATACTT TTGTATTTAT ATAAAAGTTT AAATTCTTAT AAATTTGACA 7140

AAACTAATTA ACTCCGTATA ATTATGAAAC ATACAAGAGG GAGTGTATGA ATTCATGGAT 7200

TTTAATAAAG AGAATATTAA CATGGTGGAT GCAAAGAAAG CTAAAAAAAC CGTTGTTGCA 7260

ACCGGTATCG GTAATGCAAT GGAATGGTTC GATTTTGGTG TCTATGCATA TAcAACTGCG 7320

TACATTGGAG CGAACTTCTT CTCTCCAGTA GAGAATGCAG ACfflTCGACA AATGTTGACT 7380

TTCGCAGCAT TAGCCATTGC GTTTTTATTA AGACCAATTG GTGGTGTCGT ATTTGGTATT 7440

ATTGGTGACA AATATGGACG TAAAGTTGTA TTAACATCTA CAATTATTTT AATGGCATTT 7500

TCAACATTAA CCATTGGATT ATTGCCAAGC TATGATCAAA TTGGACTTTG GGCACCAATA 7560

CTATTATTGC TTGCAAGAGT ACTACAAGGG TTTTCAACAG GTGGAGAGTA TGCGGGGGCA 7620

ATGACATATG TTGCCGAATC ATCTCCAGAT AAGCGTCGTA ACTCATTAGG TAGTGGACTA 7680

GAAATTGGGA CATTATCAGG TTACATAGCT GCTTCAATTA TGATTGCTGT ATTAACATTC 7740

TTTTTAACAG ATGAACAAAT GGCATCATTT GGTTGGAGAA TCCCATTCTT ACTCGGTTTA 7800

TTCCTAGGAT TATTCGGCTT ATATTTACGT CGTAAGCTGG AAGAATCACC AGTTTTCGAA 7860

AATGATGTTG CAACACAACC AGAAAGAGAT AACATTAACT TTTTACAAAT CATCAGATTT 7920

TATTACAAAG ATATATTTGT ATGTTTTGTA GCTGTTGTAT TCTTCaATGT TACAAACTAT 7980

ATGGTAACTG CATATTTACC AACCTATTTA GAACAAGTTA TTAAATTAGA TGCAACGACA 8040

ACAAGTGTAT TAATTACTTG TGTCATGGCA ATAATGATTC CATTAGCATT AATGTTTGGT 8100 

AAGTTAGCGG ATAAAATAGG TGAAAAGAAA GTATTTCTAA TTGGTACTGG TGGGCTAACA 8160 

TTATTCAGTA TCATCGCATT TATGTTATTA CATTCACAAT CATTTGTTGT AATAGTAATC 8220

GGTATATTTA TATTAGGATT TTTCTTATCA ACTTACGAAG CGACAATGCC AGGGTCGTTA 8280

CCAACGATGT TTTACAGTCA TATAAGATAT CGAACTTTAT CAGTAACATT TAATATCTCT 8340

GTTTCGATAT TTGGTGGTaC GaCGCCATTA GTkGCAmCaT GGTTaGTTAC GAAAACTGGA 8400

GATCCATTAG CmCCTGCGTA TTATTTAACA GCAATCAGTG TTATTGGCTT TTTAGTTATT 8460

ACATTCTTAC ATTTAAGTAC AGCAGGAAAA TCTCTAAAAG GTTCGTATCC AAATGTAGAT 8520

AACGAGCAAG ATAGAGCTTA TTATGCAGAA CATCCAAAAG AAGCATTATG GTGGGTTffiA 8580

GAACGTAAGA ATTAGAGATT TTAATaAAAA GTATAAATCA ATCGTATATA AGCACTTTAA 8640

AGCTAGTAGG TTCTGCTAAC TTTAAAGTGC TTTTTAAATT GAGAACTGTA ATTAGCCGTA 8700

ATAAAGTTTT TGTATATACA TAAACCCCCA CTGCAATGAT TATCGCAATG GGGGAAAGAG 8760

GGGACTTAAA GCATATGTTT AGCTTTGAAT ACTTAAAATT CTCTTGCTAT TGAAATGTTA 8820

GGATGTAAAT ATGTCTTAGA GTATTTTGTC CAACGCAATT AATATTGAGA CTCTAACCTT 8880

CAATATTATT ATAGAGAACA CAAACTTAAA TAGATTGGGT GACTTATTTG TGTCAGTTAT 8940

TGCGATTGCG ATAACTTCTT TTCTCTATAT ACATATAGTA ACGTCTTATC TAATAAAAAA 9000

CATGGTACTA CAGTATCAAA TTTATCTAGG GCTTAAGTTT GATTTTTATA ATAGGCAGGT 9060

TTACCTGATA AAAATACTTA TTCATTATAT AATGTTAACA ATATGTATTT TAAAGTTTAC 9120

ATTGAGTGAG GGATATTGAT GAACGTAATT TTAGAACAGT TGAAAACACA TACTCAAAAT 9180

AAACCTAATG ACATAGCATT ACATATCGAT GATGAAACAA TTACATATAG TCAACTAAAT 9240

GCCCGCATCA CTAGCGCAgT TGAATCTTTG CAGAAATATT CACTTAACCC TGTCGTTGCT 9300 

ATTAATATGA AATCACCGGT GCAAAGTATT ATTTGTTATT TAGCTTTGCA TCGTTTACAT 9360 

AAAGTGCCTA TGATGATGGA S3GTAAATGG CAAAGTACTA TACATCGTCA ATTGATTGAA 9420

AAATATGGTA TTAAAGATGT AATTGGAGAT ACAGGTCTCA TGCAGAATAT AGACTCACCG 9480

ATGTTTATTG ATTCAACGCA ATTACAGCAC TACCCCAATT TATTACATAT TGGTTTTACT 9540

TCAGGGACAA CTGGACTGCC AAAAGCATAT TATCGTGATG AAGATTfiTG GTTGGCTTCT 9600

TTTGAAGTTA ATGAAATGTT GATGTTAAAA AATGAAAATG CAATAGCAGC CCCTGGACCA 9660

CTATCGCACT CGTTAACATT ATATGCGTTA TTGTTTGCTT TAAGTTCCGG TCGTACTTTT 9720

ATAGGACAGA CCACTTTTCA TCCTGAAAAG TTACTTAATC AATGTCATAA AATATCATCA 9780

TACAAAGTTG CTATGTTTCT TGTTCCAACG ATGATTAAAT CATTATTGTT AGTTTACAAC 9840

AATGAACATA CAATCCAATC ATTTTTTAGC AGTGGAGATA AGCTGCATTC TTCTATTTTT 9900

AAAAAGATAA AAAATCAAGC AAATGACATA AATTTGATTG AATTTTTTGG TACATCGGAA 9960 

ACCAGTTTTA TCAGCTATAA CTTGAATCA5 CAAGCACCAG TTGAATCAGT AGGTGTGCTA 10020 

TTTCCAAATG TGGAATTGAA AACAACGAAT CACGATCACA ATGGTATAGG AACTATTTGT 10080 

ATAAAAAGTA ATATGATGTT TAGTGGCTAT GTAAGTGAAC AATGTATAAA TAATGATGAA 10140

TGGTTTGTTA CTAATGATAA TGGCTATGTA AAAGAGCAGT ATTTATATTT AACGGACGT 10200 

CAACAGGATA TGTTAATTAT TGGTGGTCAA AATATATATC CAGCACATGT TGAACGCCTT 10260 

TTAACGCAAT CTTCGAGCAT TGATGAAGCA ATTATCATCG GTATTCCAAA TGAGCGTTTT 10320 

GGTCAAATAG GCGTATTGCT TTATTCTGGT GATGTGACAC TTACACATAA AAATGTAAAA 10380

CAATTTTTAA AAAAGAAAGT GAAaCgnTaT GAAATTCCAT CGATGATTCA TCATGTAGAA 10440

AAGATGTATT ACACTGCAAG tGGTaAAATT GCTAGAGAAA AAATGATGTC GATGTATTTG 10500

AGAGGTGAAT TATAATATGA ATCAAGCAGT CATAGTTGCA GCTAAACGAA CTGCATTTGG 10560

GAAATATGGT GGCACTTTAA AACATTTAGA GCCaGACAA TTGCTTAAAC CTTTATTCCA 10620

ACATTTTAAA GAGAAGTATC CAGAGGTAAT ATCTAAAATA GATGATGTAG TTTTAGGTAA 10680

TGTTGTTGGG AATGGTGGCA ATATTGCAAG AAAAGCATTG CTTGAAGCGG GGCTTAAAGA 10740 

TTCAATACCT GGCGTCACAA TCGATCGGCA ATGTGGGTCT GGACTTGAAA GTGTTCAATA 10800 
TGCATGTCGC ATGATCCAAG CCGGAGCTGG CAAGGTATAT ATTGCAGGTG GTGTTGAAAG 10860

TACAAGTCGA GCACCTTGGA AAATCAAACG ACCGCATTCT GTGTACGAAA CAGCATTACC 10920 

TGAGTTTTAT GAGCGTGCAT CATTTGCACC TGAAATGAGC GACCCATCAA TGATTCAAGG 10980 

TGCTGAAAAT GTGGCCAA3A TGTATGATGT TTCAAGAGAA TTACAAGATG AATTTGCTTA 1104 0 

TCGAAGTCAT CAATTGACAG CGGAAAATGT AAAGAATGGA AATATTTCTC AGGAAATATT 11100 

ACCTATAACC GTTAAAGGAG AAATATTCAA CACTGATGAA AGTCTAAAAT CACATATTCC 11160 

GAAAGATAAC TTTGGCCGAT TTAAGCCCGT GATCAAAGGT GGfiCCGTTA CCGCTGCGAA 1122 0 

TAGTTGTATG AAAAATGATG GTGCAGTTTT ATTGCTTATT ATGGAAAAAG ATATGGCATA 11280 

CGAATTAGGT TTCGAGCATG GTTTATTATT TAAAGATGGT GTTACGGTAG GTGTTGATTC 11340 

TAATTTTCCT GGCATTGGTC CAGTACCAGC CATTTCCAAC TTACTAAAAA GAAATCAATT 114 00 

AACGATAGAA AATATTGAAG T CAT T G AAAT TAAC GAAGCG TTCAGTGCAC AGGTAGTTGC 114 60 

CTGCCAACAA GCTTTAAATA T T T C AAAT AC GCAATTAAAT ATATGGGGTG GTGCATTAGC 11520 

ATCAGGTCAT CCATACGGTG CAAGCGGTGC CCAATTAGTG ACTCGATTAT TTTATATGTT 11580 

T G AC AAAG AG ACTATGATTG CATCTATGGG GATAGGGGGA GGTCTAGGAA ATGCAGCATT 11640 

ATTTACTCGA TTCTAACCAG CGATTAAATG TGTCATTTTC TAAGGATAGT GTGGCTGCAT 11700 

ATTATCAGTG TTTTAACCAA CCTTATAGAA AAGAAGTACC ACCATTAATG TGTGCGTCAT 11760 

TATGGCCAAA ATTTGATTTA TTTAAAAAAT ATGCAAATAG CGAACTGATT TTAACAAAAT 11820

CAGCAATTAA TCAAACTCAA AAGATAGAAG TAGACACAAT ATATGTAGGG CATTTAGAAG 11880 

ATATTGAATG CCGACAGACT CGCAATATCA CACGTTATAC AATGGCTTTA ACATTAACTA 11940 

AAAATGATCA ACATGTCATA ACGGTtACAC AAACTTTTAT TAAGGCGATG AAGTAGAGAT 12000 

GGAGTTTAAT GAGATATGGA TAAATGAATA TTTGGCGCTC GTAAATGATG AT AAT CCAAT 12060 

ACATAATGAG ATTGTGCCAG GACAATTAGT GAG T C AAAT G ATGCTGATGG CTATGTCATT 12120 

AGAGACAAAC CAGTGTCAAa TTAACTACGT TAAACCTATT TTAATAAATG AAAATATCGA 12180 

ATTCATTGAA CAACACGAAC ACGAAAT TAT ACE AAT T AAT GACGATGGAG AG AT T AAAAT 1224 0 

AAAAATTTCT TTGAGCACAA AAAAATAACC GATATTAGCT GCATGAACGC ATATTAATTA 12300 

GGAGATGAAA GGACAGCTAA TATCAGTTAT GTATTGTTAT TATTATTGGG AACAGAGATG 12360 

AATATAGGTT ACGTTTCTTT CTTTGCACGG GGATGCATTA AT C T AAAAT A ATAATAASA. 12420 

CTATATCAAT GTTTAATAAA TTCTGGATTA TTGGAACGAT TAGTCAATTT AACTAACTTT 124 80 

CATATGATCT ATATCGTCTT GTAATAAAGA GAGCAATTTG AATATTTCAG TATCACTAAA 1254 0 

TGAATCGTCA CAT T T AAT T G AAACATGCTG AAACGTTTTG GTTATAATTT CAT AAAC TGG 12600 
TGCGCCTTCA TGGTGATACT GTCGATAAAT AATCATAACC TATATTACCT CCTTTGCTAC 12660

TCTATGGTTA TATTATAAAT AACATTTTTA TGTGTGACAT CAACCTTAAG TATCAACTTT 12 72 0 

T TAT C AG AC A TAGAACGTAT GATTTACTAA GACTATTTAT GTATAAAAGT TCTAAATAAA 1278 0 

TATATATTTA TAGAGTCGCC TGGCAGTCAT TTGGGaAATA TAACATATAT GATTAGAGAG 12840
GCATCTATCG CAAAAGAATG ATAATGATAG AGGTATTGAG CATATAGATG AGTTTAAGTT 12900 
CATCTTGAAA ATAAAGGGTT ATTTAGTCAT AGATGTAGAT GTATAGGAAA TATTTGTATG 12 960 

TATTGTTCGA TATGTATGAA ATTTTCAATA AAAGCTAATA ACGCTTATAT GTAACTTTCA 13020

AATTTAAATT ATATACAGAG CATGATGATT ATAAAAAAAT AACCACATCA CATAAATTGA 1308 0 

GTTCATACCC AATTTAAGTG GTGTGGCTAA TAATGTTGAT T TAT AG AT G A ACCGCCTAAT 1314 0 

CGTTAAACCT CTGTTACTTC AACATCGATA TGTTCAATAC GGTTGTATGC ACCGTGATCC 13200 

ACAGGACCAA CAAAATCAT T CATTTTCCAA CCGTTTTTAA TAGCAGAAGC GACGAAAGCT 132 60 
TTCGCGCTAA TCACAGCTTC TTTCGGTGAC TTACCGTTAG CTAAATATGC AGTTGTTGCC 13320 
GCAGCAAATG TACAACCAGC ACCATGGTTA TAACTTTGTT GGAACATGTC TGTTGTTAGT 13380 

TGATAAAATG TTTGACCATC ATAGTATAAG TCATACGATT TATCTTETC TAAAGCTTTG 13440 
CCACCTTTAA T GAT G AC AT G CTGTGCGCCT T T AT CAAAGA TAATTGTTGC AGCCTTTTTC 13500 
ATATCTTCAA TTGAATTTAA TTTACCTAAT CCTGATAATT GACCCGCTTC AAATAAGTTT 13560 
GGTGTCACTA CCGTTGCTTT AGGTAGTAAA TATTTAATCA TCGCCTCAGT ATTTCCAGGA 13 620 

TTAAGCACTT CATCTTCGCC TTTACAAACC ATGACAGGAT CTACTACAAA ATATTGTGCA 13 680 

TTAGATGCCT CATATACTTC TCCAGCACGT TTGATTATCT CCTCAGTACC TAACATACCT 13740 
GTTTTAATAG CATCAGGTCC GAT T GAT AAA GCCGTTTCAA GTTGTTTTTC AAATACATCC 13800 

ATTGGTAATG GTGTAACATC GTGTGACCA 1 GTATCTTTAT CCATAGTAAC GATGGCAGTT 138 60 
AAAGCGACCA TGCCATACGT ATCTAATTCT TGGAACGTTT TCAAATCTGC TTGCATACcT 13920 
GCGCCAGcAC TTGTGTCAGA ACCGGCAATT GTTAAAACTT TCTTTAAAGC CATTGAGCTT 13980 

CACTCCTACA TAATAATATT GTATTCATCA TATCATTTTT AACCTAATTG AAAATATTA 14040 
AGCATTCAAT AT TT GAT GAT TGTTGAAATG AATCATTCAT ACTATTGTAA CTTTTGAAAA 14100 
TGTCATTCAC TTTAGATAAG TGTGATATGT TAAAATATGT CCTGAGGTGA GATTGAATGG 14160 
AATGGTCGCA AATTTTTCAT GACATAACAA CGAAACATGA CTTTAAAGCT ATGCATGATT 14 220 

TTTTAGAAAA AGAATATTCG ACTGCAATCG TATACCCTGA TAG GG AAAAT ATATATCAAG 14 280 
CGTTTGATTT AACACCGTTT G AAAAT AT C A AAGTTGTTAT ATTAGGACAA GACCCGTATC 14 340 

ATGGTCCAAA CCAAGCACAT GG AT T AGCAT TTTCAGTGCA ACC TAACGCA AAATTCCCTC 14 4 00 

CATCTTTACG TAATATGTAT AAAG AAT TAG CAGATGLTAT TGGATGCGTT AGACAAACAC 144 60 
CGCATTTACA AGATTGGGCA AGAGAAGGCG TCTTGTTATT GAATACAGTT TTAACCGTAA 14520

GACAGGGTGA AGCAAATTCT CATCGTGATA TTGGTTGGGA AACATTTACT GATGAAATTA 14 580 

TTAAAGCAGT GTCTGATTAT AAAGAACATG TTGTCTTTAT TTTGTGGGGG AAACCTGCAC 14 640 

AGCAAAAAAT AAAGCTTATC GATACATCTA AACATTGTAT TATAAAATCA GTGCATCCTA 14 700 

GTCCACTGTC TGCATATAGA GGATTCTTTG GATCAAAACC GTATTCCAAA GCGAATGCCT 14 7 60 

ATTTAGAGTC AGTAGGAAAA TCACCAATTA ATTGGTGTGA AAGTGAGGCG TAGATGTTGA 14 820 

ATAGAGAAAC TTTAATAGCA CGAATTGAGC AAGAATTAGT ACAAGCAGAG CAGGCACAGC 14 88 0 

ATGACCATGA CTTTGAAAAA CATATGTATG CCATACATAT ATTAACATCT TTATATGCTT 14 940 

C AAC AT C AAA TACACCACAT ATTGGTGAAC AACAAATGAA TCGTCGTATT GCTAACCATA 15000 

ATCAAATGCC AC AAT CACAA ATAACGCAGC C AAC T CAT C A AGEACAGTT GCTGAAATTG 15060 

AAGCGATGGG TGGTAAAGTA AATACGCATT CAGCACATCA TCATAATAAG TCATATTCAC 1512 0 

AACCTTCAAA CCAACAACAA AGATTAGCGA C AG AT GAT G A CATTGGCAAT GGTGAATCCA 15180 

TATTTGATTT TTAAAAAGCA ACAATGAAAC ATAATTACTT AATAGCTTGT TAAGTATGTA 15240 

GGTTAATAAT CAAGACGCAT ATACTTTTAT TCGAGTGTTC GGATTTAAAC ATTTATTAAT 1530 0 

ACTGAATTAT ATAAGGAGAG GTAGCAATGA AAT T ATT TAT TATTTTAGGT GCATTAAACG 15360 

CGATGATGGC TGTCGGTACA GGTGCATTTG GTGCGCATGG TTTACAAGGA AAAATAAGTG 1542 0 

ATCACTATTT ATCAGTATGG GAAAffiGCAA C G AC G TAT C A AATGTACCAT GGCT TAGCAT 154 8 0 

TATTAATTAT AGGTGTAATT AGTGGTACAA CTTCAATCAA TGTTAACTGG GCTGGCTGGT 1554 0 

TAATATTTGC TGGTATTATT TTCTTTAGTG GATCATTATA TATTTTAGTA TTAACTCAAA 15 600 

TTAAAGTTTT AGGTGCGATT ACGCCAATTG GTGGCGTATT GTTCATCATT GGATGGATAA 15660

TGTTAATCAT TGCGACATTC AAATTTGCTG GTTAAATTTT AAAAC T T TAG ATTACCTATG 15720 

TAACTAAACA TTAAATTTTT AATAAAAATA ATCAAGAAAA AG AG T T AC AA ACTCATCTTT 157 80 

TGGGTATAGA ATACCTTCGA GGTGAGTTTT TATTTATGGA AAAAAAGAAT AAGCAAATAG 1584 0 

AT AGAGGC GA T T T AAAAC AA AACCTATCTG AAAAGTTTGT ATGGGCGATT GCATATGGTT 15 900 

CATGTATCGG ATGGGGCGCA TTCATCTTAC CAGGAGACTG GAT T AAGCAG TCAGGTCCGA 15 960 

TTGCAGCATC AATTGGTATA GTTATTGGTG CAT TAT T AAT GATATTAATT GCGGTTAGTT 1602 0 

ATGGCGCATT AG T AG AG AG A TTTCCAGTAT CiSGGGGCGC GTTTGCCTTT AGTTTCTTAA 16080 

GTTTCGGCAG AT AT G T GAG T TTCTTCTCAT CATGGTTTTT AACTTTTGGT TATGTCTGTG 1614 0 

TCGTTGCTTT AAAtGCGACC GCATTCAGTT TACTAGTTAA ATTCTTATTG CCAGATGTCT 16200 

TAAATAATGG GAAAC TAT AC ACCATTGCGG GCTGGGACGT TTATATTACG GAAATCAHA 162 60 






TTGCGACCGT ATTACTACTT GTATTCATGC TAGTAACGAT TCGTGGCGCA AGTGTATCTG 16320 

GATCATTACA ATATTATTTC TGTGTGGCGA TGGTAATCGT CGTATTATTG ATGTTCTTTG 1638 0 

GTTCATTCTT TGGTAATAAT TTTGCACTTG AAAAT T T AC A ACCGTTAGCT GAACCTAGCA 16440 

AAGGATGGTT AGTGTCTATT GTGGTTATTG TATCCGTGGC ACCATGGGCA TATGTTGGAT 16500 

TTGATAATAT TCCACAAACA GCAGAAGAGT TTAACTTTGC ACCAAACAAG ACATTTAAGC 165 60 

TTATCGTGTA CAGTTTATTA GCAGCATCAT TAACTTATGT TGTCATGATT TTATACACTG 16620 

GTTGGTTATC AACAAGTCAT CAAAGTTTAA ATGGGCAGTT GTGGTTAACA GGTGCTGtTA 16680 

CACAAACAGC ATTTGGTTAT ATTGGATTAG GTGTATTAGC AATTGCAATT ATGATGGGTA 16740 


TATTTACTGG TTTAAATGGA TTCTTGATGA GTTCAAGTCG CTTGTTATTT TCTATGGGAC 16800 

GTTCAGGTAT TATGCCAACA ATGTTTAGTA AATTACATAG TAAATACAAA ACACCATATG 16860

TCGCAATCAT ATTCCTAGTA GGAGTGTCGT TAATTGCACC TTGGCTAGGA AGAACTGCAT 16920 

TGACTTGGAT TGTAGATATG TCATCTACTG GTGTATCCAT TGCCTACTTT ATTACATGTT 16980 

TGTCTGCAGC GAAATTATTC AGTTATAACA AACAAAGTAA TACGTATGCA CCGGTTTACA 1704 0 

AAACGTTTGC TATTATCGGC TCATTTGTAT CATTCATTTT CTTAGCGTTG TTATTAGTGC 1710 0 

CAGGTTCTCC TGCAGCACTG ACTGCACCGT CTTATATTGC ATTACTTGGA TGGTTAATCA 17160 

TCGGTTTAAT ATTCTTTGTG ATTCGATATC CTAAATTGAA AAAT AT GGAT AATGATGAAT 17220 

TAAGTCGCTT GATTTTAAAT AG AAG T G AAA ATGAAGTTGA TGATATBTT GAAGAACCTG 17280 

AAAAAGAAAA AACTAAATAA TAAAAGAATC GCACAATAAA CCTTCTTCAT TCGGAGGCGT 17 34 0 

ATCGTGCGAT TTTTTGTATT ATAAATTGAC ATT T AAG AC G AGGCAGCTGA ACCTTATATA 17400 

TAATTGCTAA GAGTTAGGGC TGAGCCATTT CTAACAAATA TTTATAATCG TTTAAAAGAT 174 60 

TTCACGAACC CAGAAACAAT TAATTTGGAA ATTTGGTCGG CGAATAATAA ACCTAATGCG 17520 

ATGGCGCCTG CAATAAGTGT AACCTCTAGC ATGGTATTGA TTGCTGTACT G AAAT T T AAT 17580 

AAGACTAAAT TTTTTGTAGC ATCGTATGCT AAGCCACCAG GTAC TAATGG AAT GAT AC C C 17 640 

GTTACCATAA AAATGATGGC AGGTTCTTTT TGTTTACGAG C CAT AT AAT G ACTTAACAAG 17700 

CCTAATGCTA AACTACCAAA GAAACTAGAG TATATAGTGT GCAC AT T AAA GCCGTTGAAG 17760 

AATAAGGTGT AAACCATCCA TCCACACGTA CCAACGAAAC CACATGATAG ATATAATTTT 17820 

CTAGGTGCAT CAAAAATGAC GCAGAA 17846
(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110:

ATTGACACTT GGTGAAAGTA ATATCGCCGC GCTATTTTGG CAAAATGGAC ACTTAGAACC 60 

TGAGTTACAA GATGAACAGC CAATTAATAT ATTAGGATCT GkTCAAATCA ACGAATGGAA 120 

TGGTAATCAA TCACCGCAAA TAATTATTCA AGATATTGCG AT GAATGAAC AGCAAATATT 18 0 

AG AT TAT AG A AGTAAGCGAA AAAGTTTACC TTTTACAGAA AAT GAT G AAA ATATTGTCGT 24 0 

GCTTATTCAT CCTAAAAGTG ATAAAGTAAA TGCGAATGAA TATTATTATG GTGAAGAAAT 300 

TAAACAACAA ACTGATAAAG TAGTATTAAG AGATTTACCA ACGTCAATGG AAGACTTGTC 360 

TAATTCCTTG CAACAACTGC AATTTTCTCA JCTTTATATA GTTTTGCAAC ATAATCATTC 420 

GATTTACTTC GATGGTATAC CTAATATGGA TATTTTTAAA AAGTGTTATA AAGCAT T AAT 4 80 

AACTAAACAA GAAACAAATA TCCAGAAAGA GGGTATGTTA TTGTGTCAAC ATTTAAGTGT 54 0 

GAAACCAGAT ACACTTAAAT TCATGTTGAA AGTTTTCTTA GACTTAAAAT TTGTAAfflCA 600 

AGAAGATGGT TTAATTCGAA TCAATCAACA ACCTGATAAA AGATCGATTG ATTCCAGCAA 660

AGTATATCAA TTAAGACAAC AACGTATGGA TGTTGAAAAG CAATTATTAT ATCAAGATTT 720 

TTCAGAAATA AAAAATTGGA TAAAGTCACA ATTGTCGTGA GCAATTTAGG AGGAAATATT 780 

AATGGATTTA AAGCAATACG TATCAGAAGT TCAAGATTGG CCGAAACCAG GTGTTAGTTT 84 0 

CAAGGATATT ACTACAATTA TGGATAATGG TGAAGCATAT GGCTATGCAA CAGATAAAAT 900 

TGTAGAATAC GCAAAAGACA GAGATGTTGA TATCGTTGTA GGACCTGAAG CGCGTGGCTT 9 60 

TATCATTGGC TGTCCTGTAG CTTATTCAAT GGGGATTGG TTTGCACCTG T T AG AAAAG A 1020 

AGGGAAATTA CCTCGTGmAG TCATTCGTTA TGAGTATGAC C TAG AAT AT G GTACAAATGT 1080 

TTTAACAATG C AC AAA GAT G CAAT TAAACC AGGTCAACGT GTGTTAATTA C AG AT GAT T T 114 0 

ATTAGCTACT GGTGGTACGA TTGAAGCAGC AATAAAATTA GTTGAAAAAT TAGGCGGTAT 12 0 0 

CGTAGTAGGT ATTGCATTTA TAATTGAATT GAAATATTTA AATGGTATTG AAAAAATTAA 12 60 

AGATTACGAT GTTATGAGTT TAATCTCATA CGACGAATAA TAAATAATAT AATTTTATCA 1320 

AATGAAATCC TTCATCAAAT GTATAAGAAC CAATGACTTA ATTAAAAAAG TTGTTTAAGT 1380 

TTTCTTAACA TGAGATGTTA GGATTTTTTA TTTACTGAAA ATGTTAGATG ATTGAGCATT 14 4 0 

ATACCTTAAT AACATCGTTT ATTTATTTCA TAAATTGTAG TATCATAGAA C T AAT AT T T A 1500 

AAAAATGAAA CAGTAGATTT AGGTCGAATT TTTGTAAAAG TTTTAAAAGT AGGAATAGTA 1560 

TACAAATTAA ACTCGCTCAA GTAAAATTAA TATTACGATT AAT GAG AC A GGATAAATAT 1620 

TTATCGTCGA CGGACGTATG ATTGGTGTGG GACAAATACT ATTCAACAAG AGTACCTAAA 168 0 
TCATTGTTTA AGGCGAAGTA ATAAATATGA ATGGGGTGTA TCATATAATG AACAACGAAT 1740

ATCCATATAG TGCAGACGAA tTCTTCACAA AGCAAAATCA TATTTGTCAG CAGATGAATA 18 00 

TGAGTATGTT TTAAAAAGCT ATCATATTGC TTATGAAGCA CATAAAGGTC AGTTCCGAAA 1860

AAACGGAT T A CC AT ACATTA TGCATCCTAT ACAAGTTGCA GGTATTTTAA CAGAAATGCG 192 0 

ATTAGACGGA CCGACGATTG TCGCAGGTTT TTTGCATGAT GTAATTGAAG ATACACCGTA 198 0 

TACATTTGAA GATGTAAAAG AAATGTTGAA TGAAGAAGTT GCTCGAATTG TTGATGGTGT 2 04 0 

GACGAAGCTT AAAAAAGTAA AATACCGCTC AAAAGAAGAA CAACAAGCTG AAAATCATCG 2100 

CAAGTTATTT ATTGCGATTG CCAAAGATGT ACGCGTAATT TTGGTGAAAT TAGCAGACAG 2160 

AT T AC AT AAT ATGCGTACCT TGAAAGCCAT GCCGCGCGAA AAACAAATTA GAATTCTCG 2220 

AGAAACATTA GAAATTTATG CACCATTAGC ACATCGTCTT GGTATTAATA C AAT C AAAT G 22 8 0 

GGAACTAGAA GATACGGCTC TTCGTTATAT TGATAATGTG CAATATTTTA G AAT AG T C AA 234 0 

TTTAATGAAG AAGAAACGTA GTGaACGTGA AGCGTATATC GAAACGGCTA TTGATAGAAT 2400 

ACGTACTGAA ATGGACCGAA TGAATATCGA AGGCGATATA AATGGTAGAC CTAAACATAT 24 60 

TTACAGTATT TATCGGAAAA TGATGAAGCA GAAAAAACAA TTTGATCAAA TTTTTGATTT 252 0 

GTTGGCGATA CGTGTTATTG TCAATTCTAT TAATGATTGT TATGCGATAC TTGGGTTGGT 258 0 

GCATACGTTA TGGAAACCGA TGCCAGGACG TTTTfflAGAT TATATTGCAA TGCCTAAACA 2 64 0 

AAATTTGTAT CAGTCATTGC ATACTACAGT AGTAGGCCCA AATGGAGACC CGCTCGAAAT 2700 

CCAAAT AC GA ACGTTTGATA TGCACGAAAT TGCTGAGCAT GGTGTTGCAG CACACTGGGC 27 6 0 

TTACAAAGAA GGTAAAAAAG TAAGTGAAAA AGATCAAACT TATCAAAATA AGTTAAATTG 28 20 

GTTAAAAGAA TTAGCTGAAG CGGATCATAC ATCGTCTGAC GCTCAAGAAT TTATGGAAAC 28 8 0 

CTTAAAATAT GACTTACAGA GTGACAAAGT ATACGCATTT ACCCCAGCGA GTGATGTTAT 2 94 0 

TGAGTTGCCA TATGGTGCTG TGCCGATTGA TTTTGCTTAT GCGATTCACA GTGAAGTAGG 3000 

TAATAAGATG ATTGGTGCCA AGGTGAATGG CAAAATTGTA C C AAT T G AC T ATATTTTACA 3 0 60 

AACAGGCGAT ATTGTTGAAA TACGTACTAG TAAACATTCA TATGGACCAA GTCGTGATTG 3120 

GTTGAAAATT GTTAAATCGT CTAGTGCCAA AGGTAAAATT AAAAGTTTCT TCAAAAAACA 318 0 

AGATCGTTCA TCTAATATTG AAAAAGGCCG AATGATGGTT GAGCTGAAA TAAAAGAGCA 32 4 0 

AGGATTTAGA GTCGAAGATA TTTTGACAGA G AAAAAT AT T CAGGTTGTTA ATGAAAAATA 3300 

TAACTTTGCA AATGAAGATG ATTTATTCGC AGCTGTAGGA TTTGGCGGCG TGACATCCTT 3360 

ACAGATTGTT AATAAATTAA C T GAAAG AC A ACGTATTTTA GATAAACAAC GTGCTTTAAA 3420 

TGAAGCACAA GAAGTTACGA AATCATTGCC TAT T AAAGAC AACATCATTA CTGATAGTGG 34 8 0 

TGTCTATGTA GAAGGTTTAG AAAATGTACT TATCAAGTTG TCAAAATGTT GTAATCCTAT 354 0 
ACCaGGTGAT GATATTGTAG GTTATATCAC CAAAGGTCAC GGTATTAAAG TACATCGCAC 3600

TGATTGCCCA AATATTAAGA ACGmACTGA ACGACTAATT AATGTTGAAT GGGTAAAATC 3660 

AAAAG AC G C A AC T C AAAAAT ATCAGGTTGA TTTAGAGGTA AtGCGTATGA CCGAAATGGC 37 2 0 

TTGTTGAATG AAGTACTACA AGCTGTTAGC TCGACAGCCG GCAATTTAAT TAAAGTTTCA 37 8 0 

GGACGTTCAG ATATTGATAA AAATGCAATA ATAAATATTA GTGTCATGGT GAAAAACGTG 3840

AATGATGTTT ATCGTGTGGT AG AAAAG AT C AAACAACTTG GTGATGTTTA TACAGTAACA 3900 

AGAGTTTGGA ACTAGAGGTG CAAAATATGA AAGTAGTTGT AC AAAG AG T T AAAGAAGCAT 39 60 

CGGTGACGAA TGATACATTA AATAATCAAA TCAAAAAAGG ATATTGTTTA TTAGTCGGTA 4 02 0 

TCGGTCAGAA C T C T AC AG AG CAAGATGCAG ATGTAATTGC AAAGAAAATT GCTAATGCAA 4 080 

GATTATTTGA AG AT G AC AAT AATAAATTAA ACTTTAATAT CCAACAAATG AATGGTGAAA 414 0 

TACTATCAGT T T C AC AAT T T ACTCTCTATG CAGATGTAAA AAAAGGTAAC CGTCCAGGTT 4200 

TCTCAAATTC TAAAAATCCT GATCaAGCGG TLAAAATTTA TGAGTATTTT AATGcaTGCG 4 260 

CTACGAGCGT ATGGTCTTAC TGTGAAAACA GGTGAATTTG GAACACACAT GAAT G T TAG C 4 32 0 

ATAAATAATG ATGGTCCAGT CACTATTATT TAT G AAAG T C AGGACGGCAA AATTCAATGA 4 380 

AAAAAATAGA GGCATGGTTA TCTAAAAAGG GTCTTAAAAA TAAACGTACT CTAATAEAG 4440 

TGATTGCCTT TGTCTTATTT ATCATCTTTT TATTTTTATT GCTGAATAGC AATAGTGAAG 4500 

ATAGTGGGAA CAT C AC GAT A ACTGAAAATG C T GAAT T AC G TACAGGTCCA AACGCTGCGT 4 5 60 

ATCCAGTCAT ATATAAAGTT GAAAAAGGTG ACCATTTTAA AAAGATTGGT AAAGTAGGTA 4 620 

AATGGATTGA AGTTGAAGAT ACATCCAGTA AT GAAAAAGG TTGGATAGCT GGATGGCACA 4 680 

CAAATTTAGA TATTGTCGCG GAT AAT AC G A AG G A G AAAAA TCCTTTGCAA GGTAAAACAA 4740 

TAGTGCTTGA TCCTGGTCAT GGAGGTAGTG ACCAGGGTGC TTCAAGCAAT ACTAAATATA 4 8 00 

AAAG T T TAG A AAAAGATTAT ACGTTGAAAA CAGCAAAAfi. ATTGCAGCGT AC T T TAG AAA 4 860 

AAGAAGGCGC AACTGTTAAG ATGACAAGAA CAGACGATAC ATATGTTTCA CTAGAAAATC 4 92 0 

GTGATATCAA AGGCGATGCC TATTTGAGTA T AC AT AAT GA TGCGTTAGAA TCATCTAATG 4 98 0 

CAAAT GGAAT GACaGTTTAT TGGTATCATG AT AAT C AAAG AGCTTTAGCA GATACGTTAG 504 0 

ACGCTACGAT T C AG AAG AAA GGTCTACTTT CTAATCGCGG TTCAAGACAA GAAAATTATC 5100 

AAGTGTTAAG ACAAACAAAA GTTCCTGCTG TTTTATTAGA ATTAGGTTAT AT TAG T AACC 5160 

CAACTGATGA AACGATGATT AAAGATCAAT TACATAGACA AATTTTAGAA CAAGCAATTG 5220 

TTGATGGCCT TAAAATTTAT TTTTCTGCGT AGGGCTTGCA AAAATATGTG AAAGTAGTTA 52 8 0 

TCATTGATAT TGAATTTTAT AACTAAAACC GTTAGTATTC TTGAAATGGT AAATGAAATA 5340
GGTAGCAATC TAACTAAGAT TGTGTAGGAA TATAATCCAT AGACTGAAAG ATTATGCTGA 5400

GTAGTTTATA TACATTGAAC ACAAGAAGAG GTGCTTTATG AAAAGXAAG CCGTTAAACG 5460

TACGTTaAAC GTTTTGAGTG GGTTTATTAA ATGCACGCTT ATAAAAAGTA ATGATGATTA 5520

CAATTAGGCA TGTTTTTTAA ACCA 5544



(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1067 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

AAAAGATTGC AAATATAAAT GGCATGTTTA ATATGTTAGA ACAACAAATC ATTCATAGCC 60

AAGATATGGC TCATTTTAGA AGTGAATTTT TTTACGTCAA TCATGaGCAT CGAGAAAACT 120 

ATGAAgCACT CCTAATTTAT TACAAAAATA GTATCGACAA TCCTATTGTA GATGGTGCAT 180 

GTTATATTTT AGCCCTACCT GAAATTTTCA ATAGTGTTGA TGTTTTCGAA TCAGAGTTAC 24 0 

CATTTTCATG GGTATATGAT GAHYATGGCA TTACCGAAAC AATGAAATCA CTTAGCATTC 300 

CATTACAATA TTTAGTTGCA GCAGCTTTAG AAGTAACTGA TGTGAATATA TTTAAGCCTT 3 60 

CAGGATTTAC AATGGGAATG AATAATTGGA ATATTGCTCA AATGCGAATC TTTTGGCAAT 4 20 

ATACAGCAAT TAT T AG AAAA GAAGCACTAT AACATTAATA ATTAATTAG T AT AAAGAT G 480 

ATTCACAACA ATCATCTTTA TAGCTTTTTT ATGTCTAATT ATTTTTGAGG AAAATmACAA 54 0 

TCGTTATTCT ATGTTATTAT TAGTAAATTA AATGTAAATG ATTTTGTTAG GATGTGGAGG 6 00 

AATTTTATGT TTTCAAAAGT AAACAATCAA AAGATGTTAG AAGATTGCTT CTATATAAGA 6 60 

AAGAAAGTGT TTGTAGAAGA ACAAGGCGTC CCTGAGGAAA GTGAAATTGA TGAATATGAA 720 

TCTGAATCTA TTCACCTCAT TGGATATGAT AATGGACAGC CAGTTGCCAC TGCTCGAATA 780 

CGCCCTATTA ATGAAACAAC TGTCAAAATA GAACGAGTAG CTGTGATGAA ATCACATCGT 8 40 

GGACAAGGAA TGGGTAGAAT GCTTATGCAA GCTGTAGAAT CATTAGCTAA AGATGAAGGT 900 

TTTTACGTAG CTACTATGAA TGCCCAATGT CATGCTATCC CATTTTATGA AAGTTTAAAC 9 60 

TTTAAAATGA GAGGTAATAT ATTTCTTGAG GAAGGCATCG AGCATATTGA AATGACAAAA 1020

AAGTTAACCT CGCTTAATTA AAAAAAGTTG TATCTATTTT AGAAACA 10 67 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 18613 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

AAGACGtAtG ATAACAACAA TACgTGTAGT GAAAGATTTT AATCTACATA TTACTGACAA 60 

AGAATTCATT GTATTTGTTG GACCATCGGG ATGTGGTAAA TCAACAACAT TACGAATGGT 12 0 

TGCTGGACTA GAGTCTATCA CATCTGGAGA TTTTTATATT GATGGGGAAC GCATGAACGA 180 

TGTTGAACCA AAGAATAGAG ATATTGCGAT GGTATTTCAA AACTATGCAT TATATCCACA 24 0 

TATGACTGTT TTTGAAAATA TGGCATTTGG GCTAAAGCTA CGTAAAGTAA ATAAAAAAGA 300 

GATTGAACAA AAAGTTAATG AAGCAGCTGA AATATTAGGA TTAACTGAGT ATCTTGGTCG 3 60 

TAAACCAAAA GCGTTATCTG GCGGACAGCG TCAACGTGTT GCTTTGGGCA GAGCTATTGT 420

TAGGGATGCG AAAGTCTTTT TAATGGATGA AC CAT TAT CG AATCTTGATG CGAAyTtCGA 4 80 

GTACAAATGC GCACAGAAAT ATTGAAATTA CAT AAG C G AC TTAATACTAC GACAATTTAT 54 0 

GTTACACATG ATCAAACTGA AGCATTGACG ATGGCTAGTC GAATTGTTGT TTTGAAA&T 600 

GGCGACATTA TGCAAGTCGG CACACCTAGA GAAATATATG ATGCCCCTAA TTGCATATTT 660 

GTGGCGCAAT TTATCGGCTC ACCAGCAATG AATATGTTGA ATGCTACAGT TGAAATGGAC 72 0 

GGATTGAAGG TAGGAACACA CCATTTTAAA TTACATAATA AAAAATTTGA AAAGTTAAAA 780 

GCTGCTGGCT ACTTAGACAA GGAAATTATT TTAGGTATTC GAGCTGAAGA CAT T C AT GAA 840 

GAACCAATAT TTATTCAAAC TTCTCCAGAG ACACAATTTG AATCTGAAGT AGTTGTATCC 900 

GAACTGTTAG GTTCAGAAAT TATGGTACAT AGCACATTCC AAGGAATGGA ATTGATTTCT 960 

AAATTAGATT CAAGAACTCA AGTGATGGCG AACGACAAGA TTACACTAGC ATTTGATATG 1020

AATAAGTGTC ACTTTTTTGA TGAAAAAACA GGAAATCGTA TCGTCTAAGG GGGAGTATTC 108 0 

ATGTCTAAAA TTTTAAAATG TATCACGTTA GCCGTGGTAA TGTTATTAAT CGTAACTGCA 114 0 

TGTGGCCCTA ATCGTTCGAA AGAAGATATT GATAAAGCAT TGAATAAAGA TAATTCTAAA 1200

GACAAGCCTA ACCAACTTAC GATGTGGGTG GATGGCGACA AGCAAATGGC GTTTTATAAA 1260 

AAAATTACGG ATCAATATAC TAAAAAAACT GGCATCAAAG TAAAGCTTGT AAATATTGGT 1320 

CAAAATGATC AACTAGAAAA TATTTCGCTA GACGCTCCTG CAGGAAAAGG TCCAGATATC 138 0 

TTTTTCTTAG C AC AT G AT AA TACTGGAAGT GCCTATCTAC AAGGCTTAGC TGCTGAAATC 14 4 0 

AAAT TAT C AA AAGATGAGTT GAAAGGTTTC AATArGCAAG CACTTAAAGC GATGAATTAT 1500 

GACAATAAGC AACTAGCATT GCCAGCTATC GTTGAAACAA CCGCACTTTT T T AT AAT AAA 1560 

AAAT TAG T G A AAAATGCACC GCAAACGTTA GAAGAAGTTG AAGCTASGC TGCCAAACTA 162 0 
ACTGATAGTA AAAAGAAACA ATACGGTATG TTATTTGATG CTAAAAATTT CTATTTTAAT 1680

TATCCGTTTT TATTCGGCAA TGATGATTAT ATTTTCAAGA AAAATGGCAG TGAATATGAT 174 0 

ATTCATCAGC TAGGACTAAA TTCAAAACAT GTCGTCAAGA ATGCTGAACG ATTACAAAAA 1800 

TGGTACGACA AAGGGTATCT TCCTAAGGCA GCAACAC AT G ATGTCATGAT TGGTCTTTTT 18 60 

AAAGAAGGAA AAGTAGGACA ATTTGTCACT GGACCGTGGA ACATTAATGA ATATCAAGAA 192 0 

ACGTTTGGTA AAGATTTAGG AGTAACAACA TTACCTACAG ATGGTGGCAA ACCTATGAAA 198 0 

CCATTTCTAG GTGTACGTGG TTGGTATTT^ TCTGAATATA GTAAACATAA GTATTGGGCT 204 0 

AAAGATTTAA TGCTGTATAT CACTAGTAAA GAT AC AT T AC AAAAATATAC AGATGAAATG 2100 

AGCGAAATTA CTGGACGTGT TGACGTGAAA TCATCTAATC C AAAT T T AAA AGTGTTTGAA 2160 

AAGCAAGCAC GTCATGCTGA ACCGATGCCT AATATTCCTG AAATGCGACA AGTTGGGAA 2220 

CCGATGGGCA ATGCAAGCAT ATTTATTTCA AATGGTAAGA ATCCTAAACA AGCGTTAGAT 2 28 0 

GAGGCGACGA ATGATATAAC GCAAAATATT AAGATTCTTC AT C CAT C AC A AAATGATAAG 234 0 

AAAGGAGATT AGTTATGACG AAACGTAACC CTAAATTAGC GGCATTATTA TCTGTTATAC 2 400 

CTGGTTTGGG ACAGTTTTAT AATAAAAGAC CCATTAAAGG GACGATATTT TTTATCTTTT 2 4 60 

TCATCAGTTT TATTTCTGTT TTTTATAGCT TTTTAAATAT TGGTTTTTGG GGATTGTTCA 2 52 0 

CATTAGGGAC AGTACCTAAG TTAGACGATT CTCGTGTCTT ACTTGCACAA GGTATTATTT 2 58 0 

CTATCTTACT CGTTGCTTTC GCAATCATGC TATATACAT TAATATTTTA GATGCATATC 2 64 0 

GTAATGCTGA ACGATTTAAT CGCAATGAGG AAATAAAGGA TCCGAAGcGC GTATGGTGGC 2700 

AAcATGGGAC AAGACGTTCC CATACTTACT AATCTCACCA GGTACATTCT TATTGATATT 2 7 60 

TGTAGTTGTA TTTCCATTAA TAyyTATGTT TGGAGTAGCA TTTACAAATT ACAATTTATA 2820 

CAACGCGCCT C C GAG AC AC A CATTAGAATG GGTTGGTTTA GATAACTTTA AAACGTTATT 2 88 0 

CACAATTGGC GTTTGGCGTA AAACATTTTT CAGTGTTATT ACTTGGACAT TAGTATGGAC 2 94 0 

GCTTGTTGCA ACGACACTTC AAAT T GC AT T AGGGCTGTTT TTGGCAATTA TTGTAAATCA 3000 

CCCTGTCGTC AAAGGTAAGA AATTTATCCG TACTGTGTTA ATCCTACCTT GGGCTGTACC 3060 

ATCATTTGTG ACAATTTTAA TATTTGTAGC GTTATTTAAT GATGAATTTG GTGCGATAAA 3120 

TAATGATATT TTGCAACCTT TATTAGGTGT AGCACCAGCA TGGTTAAGTG ATCCGTTTTG 318 0 

GGCAAAAGTG GCATTAATCG GCATTCAAGT ATGGCTTGGA TTCCATTTG TCTTTGCACT 32 4 0 

GTTCACTGGA GTACTGCAAA GTATTTCATC AGATTGGTAC GAAGCAGCAG ATATGGATGG 3300 

TGCGTCTAGT TGGCAAAAGT TTAGAAACAT CACATTCCCG CATGTCATTT ACGCCACAGC 33 60 

GCCATTGTTA ATTATGCAAT ATGCAGGTAA TTTCAATAAT TTTAATCTTA TTTATCTATT 3420 

TAATAAAGGC GGTCCACCAG TGTCAGGGCA GAATGCTGGT AG T ACAG AT A TCTTGATATC 34 8 0 
TTGGGTGTAT AATCTGACAT TTGAGTTTAA CAACTTCAAC ATGGGTGCAG TTGTGTCATT 3540

AATTATTGGA TTTATTGTTG CTATTGTCGC ATTTATTCAA TTCAGACGTA CAAGTACGTT 3600 

T AAAG AT GAG GGAGGTTTAT AAGAT3ACAA AGAAGAAAAA CATATTAAAA GCAATCGGTA 3 6 60 

TTTACAGTTT TATAGCGATG ATGTTTGTCA TCATTTTATA TCCACTACTG TGGACATTTG 3720 

GCATTTCCCT TAATCCAGGT ACGAACTTGT ATGGTGCCAA AATGATACCA GACAATGCAA 37 8 0 

CATTTAAAAA TTATGCATTC TTACTATTCG ATGACAGTAG TCAATACCTG ACTTGGTATA 3840

AAAATACGCT TATCGTAGCA TCTGCAAATG CACTGTTTAG TGTGATATTT GTCACGTTAA 3900 

CAGCATATGC TTTTTCTAGA TATCGCTTTG TTGGTCGTAA ATACGGGCTG ATTACATTTT 3 9 60 

TGATTTTACA AATGTTCCCT GTATTAATGG CAATGGTCGC AATCTATATT TTGCTAAATA 4020 

CAATTGGATT ATTAGATTCT TTATTTGGAC TAACACTGGT ATATATTGGT GGATCAATAC 4 080 

CGATGAATGC CTTTTTAGTG AAAGGTTACT TCGATACGAT TCCAAAAGAA CTTGATGAAT 414 0 

CTGCCAAAAT TGATGGTGCA GGGCATATGC GTATTTTCTT ACAAAT T AT G CTTCCATTAG 4200 

CTAAGCCGAT TTTAGCAGTT GTTGCTTTGT TCAATTTTAT GGGGCCATTT ATGGACTTTA 42 60 

TATTACCTAA AATACTATTA AGAAGTCCTG AAAAATTCAC ATTAGCAGTT GGATTGTTCA 4 32 0 

ACTTTATTAA TGATAAGTAT GCAAATAATT TCACAGTGTT TGCAGCAGGG GCAATTATGA 4 38 0 

TTGCAGTACC TATAGCAATC GTATTCTTGT TCTTGCAACG CTATTTAGTA TCAGGTTAA 4440 

CAACAGGTGC GACAAAAGGT TAGTTTGAAA TTAGGAGTGG GGCAGAATTG ATAAAGAACC 4 5 00 

ACTAATGACG AT AAAG AT T A AAAGGAGGAC GTTATGATGA CGATTAAAGT TGGAATCATT 4 5 60 

GGGTGTGGTG GTATTGCGAA TGGCAAGCAC ATGCCAAGTT TACAAAAAGT TGAAAATGTT 4 620 

GAAATGATCG CATTTTGTGA CGTAGACATT TCGAAAGCAG CGAGTGCGGC AGAAGCATAC 4 68 0 

GGAACTGACA ATGCAAAGGT T TAT GAT GAT TACAAAGCAT TGTTAAAAGA TGACACGATT 4 74 0 

GATGTTATCC ATGTTTGTAC GCCAAATGAC TCGCATTGTG AAATTACTGT AGCAGGGTTG 4 8 00 

CATGCTGGTA AACATGTGAT GTGTGAAAAA CCAATGGCTA AAACGACAGC AGAAGCTCAA 4 8 60 

AAAATGATAG ATACAGCTAA ATCAACAGGT AAAAAATTAA CAATAGGTTA TCAAAATCGT 4 920 

TTCCGAGCAG ATAGTCAATT TTTACATCAA GCAGCGCAAC GTGGCGACTT AGGAGACATT 4 98 0 

TACTTCGGAA AGGCACATGC CATTCGTCGT CGAGCAGTAC CAACATGGGG TGTCTTTCTA 5040

GACGAAGAAG CTCAAGGTGG AGGACCATTA ATCGATATCG GTACACACGC TTTAGATTTA 5100 

ACGTTATGGA TGATGGATAA TTATGAACCA GAATCAGTGA TGGGTTCAAC AT TC CAT AAA 5160 

TTAAATAAAC AGCATCATGC GGCAAACGCT TGGGGTTCAT GGAATCCAGA TGAATTTACA 5220 

GTTGAAGATT CTGCGTTTGG ATTTATTAAA AT G AAGAAT G GAGCGACGAT CATTTTAGAA 5280 
TCCGCTTGGG CGATTAATTC TTTAGAAGTG GATGAGGCAA AATGTTCATT ATCAGGAACT 5340

AAAGCAGGTG CTGATATGAA AGATGGTCTA CGTATTCATG GTGAAGACAT GGGTACACTT 54 00 

TATACCAAAC ACGTTGAATT GGAAAACAAA GGCGTCGACT TTTATGAGG TAATGAAGTG 54 60 

GATGAAGCTG AAGAAGAAGC AAAAGCTTGG ATTGATGCAG TTGTAAATGA TACTGAACCA 5520 

GTTGTGAAAC CGGAACAAGC AATGGTAGTT AC AAAAAT T C TTGAAGCGAT TTATCAGTCT 558 0 

GCAAAATCAG GCAAAGCAAT TTACTTTGAA T AAC AT CAT A CGGTAAGGAG GCACATCATG 564 0 

AC AAAAT TAA AAGTTGGTGT GATAGGTg'tT GGTGGTATTG CACAAGACCG TCATATTCCA 5700 

GCATTGCTGA AACTCAAAGA CACAGTCTCA TTAGTTGCAG TACAAGATAT TAATACAGTG 57 60 

CAGATGATTG ATGTTGCGAA gCGCTTTAAT ATACCTCATG CAGTTGAGAC ACCTAGCGAG 58 20 

CTGTTTAAAC TTGTTGATGC GGTGGTCAU TGTACACCTA ATAAATTCCA TGCTGATCTT 5880 

TCTATAGAAG CATTGAACCA TGGTGTCCAT GTATTGTGTG AAAAGCCAAT GGCGATGACG 5 94 0 

ACGGAAGAGT GTGATCGCAT GATTGAAGCG GCTAATAAAA ATCACAAATT ATTAACTGTC 6000 

GCATATCATT ATCGTCACAC AGATGTGGCA ATTACTGCTA AAAAAGCAAT TGAB.CAGGT 60 60 

GTGGTTGGTA AACCTTTAGT AGCACGTGTA CAAGCGATGC GTAGGCGTAA AGTGCCTGGC 6120 

TGGGGTGTTT TTACCAATAA AGCGTTGCAA GGTGGCGGTA GTTTAATCGA TTATGGTTGC 6180 

CACTTGTTAG ACTTATCTTT GTGGCTACTA GGTAAAGATA TGGTGCCGCA TGAAGTGCTA 62 4 0 

GGAAAAACAT ATAATCAATT GAGCAAACAA CCGAATCAAA TTAATGATTG GGGAACATTT 6300 

GATCATACTA AATTTGATGT C GAT GAT CAT GTTACTAGTT ATATGACATT TGCCAATCGA 63 60 

GCAAGCATGC AGTTTGAATG TTCGTGGTCT GCAAATATCA AAGAAGATAA GGTTCACGTT 6420 

AGTTTATCAG GAGAAGATGG CGGTATCAAT TTATTECAT T T GAAAT AT A TGAGCCCCGC 64 8 0 

TTTGGAACTA TTTTTGAAAG CAAAGC TAAT GTTGAGCATA AC G AAG AC AT TGCTGGTGAG 65 4 0 

AGACAGGCGC GTAACTTTGT CAATGCGTGT TTAGGGATAG AAGAGATTGT GGTGAAACCG 6600 

GAAGAAGCAC GCAATGTAAA TGCCCTTATA GAAGCGATTT ATCGTAGCGA TCTTGATAAC 6660 

AAGAGCATAC AACTTTAATG ATTATCATAT AT GAT AC AAA ATTCTCAATA TAAAAAGAAG 67 2 0 

GAGTGCTTTT CAATGAAAAT AGGTGTATTT TCAGTATTAT TTTACGATAA AAATTTTGAA 67 80 

GATATGTTAG ATTATGTCTC AGAATCTGGA TTGGATATGA TTGAAGTTGG AACAGGTGGT 68 4 0 

AACCCAGGAG ATAAATTTTG TAAGTTAGAT GAGTTGTTAG AAAATGAAGA CAAGCGCCAA 6900 

GCATTTATGA AGTCAATCAC AGACAGAGGC TTACAAATAA GTGGTTTCAG TTGTCATAAC 6960 

AATCCAATTT CTCCAGATCC GATAGAAGCG AAAGAAGCCG ATGAAACGTT AC G T AAAAC A 7 02 0 

ATCCGTTTAG CAAATCTATT AGACGTGCCA GTTGTTAATA CATTTCTGG CATTGCAGGA 7 080 

TCAGATGATA CCGCTAAAAA GCCTAATTGG CCTGTTACAC CTTGGCCAAC AGCCTACTCT 714 0 






GAAATTTATG ATTATCAGTG GAATGAAAAG TTGATACCAT ATTGGCAAGA TTTAGCTGAG 7200 

TTTGCAAAAG AGCAAGATGT AAAAATTGCC ATAGAGTTGC ATGCAGGATT TTTAGTGCAT 7 2 60 

ACACCATATA CAATGTTGAA GTTACGTGAG GCTACAAATG AATATATCGG TGCTAACTTA 7 32 0 

GATCCTAGTC ATCTATGGTG GCAAGGTATT GACCCAATTG CTGCGATTCG CATATTAGGC 7380 

CAAGCAAATG CAATTCATCA CTTCCATGCT AAAGATACGT ATATTAATCA AGAAAAT GT A 7440 

AATATGTATG GTCTAACTGA TATGCAACCA TATGGTAACG TTGCGACAAG AGCATGGACA 7 500 

TTCCGTACAG TTGGTTATGG ACATAGTCCA TATGTATGGG CAGATATCAT AAGTCAACTT 7 5 60 

ATTATTAATG GATATGATTA TGTATTAAGT ATTGAACATG AAGATCCTAT TATGTCAGTA 7 620 

GAAGAAGGTT TCCAAAAAGC TTGTCAAACT TTGAAATCTG T T AAT AT T T ACGAC AAGCCA 7 68 0 

GCAGACATGT GGTGGGCATA AT AC GAAC T C GAGGTTAGTC TGAAGTTTGT CTGAAGTAAG 77 4 0 

ACTGGTGGCA GTGTTGAATA AATGCATATG TCGCCAAGCC ATTGCCAAAA AT T T C AC AC C 7800 

TTAAATCAAG TCATTGTTTG TAAAGAAGGT GTACTTTATA TAAGTATATA GCGATGGTCA 7 8 60 

TACCCATTCA CAGTAACAAT CCTCACCATT GAAAAGAGTA TATAACCTTT T C AAT AG T G A 7 92 0 

GGTATATGAT AATAAAAAAA GCCTGTTGTC ACAATGGTCA T AGAC AC G AC ATACTTTAAA 7 980 

GGTTTCTGAA TAT AAT AT T T C AG AAT GC AC TTTAAAGATG GACGTCGATG TAG AC T AAAG 8040 

TGATGACAGG CTTTCATCTT TTTAAATATT CATAATTTC TCTTCTTGTT TAATACGTAC 8100 

ATATAAGAAA TACGCATACG G T AC T AAT AA AATAGTTGTA TATGTTGCGT GTGTTAATAA 8160 

TAATACACCG ATTAATTCAG GAATGATGTT TAAGAAGTAA TTTGGGTGTT TTGTAATTTT 8220 

ATATAATCCA GATTTAATAA TAGGATGGTT AGGTAAAATG AATAATTTTA ATGTCCA&T 82 8 0 

AC C AC C T AAA GTTTTAATAA CCATAAATAA CATGATATAA GCAAAGATTA AT AT AAC T AA 8 34 0 

GCCAATACCA TTTGCAAAGC TAAATGTATC TTTATTAATA AATGCCTCTA CACCAGCCAA 8400 

TACATAAATT AAAACGTGTG TTATTGCTAA AAACTTCGAA TTTTTAACGC CATATTCAAC 8 4 60 

TGCACCGTCT GCTTTTAATT GTTTTGAGTG ATTAATAGAT ATCTTTAAGC TGACAAGTCT 8 52 0 

GATACAGAAA AAGATAAGTA ATATAGATAG AATCATGATG TCCTCCGTCA TTATGTCATA 8580

TGTATAAGCG TTGATTTTGA CAACATAAAG TATTTTATAG ATAAAGCTTG TCAAATACTA 8 64 0 

TTAACTATTT ATTAATTTTA GTACATAAAT ATGTTTCTAA GTATGTGTTT ATGTTCAGTA 8700

TTTTGGATAA TTTAATAATT T T AAGGAT AT TAAGCGCTTA CACCGACGTG ATATATTTGG 87 60 

CTTAACGAAA ATGATTGAGG T G AC AG AG AT GAACTTTTTT GATATCCATA AGATTCCGAA 8820 

CAAAGGCATT CCATTATCGG TACAACGTAA ATTATGGCTT AGAAACTTCA TGCAAGCTTT 8880

CTTCGTAGTG TTCTTTGTTT ATATGGCTAT GTATTTAATT CGAAACAAC T TTAAGGCGGC 8 94 0 
ACAACCGTTT TTAAAAGAGG AAATTGGATT ATCTACATTA GAACTTGGTT ATATCGGATT 9000

AGCATTTAGT ATCACGTACG GTTTAGGAAA AACATTACTT GGATATTTTG TCGATGGACG 90 60 

TAACACAAAA CGTATTATCT CGTTCTTACT TATCTTATCT GCGATTACAG TTTTAATTAT 9120 

GGGATTTGTT TTAAGTTACT TTGGTTCTGT AATGGGATTA TTAATTGTAC TTTGGGGACT 918 0 

TAACGGGGTG TTCCAATCAG TTGGTGGACC TGCAAGTTAT TCAACGATTT CAAGATGGGC 92 4 0 

GCCAAGAACG AAACGTGGCC GATACTTAGG ATTCTGGAAT ACATCAETA ATATCGGTGG 9300 

TGCCATAGCA GGTGGTGTTG CACTTTGGGG TGCTAATGTA TTCTTCCATG GAAATGTTAT 9 3 60 

AGGGATGTTC ATTTTCCCAT CGGTGATTGC ATTACTTATT GGTATCGCAA CATTATTTAT 9420 

CGGAAAAGAT GATCCGGAAG AATTAGGATG GAATCGTGCT GAAGAAATTT GGGAAGAGCC 94 80 

GGTCGATAAA GAAAATATTG ATTCTCAAGG TATGACGAAA TGGGAGATCT TTAAAAAATA 9540 

TATCCTGGGA AATCCTGTTA TATGGATTCT ATGTGTTTCA AACGTCTTTG TAT AC AT T G T 9600 

ACGAATCGGT ATTGATAACT GGGCACCGTT ATATGTGTCA GAGCATTTAC ACTTTAGTAA 9 6 60 

AGGCGATGCA GTTAATACGA TATTCTAC1T TGAAATTGGT GCATTAGTTG CAAGTTTATT 9720 

ATGGGGCTAC GTATCAGACT TATTAAAAGG TCGTCGTGCA ATTGTAGCTA TTGGCTGTAT 97 80 

GTTTATGATT ACATTTGTTG TCTTATTCTA CACAAATGCT ACAAGTGTCA TGATGGTTAA 98 4 0 

CATTTCATTG TTTGCATTAG GTGCGTTAAT CTTTGGTCCG CAATTATTAA TTGEGTATC 9900 

ATTGACTGGT TTTGTTCCTA AAAATGCCAT CAGTGTAGCA AACGGAATGA CAGGTTCATT 9 9 60 

CGCGTATCTA TTCGGTGACT CAATGGCGAA AGTTGGTTTG GCGGCTATTG CTGATCCAAC 10020 

ACGTAACGGT TTAAACATCT TTGGATATAC ATTAAGTGGA TGGACAGATG TTTTCATCGT 10080 

CTTCTATGTT GCATTATTCC TAGGCATGAT TCTATTAGGA ATCGTTGCTT TCTATGAAGA 1014 0 

AAAGAAAATT AGAAGTTTAA AAAT TTAATA TAAATCGGAT TAAAAGTATC GCCAATCTAT 10200 

TGCAATATAG TTGGCAATCC TGCCCCGACG GCATGTGCGT G AAGAGAT G A AAGATACTGC 102 60 

TTCTACCCTT GC AAAT AT AT CATCTCTATG TCTCGGGCA GATCATAATT CCCTGTTATG 10320 

AAGTATCCTT ATTTGCCCGA CTTAGGGTGA CTCAATGAAT TTACTCCTTA CAATAAAGAC 10380

ATATAGCGGT GTCAATATTG TAGGGAGTAT TGTTTTATAT TTAAACTCTC TAAAAAGCGG 10440 

ACT G AAAG AA AAGTGAAAAC TTCTCTATCA GTCCGCTTTT TCATAGAACA AAATGGAGGC 10500 

GCCATAATCA TTAGTTATGT GCTAATCTAT TTTGCTTGCT TACAATAATC ACTTGGCGAC 105 60 

ATTTGTAAAT ATTTTTTAAA ATGATAGCTA AACATTTTAT ACTCTGAAAA GCCTACTTTG 10620 

TCTGCAATTT CATAGTGTTT GTAATGTCGA TCTAACAATT GCAGAGATTG TAAAATACGA 10680 

TAGCGAT T T A AATAATCCAC AATTGTAATA CCAACATGAT CTTTAAATGT TCGCATCGCA 1074 0 

TACGATTCAC TAACATCGAT ATGTTGAATT AAATCTGAAA CAGtCACTTT CGTTTGATAA 108 00 
GATTGCTTAA TTTGATCCAC AATCTGGTTT ACATAATAAT CATCGTATTC TACTTTTAAT 10860

AGTGGTTGGA AGGCATCATG ACAAGATGCT AAGCTACGGC CGTCTGTGA TTGTTGCTCT 10 920 

AATAAGGTAC GGACAAGTCT TCCTAAAATA ACTTCTAATT GTGCATGGTC TACTGGTTTT 10980 

AATAAATAAT CAAGAACATG ATGTTGAATG CCGGCTTTCA TATATTCAAA GTCATCGTAA 11040 

CTCGATAATA TGATGACATT ACAATCTAGA TGCGCAATAT CAT T GAG T AA ATCGACGCCA 11100 

TTTTTACGTG GCATACGAAT ATCAGTAATT ACTAATTCTG GCTGATGTTG TTGAATTAGT 11160 

GATAATGCTT CAACACCATC TTTAGCAGTG TATATTGTAT TGAAATGATA GTCTCCCCAA 11220 

GGAATGATTT GCTTTAATCC TTCTCGAATA ATTCGTTCAT CAT C AC AAAT AACTACCTTA 11280 

AACATCTACA TTCCCCCTTG AAAGEGTAT T T T AT AAC AA ATTAACGTAC CTTGATTACG 11340 

CTTTGAAAAA ATATGGAGTC GTGCATGTGA AC CAT AT T G A ATCATTGCTT TATTGTGTAA 114 00 

ATGATTTAAT CCCAAATGCT TAGTATCAAA TACATCATTA TTAAGAGATT GGCGTACATA 114 60 

TTGCAGGCGA GATGACGACA TCCCGATACC ATTGTCGCAA ACTAAAACATGTAAATTCTG 11520 

ACGTGCCAAT GTCAGGCGTA TAGTAATGTC CAATGACTCA GTATCTCTAC CATGTTTAAT 11580 

AGCATTTTCT ATGAGTGGCT GAAGCATCAT TTTACCAATT GTCTGGTGAC GCGCTTCTTC 11640 

AGAACTTTCA ATATGGAGCT TAATCATGTC ATCAAAACGG aTGTTTTGTA TTGCAACATA 117 0 0 

CTGTTCAATG TAGTTCAACT CTTCGTTTAA TTCCACTGTA TGTGAGTTTG TACGTAATGA 11760 

GTAACGTAAC ATTTGCGATA ATTGTTGGAC CACAGTTtGT GCTAATTTCG GAGATAACGT 11820 

AATTAAATAT TGTATTGTTT GCATCGTATT GAATAGGAAA TGAGGCTGGA ATTGGCGTTC 1188 0 

TATTTCCTTT AAC TGAATAT CACGCAAGCG ACGTTCTGTA TGCTCGATAG AATGGATCAG 11940 

TTGCTCATTT GATTCAAATA AATCGTAAAT ATAATTATTA ATTTCTTCTA GTTCACTGTT 12000 

GTTTTTTAAA GGCGTATATG TACCTAGATG ACGATTTTTG GCATAGTAAA TTTTTTGAAT 120 60 

AATCGTTTCG ATATCTTTTG TTTGTCGTTT AGCCATATTA TCTGCGCTAA TGAAACC&A 12120 

TATTACTAGT AAAACAAGAA CTACGGCCAT AACAATTAAC AAC G T GAT AC CATCTTCAAT 12180 

GTTTTCATGT ATATCTTTAT AAATAATGAG ACGATGGTCA GCATGGTTTA AT T T T AC AG A 12240 

TTCATTCATA AATCCGAATT GTTGTGGTcT ATACTTTTCA CCTATAGTAA AACGGTCATC 12300 

GTTGGCGTAT AAAATATTGT CAT AT T GAT C AmCGATAAGT GCGAATTGTC GGTTATCTTT 12360 

CtTAATTTCA CTTAAACGTG GGGTGTtAGC CATATAAATt TTaAGCATAT ATGTACTATT 124 2 0 

TTTGAATTTA AGCTGATGCG TTGAAAATAA ATACATATTT TTAGTGTTTA AATGTTCATA 124 80 

ATTATTGGTT ATAAACTGAT TTGGTCCAGA TAATTCATAATAAAGTGTTG CGGGCTGTTG 1254 0 

GkGTATTAAT TTTAATAATT CACGTTTTGT AGCGGTCACA T C AT GAT GAT TTGyTAAATC 12 600 
GAGCTCTTGA AACGAATTAT TATGCTGTGT AATAAATGTC TGAATCTGCT TTTCAGTATG 12660

ATGTAAAGAT GACTGACTTT CATCAACATG TTGATGAATC GTACGATGCT CAATCCAAAT 12720

AT AG AT GGC A TAGAAGCTTA CTAGTCCAAT AATAATGACT AAAAATACTG GAAAAATAGT 12780 

AGACnCAAAT AACGATCGTC TTAATTGATG TCTATAAGGT TTGTATGCCn TCATTGAATC 128 4 0 

ATCTCCAAAA ATTTATGATG TGGAATATCC GGTAATTTAG ATTTCGGTAT TAAAGGTATG 12 900 

TTCTTAAGAT TTTCGATAGA CTGATCGCTT TGTTCACTAA CATCCTTTCG AATTGACTTG 12960 

GCATCGAACT CTGCAACTAA TCGTtGTTGT ACTGAGCGGC TTGTTAAATA TTGCACTAAC 13020 

TTTTTACGCT TAGGATGAGG GTGTGCATTT TTAACTAAAG CAATrCCATC AACATTTAAC 1308 0 

ATTGTTCCTT CAATTGGATA AACGATTGAT ACAGGATAAC CTTTGTTTT CCATGTGCGT 13140 

GCATCTTGTT CGTAGCTTAG ACCTGCGTAA TATTTACCTT TTGCAACATC TTCAATGACT 13200 

TTAGACGTCT TTGACAGTTG CATCGCATGG TTTTGGAATT GATGCACATC ACTTACTCGA 13260 

TGATGCATGC TATAAATAGC ACGCATATGT TGATAGCCTG TCGTTGTTGT ATTTGGATTT 13320 

GAGTACGCAA TTTTACCTTT AAGTATAGGT TGTAATAAAT CTTGATAACC TCGAATCTTA 13380 

ATATCTCCTT GTAAATCTGA ATTCACTACT ATAACTGTTG GCATTAATAG AAAACTAGTA 13440 

ACATATTTAT TGTTCGAGCG ATAATCCTCT AATTGCTGTG TTACAGATGT ATCTTGATAG 13500 

GGAACAAAAT CTTCTGGATG ATCAATTGTT TCTGACAACA CACCACCCAT AAAG AC AT C A 135 60 

CCACGCTCCG AAAAATCTTC GTTATGCAAG TTTGAAAGCA GTACTTGAGT AGATCCGTGT 13 620 

TTAATTTCAA TTTTGACATG CTCTTGTTTT TCAAATTCAT TTAAAATTGG ACGAATCAAG 13 680 

TTTGATTGAT ACGGAGAATA AACTGTTAAT ACATTTTTAT CGGATTCAGA GTG&GCGTA 1374 0 

TTAGCGCATG CTGaTAAAAA AATGAGAAAT AATAGCAAGA TATAAATTTT TGATTTCATG 138 00 

ATATCCCATC AATTCTATGT ATATTTTAAT ACAATAATTT TAGCAATAAA TGACGCATAA 138 60 

GTAATGTTAA ATATTTAGAA ATGTTTATAG ATGACTTGTT AAGACGTTGC AAATGTTGTG 13 920 

ATAGCACAAA ATTTTTGTTT GTCAAGACGA TTTACCGAGG CTGTAAAATC AAACTGTTAT 13980 

ATTTTATTTG TAGCTGTTAT ATAAAAATCG GCAAGATATT GAACGGTTCA AAAGTGAATT 14 040 

TTTACGTCAA TAAAAGTATT TAATCCAGTC TCTTCATATA TAAAAGTAAA TCTTTCTAAG 14100 

TGTTGATTTA ACGCTTATCA ACAATCATTT TTTAT.&ACA AATATATACT CCTAAATTAA 14160 

CTTTTAAAGC AAT G AAAAT A GTGAACATTA TAACTGTTGT GTAACAGAAT GCAATTAGCA 14 220 

TATTACTGTT ACACAAATTA GTACAGTTTC TATGTTTTGA CATACATTTG ATGAAAATTG 14 28 0 

TACATAATTT ATGTGAAAAA AATCACAACA AACATGCTAC AATGACTATG AAAACGTTAA 14 340 

CATAGCATTT CAAATTCACA ACATTATACA GATGGAGGCG TTTAGTATGT TAGAAACAAA 14 4 00 

TaAAAATCAT GCAACAGCTT GGCAAGGATT TAAAAATGGA AG AT GG AAC A G AC AC G TAG A 144 60 
TGTAAGAGAG TTTATCCAAT TAAACTACAC TCTTTATGAA GGTAATGATT CATTTTTAGC 14520

AGGACCAACA GAAGCAACTT CTAAACTTTG GGAACAAGTA ATGCAGTTAT CGAAAGAAGA 14580 

ACGTGAACGT GGCGGCATGT GGGATATGGA CACGAAAGTA GCTTCAACAA TCACATCTCA 14 640 

TGATGCTGGT TATTTAGACA AAGAT T TAG A AACAATTGTA GGTGTACAAA CTGAAAAGCC 14 700 

ATTCAAACGT TCAATGCAAC CATTCGGTGG TATTCGTATG GCfiAAgcAG CTTGTGAAGC 147 60 

TTACGGTTAC GAATTAGACG AAGAAACTGA AAAAATCTTT AC AG AT TAT C G T AAAAC AC A 14 820 

TAACCAAGGT GTATTCGATG CATATTCTAG AGAAATGTTG AACTGCCGTA AAGCAGGTGT 14 88 0 

AATCACTGGT TTACCTGATG CATACGGACG TGGACGTATT ATCGGTGACT ATCGTCGTGT 14 940 

AGCTTTATAT GGTGTAGATT TCTTAATGGA AGAAAAAATG CACGACTTCA ACACGATGTC 15000 

TACAGAAATG T C AG AAGAT G TAATTCGTTT ACGTGaAGAA TTATCAGAAC AATATCGTGC 150 60 

ATTAAAAGAA TTAAAAGAAC TTGGACAAAA ATATGGTTTC GATTTAAGCC GTCCAGCAGA 1512 0 

AAACTTCAAA GAAGCAGTTC AATGGTTATA CTTAGCATAC CTTGCTGCAA T T AAAGAAC A 15180 

AAACGGTGCA GCAATGAGTT TAGGTCGTAC ATCAACATTC T TAG AT AT C T ATGCTGAACG 152 4 0 

TGACCTTAAA GCAGGCGTTA TTACTGAAAG CGAAGTTCAA GAAATTATTG ACCACTTCAT 15300 

CATGAAATTA CGTATTGTTA AATTTGCTCG TACACCTGAT TACAATGAATTATTCTCTGG 15360 

AGACCCAACT TGGGTAACTG AATCTATCGG TGGTGTAGGT ATTGACGGAC GTCCACTTGT 154 2 0 

TACGAAAAAC TCATTCCGTT TCTTACACTC ATTAGATAAC TTAGGTCCAG CTCCAGAACC 154 80 

AAACTTAACA GTATTATGGT CAGTACGTTT ACCTGACAAC T T C AAAAC AT ACTGTGCAAA 1554 0 

AATGAGTATT AAAACAAGTT CTATCCAATA TGAAAATGAT GACATTATGC GTGAAAGCTA 15 600 

TGGCGATGAC TATGGTATCG CATGTTGTGT ATCAGCGATG ACAATTGGTA AACAAATGCA 15 660 

ATTCTTCGGT GCACGTGCGA ACTTAGCTAA AACATTACTT TACGCTATCA ATGGTGGTAA 15720 

AGATGAAAAA TCTGGTGCAC AAGTTGGTCC AffiCTTCGAA GGTATTAACA GCGAAGTATT 157 8 0 

AG AAT AT G AC GAAgTATTCA AGAAATTTGA TCAAATGATG GATTGGCTAG CAGGTGTTTA 158 4 0 

CATTAACTCA TTAAATGTTA T T C AC T AC AT GCACGATAAA TACAGCTATG AACGTATTGA 15900 

AATGGCATTA CAT GAT AC AG AAATTGTACG TACAATGGCA ACAGGTATCG CTGGTTTAC 15960 

AGTAGCAGCT G AC T CAT TAT CTGCAATTAA ATATGCACAA GTTAAACCAA TTCGTAACGA 16 020 

AGAAGGTCTT GTAGTAGACT TTGAAATCGA AGGCGACTTC CCTAAATACG GTAACAATGA 1608 0 

CGACCGTGTA GATGATATTG CAGTTGATTT AGTAGAACGC TTCATGACTA AATTACGTAG 1614 0 

TCATAAAACA TATGGTGATT CAGAACATAC AATGAGTGTA TTAACAATTA CTTCAAACGT 16200 

TGTATACGGT AAG AAAAC T G G T AAC AC AC C AGACGGACGT AAAGCTGGCG AACCATTTGC 162 60 
TCCAGGTGCA AACCCAATGC ATGGCCGTGA CCAAAAAGGT GCATTATCTT CATTAAGTTC 16320

TGTAGCTAAG ATCCCTTACG ATTGCTGTAA AGATGGTATT TCAAATACAT TCAGTATCGT 16 380 

AC C AAAAT C A TTAGGTAAAG AACCAGAAGA TCAAAACCGT AACTTAACTA GTATGTTAGA 16440 

TGGTTACGCA ATGCAATGTG GTCACCACTT AAATATTAAC GTATTTAACC GTGAAACATT 16500 

AATAGATGCA ATGGAACATC CAGAAGAATA TCCACAGTTA ACAATCCGTG TATCTGGTTA 16560

CGCTGTTAAC T T CAT T AAAT TAACACGTGA ACAACAATTA GATGTAATTT CTCGTACATT 16620 

CCATGAAAGT ATGTAACAAA ATTTAAGGTG GGAGCACTAT GCTTAAGGGA CACTTACATT 16680 

CTGTCGAAAG TTTAGGTACT GTCGATGGAC CGGGATTAAG ATATATATTA TTTACACAAG 1674 0 

GATGCTTACT TAGATGCTTG TATTGCCACA AT CC AG AT AC TTGGAAAATT AGTGAGCCAT 16800 

CAAGAGAAGT CACAGTTGAT GAAATGGTGA AT G AAAT AT T AC CAT AC AAA CCATACTTTG 168 60 

ATGCATCGGG TGGCGGTGTA ACAGTCAGTG GTGGCGAACC ATTGTTACAA ATGCCATTCT 16920 

TAGAAAAATT ATTTGCAGAA TTAAAAGAAA ATGGTGTGCA CACTTGTTA GACACATCGG 16 980 

CTGGATGTGC TAATGATACA AAAGCATTTC AAAGGCATTT TGAAGAATTA CAAAAACATA 1704 0 

CAGACTTGAT ATTATTAGAT ATAAAACATA TTGATAATGA CAAACATATT AGATTGACAG 17100 

GAAAGCCTAA TACACACATC CTTAACTTCG CGCGCAAACT GTCAGATATG AAACAACCTG 17160 

TATGGATTCG ACATGTCCTT GTGCCTGGTT ATTCTGATGA T AAAGAC GAT TTAATTAAAC 17 220 

TAGGGGAATT TATTAATTCT CTTGATAACG TCGAAAAGTT TGAAATTCTG CCATATCATC 17280 

AGTTAGGTGT TCATAAGTGG AAAACATTGG GCATTGCATA TGAATTAGAA GATGTCGAAG 17 340 

CGCCCGATGA TGAAGCTGTT AAAGCAGCCT ACCGTTATGT TAACTTCAAA GGGAAAATTC 174 00 

CCGTTGAATT AT AAAT AC AA T T C AG AC C G A AAAGAAAGCA TATGCAACTT CAAGAGTGAA 17 4 60 

GGGGCATATG CTTCTTTTTC AATTGAGTAT TGAGTATTAG CAAGACGTAG TAAGTATATG 17 52 0 

AGACAACTTC TACAATGGTT GAAGGAAGAC GTTTTTGTAA GTAGCTATGC TGAHAAGAA 17580

TGTGATGTCT TGTTAAAGGT GGGGTTCCAA TATCATCATT TAGCTGATGT TGAATGGGTT 17 64 0 

ATTATTTGCT ACTTGCATAT GAATATGAGT CTTTTCAAAT TTTTATTGAC CCTGAGTAAT 17700 

GAAAAAT AT T AAGATGAAAC TTAATAT TAA AgCAATGCGG AGCGTGATTA TGAAGAGAAT 177 60 

TAGTAAAGAT ATATGGGCAG TATTTAAATT ACTGTATCaA AATAAAGGGC GTTTTAGCAT 17 82 0 

TAATGCCTTA CTATTGCAGT T AAT CAT GAT TTTTATTAGT AG T AC AT ACT TAATTTTACT 178 80 

ATTTAATATG ATGTTAAAAG TAGCTGGcAA AGCCAACTTA CGATTAACAA TTGGACGGAA 17 94 0 

ATCGTTAGTC ATCCCGCCAG TGTGATACTT CTTATATAT TCATATTAAG TGTTGCCTTT 18 000 

CTGATTTATG TAGAGTTTTC ATTGTTAGTT TATATGGTTT ATGCCGGCTT T GAT C G AC AG 18060 

ATTATTACAT TTAAATCCAT TTTTAAAAAT GCCTTTGTAA ATGTGCGTAA ACTCATAGGT 18120 
GTACCAGTTA TTTTCTTTGT CATTTATTTA ATGTTAATGA TACCCATTGC CAACCTAGGA 18180

CTAAGTTCAG TAT T AAC AAA AAATATTTAC ATACCTAAAT TTTTAACGGA AGAACTTATG 18240 

AAAACGACGA AAGGTATAAT CATTTACGGT ACCTTTATGA TTGCTGTATT TATATTAAAT 18300 

TTTAAATTAA TATTTACTCT ACCGTTAACG ATTTTAAACC GCCAGTCGTT ATTTAAAAAT 18360 

ATGAGACTAA GTTGGCAAAT TACGAAGCGA AATAAGTTTC GGCTTGTTAT AGAAATAGTT 18 4 20 

AT AT TAG AAC TCATCATTGG TGCGATTTTA ACATTAATTA TTTCAGGAGC AACATATCTT 184 80 

GCTATTTGTG TAGATGAAGA AG G AG AT AAG TTTTTAGTCT CATCAATTTT ATTTGTTGTA 1854 0 

TTGAAAAGCG CATTGTTCTT CTATTATkTA TTt ACGAAAT TAtCATTAAT CAGTGTGTTA 18 600 

GTACTGCACT TAA 18613
(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1214 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

AAAGTTTTAA AAGGGGTGAG ATACTTGGCG AATAATCCAT TCCAGCTTTG CGTTTAAAAG 60 

GAATTATACT TGCCATTGTC GGTGCTTGTT TATGGGGATT AGGTGGTACT GTTTCTGATT 120 

TCTTGTTCAA ATATAAGAAT ATTAATGTCG ATTGGTACGT CACTGCTCGA CTTGTAGTCA 180 

GTGGTGTTTT CTTACTTATT ATGTACAAAA TGATGCAACC CAAACGTTCA ATATTTAGCG 240 

TATTCCAAGA TCGACGTATG TTAGGCAAAT TACTTATCTT CAGTATACTG GGCATGTTAG 300 

TAGTACAATA TGCTTATATG GCATCTATTA ATACAGGTAA TGCTGCGATT GC AAC ATT AC 3 60 

TACAATACAT TGCGCCAGTT TATATTATTA TTTGGTTTGT CATAAGAGGC GTTGCAAAAC 420 

T AAC AT TAT T TGATGTGCTT GCTATTATCA TGACACTATT AGGAACATTT TTATTATTAA 480 

CAAATGGTTC ATTTTCTAAT TTAGTCGTCA ATCCTGCAAG TTTATT.TGG GGTATTTTAG 54 0 

CTGGTGTAGC ACTCGCTTTT TACACAATTT ATCCTTCAGA CCTACTTAAC CGCTTCGGTT 600 

CGATTCTAAT TGTCGGGTGG GCAATGCTTA TTTCTGGTGT TGCGATGAAT TTACGCCATC 660 

CAATTTGGCA CATTGATATC ACTAAATGGG AC AT AT C AAT TATATTATTT TTAATCTTTG 7 20 

GTATTATCGG TGGTACCGCA CTCGCATTTT ATTTCTTTAT CGACAGTTTA CAATACATAT 780 

CAGCGAAAGA AACAACATTA TTCGGAACTG TTGAACCTGT CGTAGCCGTT ATCGCAAGCA 84 0 

GTCTATGGTT ACATGTGGCA TTCAAACCAT TTCAAATCGT AGGCATCATT CTTATTATGA 900 
TTTTAATTTT ATTACTATCA CTTAAAAGAC AACCTGAAAC ATTAGATGAA TAAGAAAACT 960

CTGATAATCA CTTTAGCAAG TAACTATTAT TTAACAACGT AGTTACCTTA TAGGTGATAT 1020 

CAGAGTTTTT TATTTTAGTT AATAATATTT TTCACTTGGT ATAAAAAaGC GTCGTCGCTC 1080 

TGGTAATCGG AAATACTGGA ATAAAATATG GAATTGGGTA ATAATCCCAG GTATAAAAG 114 0 

TCCATGTTCC GATAnCCTnT CCGCAnCTCC AACCAAATTT GCCGATAAGG TTCCAAAAGG 1200 

CATCCTGGGG GTAC 1214 



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9458 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

ATTTTGGTTT CATTCACGAT GGGGTnATAC AGCAAACACA nCTAAAATAA CTATCAATAG 60 

TAGCGTTGAA ATGCGTAACT TGAAATTCTT CTTCATATAG ACAAAaCCTC ATTTTTCTTA 120 

CTTAGACAAT AAAAAATATG CCACTACAAT CGCTAATATT ACGATTAAAA AAGAAGCGTT 18 0 

AACGATTACT TTCATCGTTG TTCTATCTCT GAAC AT CAT A TTAAAGACAA C TAG AC T AAT 24 0 

TGATAATGAA ACAGCAAAAA AAGTAATAGC TAACACTAAT TTCATCATAA ATAGACAGAC 300 

TAAACCTATG ACTAATAATG TAT TAG AAA T TACAGCTGAC GTTTTTAACA TTCTCGaATT 360 

AATATGCACT CACCCTTTTT ATTTAAATAA CTTACATAAT CATAATAATA CATGATGTTT 420 

CATAGGCCTG TCGAT GATTG ATTCACAATA GCACGTGATT TTTTTGTTTT TCAATATAT 4 80 

TCATTTATTC CATCAAAAAC ACCCTTTTTA AT T T T T AC AA AAATTAAAAA AAGTGCTCCT 54 0 

ACACTGCTTG CATGTAGAAA CACTTTTTCA TTGTAATGTT ATTCTTCTCG AGACATACCT 60 0 

TTTAGCATAT TAAGCATGTA TGTTAAACTA CGGTTCATGT CGTCATCTTT CAATACGCCC 660 

AAT AG AC T T C TTATAGTTGT CTTAGCATTT GGACTCGCTT GATTGGCAAC GTGTAATCCT 72 0 

TTATTAACTT TATTTAGGAA GTCGCTTAAA TCTGATACAT TGAGTTCACC TAATAAAAAT 78 0 

ACCATTGAAG CCATATTAGA TAATAGCCCT GTATAAATAT CTTTATTAAG TTCAACTGCA 84 0 

AATTTATTTA TGATGACTTG ACGTCCTCGA ATTGCACST TTAAAGCATC TAATAGTTTT 900 

GCATCATCTA ATGTTTTAAT AAGCTTGATT GCTTTTAATA TACTATCTTT ATTCGCTGCA 960 

ATTGCCTCTG TAACTTCATT TAAACTTTCT AACTTAATTT GTTCTTCTGA TTTTTCTAAG 102 0 

CGTCTAATTT TAGAAGATAT TCTCTCAGCC AT TAT T TAT C CACCTGATTT CCCGGGAAAA 1080 

CATAATCTGA ACGTTCCCAT TTTTTCTGTA C T T GAAC AC T GTACTGCGGT TGACGTTTTT 114 0 
TATTGACACG GAAATTATTA GGGTTCAACG GTGACTTACC ACGTTTCGTA ATTACCTCCA 1200

AAC G AC AG C T AGTACGTTTA TAAGATGGTG TATCCGTGTA TTGATCAACA TC AC T a T TAG 12 60 

TTAATAAGTT AATTGCACCT AGATCTCCAT TTTCCATCGC aTCaTTATTT AATGGAATAT 132 0 

AGATTTCTTT ACCTTTAACA CGATCTGTCA CGTGAACTTG TAATACCGCT TCTCCTGTyT 1380 

C AG AAAT C AG CTTAACTTCT GCACCTTCAT GAATGCCTCT ATCTTCAGCA AGCTCTGGAG 14 4 0 

AAATTTCAAC AAATGCACGT GGCACTTTGT ATTTAATCAT TGGTCBTTGA TAAGTCATAT 1500 

TACCTTCATG GAAGTGCTCT AACAATCGAC CATTGTTTAC ATGAATATCA TAAATTTCAT 15 60 

CTTGCTTAAA GTAATTATCA AATGATAATG GGAATAATTT TGCTTTACCA TTATCAAAAT 162 0 

TGAATCCTTC TAAGTATAGA ATAGGCTCAT CAGTACCATC AGGTTGTACT GGCCATTGTA 1680 

AACTATTGAA TCCTTCTAAA C GAT CAT AAC TTACCCCAGC ATATAGAGGT GTTAAGCGTG 174 0 

CTACTTCATC CATAATTTCA CTAGGATGCT TGTAATTCCA ATCAAATCCT AAT C TAT TAG 18 00 

CAATTGCTTG GAAAATTTTC CAGTCAGGTT TTkAATCACC AAGAGGTTCT AATGCTTGGT 18 60 

ATAAACGTTG AATACGACGT TCGGTATTTG TAAAAGTACC GTCTTTTTCA AGTGAAGGAC 1920 

TTGCTGGCAA TACAACATCT GCGTATGTTG CTGTGAATGT T AAAAAT T C A TCTTGGACTA 1980 

C CAT G AAAT C TAATTTTTCA AACGCAGCTT GTACAAAATT AATATTTGAA TCCACAATAC 204 0 

CCGTATCTTC AC C AT AT AAG T AC AAT GAG T GTACTTCTCC G T C AT GT AT A OTTCTACCA 2100 

TTTCATGATT ATCTTTACCA GCTTTTGGAT TCAATTTAAC GCCATATTCT TTTTCAAATT 2160 

TAGCGCGAAT ATCATCCGCT TCAATACTTT GATAACCAGT AATCTTATCA GGCATACTTC 2220 

CCATATCACT ACATCCTTGA ACATTATTAT GTCCACGTAA TGGATACGCA CCAGTACCAG 2280 

G AC G AC G ATA ATTACCTGTT ACTAATAATA AGTTTGAAAT CGCTGTACTT GAG T C AC T AC 2 34 0 

CAATGTCTTG TTGTGTAATA CCCATTGCCC AAC AAAT T AC AAC AG AT T C A GCTTTAGCAC 2 4 00 

ATTCTTCAGC AAATTTAATC AATTCTGATT CAGGAATACC TGTTGCTTCT T C AG C AAAAG 2 4 60 

CCATTGTAAA TGTTTCTAAT GATTTGTAAT ATTCATCAAA ATCATCTACC CACTCATCAA 2520 

TAAATGCTTT ATCGTGTAAA T CAT GAT C AA T AAT AT ACT T AGTCACTGCA CTTAACCACG 2 58 0 

CTAAATCCGT ACCTGGTTTA GGTTGATAAA AACGATCCGC ACGTTCTGCC ATTTCATGTT 2 64 0 

TTCTAATATC AAATACATGT ATTTTTTGAC CAAATAATTT TTGTGCACGT TTCATGCGTG 27 00 

ATGCGATAAC TGGATGAGCT TCGGCTGTAT TAGTACCTAT CAATACAGAC ATTGCCGCTT 2 7 60 

TTTCTAAATC TTCAATACTA CCTGAGTCAC CGCCGTGTCC AACCGTTCTA AATAAGCCTT 2820

TTGTTGCAGG TGCTTGGCAA TATCTTGAAC AGTTATCAAC GTTATTTGTG C C AAT AAC T T 2 88 0 

GTCTTGCTAA TTTTTGCATT AAATACGATT CTTCATTCGT CGCTTTAGAA G AAGAAAT G A 2 94 0 
ATGATAGTGC ATCTGGGCCA TGCTTTTCTT TAATAGCTGT AAAATTATCT GCAATGACGT 3000

TTAAAGCTTC ATCCCATTCT ACTTCATGGA ACTCACCATT TTTCCTTACT AGTGGTTTAG 30 60 

TTAATCGTTG ATCTGAATTA ATATGTCCCC ATGAAAACTT ECTTTAACA CAAGTCGCAA 3120 

TTTTATTTGC TGGAGAATCA TGTGATGGTT GTACTTTTAA AATTTCTCTA TCTTTAGTCC 3180 

AAACTTCAAA TGAACAACCC ACACCACAAT AAGTACACAC TGTTTTAGTT TTCTTAATAC 32 4 0 

GCTCTTTACG CATTTCTGCT TCTGAATCTG AGATTGCAAA TAGTGGACCA TAACCAGGTT 3300

CTGCTTTTTT AGTTAAATCA ATCATTGCTG CTAATGAACC AGGTTCCGTA TCAGTCATAT 33 60 

AACCCGCATT ACCTTCCATA TTCACTTCCA TCATGGCATT ACATGGACAT ACCGTCGCAC 3420 

ATTGACCACA AGATACACAT GAAGACTCAT TAATCGGTAC ATCATTATCC CAAATAACAC 34 80 

GTGGATGTTC ACGATCCCAA TCSVTTCTAA TAGTTTCATT CACTTCGATA TCTTGACATG 3540 

CTTCTACACA ACGCCCACAT AAG AT AC AT T GATTTGGATC ATAACGATAA AATGGGCCGT 3600 

AATCTTTTTC GTATGGCTTC TCTTTATATT CATACGTTTG ATGCTGAAGC CCCCATGCAT 36 60 

CCATCGTATT ATGAATTTCA CAATCACCAT TATTATAATC ACATACTGffl CAATACAGCA 37 2 0 

TATGCTTTTC TAAAATTCGA TCAAGCGCTT CTTTTTGAGC ATCTTTCACA TCATTGTTCA 37 80 

CAGTATTTAC AGTCATTGGA CGATCAATCA CCGTACTACA TGAACGTTCA ATTTTACCGT 38 4 0 

CAATCTCAAC AG T AC AT G T A TCACATGTTT GAATTGGTCC CATCGACTCG T T AT AAC AAA 3900 

TTGAAGGTAC AAAAGTATCT TGTGATTTAA TAAATTCAAG TAAATTCGTA CCTGGTTCTA 3 9 60 

CAAGATAATC TTTTCCATCA AGTGTAACCA CCAAATGTTC TTGCATATTA CTCACCCCGT 4 02 0 

C TAT AT AT AT TTTCCGTAAA TGACTTTTAA TAAATTGCTC ATATCCACCT AAAATAACGA 4 08 0 

TGCCCCACAC ATCTTTCAGA TAGAATTAAT TTAATTGTAT TACTTTATGT ACTAGTTGTT 4 14 0 

AAGTAAAATT TTGTATTTTG CCTTTTTACA ATCATTTTTA TTTGAAATAT TTTGCGCGAA 4200 

ATTAAATCAT CTTTTTGTTT AATTGAAAAT AATTATCATT ATTAGTTTTC CAATTATCTG 4 2 60 

TTTCACGCTT TTTGCCATAT CTTTCACAAC CTTATTAATG ACAATATTTA ATAATSCCT 4 32 0 

C AC C T AAAAA TCGTTATACT ATTTATAAAT ACCCTTTTTC TGAAAATTAA TAACCCAAGT 4380 

TTGATAAATA TCTACTATCA TTTAGAAGGT AATATTTATC TTTAAATTAA ATTTGTAATG 4440 

GATTAATTTA TAAAAATCAA ATCAGGCATT AAATAAAATA GCCCATAAAT ACAAAGTGTT 4 500 

ATCACCTTCT ATTTACGGGC TATTAGTTCT ATTCGTTATT CTATTTACAG ATCATTCTAT 4 5 60 

CTAATTAATT TGTGTACAAT T T T GAT AAC T TATTTTCCCT TAGTTTACTA CTCTAGATTA 4 62 0 

TCTTTTAATA ACTTAGTACT TTCAGCTTTT GACTGCTCAC TAGGAATGAA G TAG T AC AAT 4 68 0 

CCGTCACTTT GAATGCCGCC TTGACCACTC AATTGATCT TATTAATCGT GTCATTAGCA 4 74 0 

TCTTTATAAT TGCTTCTAAT CGTATTCAAA TCACCTAATG TTAAATCTGT TTTAACATTA 4800 
TTTTGAATTT CATTCATTAG ACTATTAAAA TGTGTAATCG ATGATGGGCT TGCAATCTTA 4860

TTGGCCATCG CTTCAAGCAC AATTTGCTGA CGTTGTTGTC GACCAAAGTC ACCACCAGCA 4 920 

CCTTCTTCTT TACGACTTCT AATAAACTTC AATGCTTGAT CACCATTTAC ATGTGTCTGC 4 98 0 

TGTCCTTTTG TAAAACGAAC ACCATCAACA GTGAATGTAT CATTACTTAC TACATCAACA 5 04 0 

CCGCCGATGC TATCTATCAT ATTATGCAAA CCATCCATAT CGATTGTCGC ATAATGATCA 5100 

ATTGGCACAT TCATTAATTT TTCAAGTGAT TTAACAGCCA TATTTGGTCC ACCATATGCA 5160 

TAGGCATGTG CAATTTTTTC AGTAGTACCA CGGCCAACAA TTTCCGCTCT TGTATCACGC 5220 

GGTATACTTA CTATTTCAGT TTTCTTCGTT TTAGGGTTGA TAGATAAAAT CATAATACTA 5280 

tCACTACGCT CTCCGCCACC CTTTTTCTTA C GAT C AG CAT CTGAffiCGAC ACCAAATAAA 534 0 

GCGATTGTGA ATGGATCACC ATCGTTTAAA CTCACTTTTT TATCTCTTAA TTCTGAATGA 5 4 00 

TTGCGATCTA ACGGATTGTG TATCTTATTA CCAGTAATAA AAATTTTAGC AGCTACATAC 5 4 60 

AC AAC AG C T A CAATCGCTAA GATTACTAAA ATACCGAATA CCCATAAAAA TATTTTTTTC 5 52 0 

GGTAGGCTCA TTTTACTTTT AGACGAACGT TTCAATCCCA CCACTCCTTT ACTATTCCTT 558 0 

ACATACTTTG TCTGTTTTCT CTATTTATTA TAT AG T AAAA TAATTTTTTT AC TAT AC T T C 5 64 0 

TGTAGACGTA TAACTATTTT TTATCATTTT TTATCTCTAG AGAATATCTA TCTGTATTTT 5700 

TGATAACCAC CATTTGCATT TAAAAT1TTA AGTACCGTTT CATGACATGC TTTATTACTT 57 60 

ATAATAAAAG GTGCACCCTT TAAATGATCA ATTGCCTTAC CATCTAAAGT CGTCATTTTT 582 0 

AGATTCAATA GTTCTGCAAA TAAAAACTGT GCAGCAATGT CCCAAGGTTT AGGATTTGTA 588 0 

TTAATATGTG CCCCAAATTG ACCTTTTGCC ACTCGCATAG AATCTAATCC GEAGCACCA 5 94 0 

ACTAAACGAT AACTAAATGA GGCGTCAAAT AAATCTTGCA CCGTATCTAG ATTCATCACT 6 00 0 

TGTGCATTAA AC G AT AT AAT AGCGTCTTCC AATTTTAACG ATGGTGGTTC TTCCATCTTA 6060 

ATTCCATTAC AAAAAGCACC TTCTCCTCGT ATTGCTTTAT AAAGCTTTTT ATGCGGATAA 6120

TCATATACGT AC GAT AAC AT TGGTTTACCT TCATAAAAAT ACGCCAATAT AAT AC AAT AA 618 0 

TCTTCTTGCT GTTTTACTAA ATTGGCAGTT CCATCAATGG GATCCATAAT CCATAAATGA 624 0 

TTAATTTCAT TCGTAATCAT TTCATTACTT TTTTCTTCCG CTAATAGTTG GTGTTCCGGA 630 0 

AAATGTGTTG CTAAAAATTG TTGGAATTGT TGTTSAATCT GTTTATCTAC ATTTGTAACT 63 60 

AAATCAAATC GATGACGCTT AGTTTCTGTA GTCATTTCCA T AAT T AAT T G CGGAATAACA 64 20 

TTGTCTATTT GTTTCAACCA CGAACATATT AACTTATCTA TTTGCTGTAA TGTTTTATCT 64 8 0 

GTCATTTCGT CCACCACTTC TCATATCATT ATCATTTTAT TATTACCCTA TATTAAAAGA 654 0 

AT C AAC AAT A CAACTGAAGA CTTCTTCATT TTATGCATAA AAAAATCGGC TAGTCACGTG 6 60 0 
CTAGCCGACA AATAGAAAGG AAAGTAAGTA ATAAATATTG AAGATGTTGT GATGTAACTT 6660

GAACGATTAA AAGCTATCTG TTATATAGCT CTACCCCTTT GTTTAATCGC TCCCCCTGTT 6720 

ACAAGTAATA TCATAGCACA ATCTTTTTTA AAATGTAAGC GTTTTCCACA AAATTTTTAC 67 8 0 

GATTTTTTTA AAAAG AT AT T GAAAATGTCC TCATTGTCAC TCTTATGTTA TACTTTGTGT 684 0 

AATATATCAT CTTTTAGGAG GTGGCTGTCA TGAATAAAGC TGAAAGGCAA AATTTAATAA 6 900 

TTACTGCAAT TCAACAAAAT AAAAAAATGA CCGCTTTAGA STAGCTAAA TATTGCAACG 69 60 

TATCCAAACG CACAATTTTA AG AG AT AT T G ATGATTTAGA AAATCAAGGT GTTAAAATTT 7020 

ATGCGCATTA TGGGAAAAAT GGTGGTTACC AAATACAACA AGCACAATCT AAAATTGCAT 7 08 0 

TAAACTTATC TGAAACACAA TTATCAGCCT TATTTTTAGT GCTTAATGAA AGTCAGTCGT 7140

ACTCGACATT ACCATATAAA AGCGAAATCA ACGCAATTAT AAAACAATGT TTAAGTCTTC 7200 

CACAAACACG CTTAAGAAAA TTGCTTAAAC GCATGGACTT TTATATTAAA TTTGATGACA 7260

CACAACATAT GACACTCCCA ATGCTGTTTT CCGACATTTT AATTTATTGT ACAGAACGAA 7 32 0 

ATGTGATGTT AGTAGATCAT AGGETTGATG ATAATATTAA AGCTGAAAAC GTTATATTTA 7380 

TTGGCCTTTT GTGTAAACAT GGACATTGGC ATGCAGTCAT TTATGACATT GCTCAAGACA 74 40 

AAACTGCCGA ACTCGAAATT GAAAAT AT T A TAGATATTTC GTATTCATTC GGTAAGACGA 7 500 

TTCAAACCAG AGACATATCC ATTGATAACT ATCATCAATT TTTAAACCC ATCGATTCCT 7 5 60 

AAAAAACAGC AGTAAGATGA TTTTCAATTA GAAAAT ATCT TGCTGCTGTT CTCTATTTAT 7 620 

ACAATACTTC GTATTGAATG GnTTCGCTTT CCTAGGGTGC CGTCTCAGCC TTGGTCTTCG 7 680 

ACTGGCACTG CTCCCTCAGG AGTCTCGCCA TTAATACTAC GTATTAACAT GTAATTTTAC 7740 

TTTGAAATAC TTAAAAAAAT AAAACACTTT GCCCAACTTA CACTACCAAT AGAAACTGCT 7 8 00 

GTTAGAATTC CTCAAAATGA TATTTCGCGA TATGTTAATG AAATTGTTAA AAAGATAGCT 7 8 60 

GATAGCGAAT TCGATGAATT CAGACATCAT CGTGGCGCAA CATCCTATCA TCTAAAAATG 7 92 0 

ATGTTAAAAA TCACCTCATA TTCATATACT CAATCTGAAT TTTCTGGCCG TAGAATAGAA 7980 

AAATTACTTC ATAACAGTAT TCGAATGATG TGGTTAGCTC AAGATCAAAC ACCTTCTTAT 8 04 0 

AAAAC T AT T A ATCTTTTTAG AGTGAATCCT AATACTGATG CGCTAATTGA ATCTTTATTT 8100 

ATTCAGTTTC ATAATAAAAT GCATATCAAA AAAGC T GAT T TCTATCAAAT AATTARAGA 8160 

AATCAGCTTT TTTCaTTGCC TAAAAACTTA ATGTCCCGAC CTCTTTATCT ACGCATAAAT 8220 

ACTTATTACT GAT AT AAC G A AAGAAACAAA ATTATTTGCT ATATGTAATG CAATTGTTGA 8280 

ACCTAGGTTT CTTCCAGATT TTAAATAAGT GAAAACTAAT ATGATGGATA GT AT GAGAT A 8 34 0 

TGGACCAAAC TCAAACGGCG ACTTTGCATC AGTCACATGA ATAAATGCAA ATAAGAACAC 8 4 00 

CGAAACAATA CTCATAGCTA TAAAATTAAA CTTCTTACCT AATTCTCCAA TTAAAATATG 8 4 60 
TCTAAATACG ATTTCTTCAA CTATTGGACC TACAATCACA ATTAATAAGA ATGCTACAGG 8520

TAAAAATGCA GGCACTTCAA ACATTTTATT TAGCTCABT TCATTGGCTG TTtCACTATA 8580 

TTGCAAATGT TTAGGTAGAA ACTGTGTCAT ATATTCATAT GTATAAATTA AGATGAGAGC 8 64 0 

AATAATATAC GTTATTGACA ATCTAAGCCA ATATTTTTTG ATATACGCAA AACCAGCTCG 87 00 

AAGCCTTGAT GGCATCACTT TTAAATGAAA TAAATAAAAT GCGCCAATCC CAATCGTATA 8 7 60 

TGCTAAAGCT TGTGTGATAG TCGCTACAAA TATCAGATTA CTATCGATTT CATAATAACC 8 82 0 

AAACAAAATT GGTCCTATGT AAGCTGCAAT TGTGAGTGCA TAAAATATAA CACCTATAAT 88 80 

TGGAATTATA AGCAAATCTC TCCATGCTAT ATCTTTAAAC GTGTATTTCT TTTTTTCATT 8 94 0 

TTCCaCTGTT ATATCCtTTC CTGTTTAATA ATTGATTTTT GGAGGTACTT CTACATGATA 9000 

AACGAAACTA AGTATATGAG ACAACAAATT ACTAATTTGA T T C AAAT CAT T GAT AC GAT T 90 60 

AAATATAATA CTTTAATGAC AGATTGGAAC ATTCAAACGC ATATTTATAA ATTTAACCAA 9120 

ATAGTTACTA ATGAATTGAA TAAGTTCAAA GGCTTTGAAA CATCffiATAT AATAAACGAA 9180 

AATCAAGTTT CCTATTATGA AATTATAACA CTACTTAATA AACGTCCCCT CgACAAGTCG 92 4 0 

ACTATGGTAA CAAAATTCAA TATCTTAATT TTTATCATAC AGAACTATCT AACGCATTAT 9300 

TTGCAATTAA ATTTGCCCAT TAACCTATTT TTCATAAAAT GTCATTTAAA CAAGTTATTT 9360 

ATTAAAATTC ACTTTATTAC ATAAATTATA C AAT T Ar AAA GTTTCTTCAA ATTGTAAAGA 94 2 0 

TGCATTAATC GAGTTATAAT CATAATGATT AAGATGGT 9458
(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

AnGCGTATCA TGTCACGCAT TTTAACTACT TCTTTACCAC AAGAT TAT AC AG TC AC ATT A 60 

GTTGATCGTA TGCCATTTCA TGGATTGAAA CCAGAATTTT ATGCTTTAGC TGCGGGCACG 120 

AAAT C AG AT A AAGAT GTTCG TATGAAATTC CCTAATCATC CACAAGTGAA TACAGTTTAT 180 

GGTGAAATTA ACGACATAGA TTTAGATGCT CAAATTGTCT CAGTCGGTAA TTCTAAAATT 24 0 

GATTATGATG AGCTAATCAT TGGTTTAGGA T G T G AAGAT A AATATCATA CGTTCCAGGA 300 

GCCGAAGAAT ATACACATAG TATTCAAACA CTCTCAAAGG CTCGGGATAC TTTCCATAGT 3 60 

AT TAG T GAAC TACCAGAAGG TGCTAAAGTC GGTATCGTTG GTGCTGGATT AAGCGGCATA 4 20 
GAACTTGCCA GCGAATTAAG AGAAAGTAGA TCAGACTTGG AAATATATCT TTATGACCGT 480

GGGCCGCGAA TTTTAAGAAA TTTTCCAGAA AAAT TAAGTA AGTATGTTGC GAAATGGTTC 54 0 

GCCAAAAATA ATGTTACCGT TGTTCCAAAT TCAAATATTA ATAAAGTTGA ACCTGGTAAA 600 

ATATATAACT GTGATGAACC T AAA GAT AT T GATTTAGTTG TATGGACAGC AGGAATTCAA 660 

CCTGTTGAAG TTGTTCGTAA CTTGCCGATT GATATAAATA GTAATGGACG CGTGATAGTT 720 

AACCAGTATC ATCAAGTACC AACATATCGT AACGTCTATG TAGTTGGTGA TTGTGCTGAT 7 80 

TTACCACATG CGCCAAGTGC TCAGTTAGCC GAAGTTCAAG GTGATCAAAT TGCCGATGTG 84 0 

CTTAAAAAGC AATGGCTAAA TGAACCATTA CCTGACAAAA TGCCGGAACT AAAGGSCAA 900 

GGTATCGTTG r 910 
(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

TTTTTGATTC AAAGTGGTGA TTTAACAAGC ATTTTAAATA GCAATGATTT GAAAGT CAC A 60 

CATGATCCTA CCACTGATTA TTATAATTTA TCTGGTAAGT TGTCGAACGA TAAT CC AAA C 12 0 

GTTAAACAAT TAAAACGTAG ATATAATATT CCTAAAAACG CAT C AAC AAA GGTGGAATTA 180 

AAGGGAATGA GTGATTTAAA AGGCAATAAT CAT C AAGAT C AGAAACTTTA TTTTTATTTT 240 

TCAAGTCCTG GAAAAGACCA AATCATTTAT AAAG AAAG C C TTACTTATAA TAAAATAAGT 3 00 

G AAC AT TAAT ACTTATGCTG TAATTATAGA AKATCCAAA TCATCTATTA nAATCCTATA 3 60 

TTATAAAAnC ACCTCACATA ACTCGTTCAA CTGTACCAAA C CAC AT T AC A TTAGATTTTA 4 20 

GGCTAACTAT TGTGATGTAC AT C AAAAAC G AATTTGTGAG GCGTTGTATA T T T T AC AAAG 4 80 

GTGACTAGCG TTTCGTATAG CATTTCCAAC ATTACTACAC TCAAGCGTCA CGCTAAAET 54 0 

CGAAATCGAA TCCTTTCATT CAACAAAAGC TCATATCCAC TACAAACTTC ATATCAAGCG 600 

TATAAACTAT CTTGTGATAC TATCTCGATC ATATCTATAG TATGCATTTG TGTTCCGTTT 660 

CACTGAAGTA TATGTATCAT CAGT TAAGTA TAAACCGTCA TCCTTCAATG TTACTTGATA 720 

AGCATATTTC CGTGCTAACC AGGCAATATC TATATAATTT TCTCCTGCGT TTTCATAACT 780 

TCTTAAATCT TCAATATGTG CACTAACTTC AGGGaAAATG ATTCTAACAA CACTTTCATC 840 

AACCCAATAT TTGTCATGCA TCCATCGCAC TTGATCTGCC AATAAAGGTA ACTGCACATC 900 

ATTGAAATAT AGACGAAAGC CGTCACTATC ATACATTTGC CGATATGGTA ATGGCTGTTT 960 
TCTAATCACT AACACCTCGC CACCCATTAC GGTGCCTTCT CTAGTATCAT CACTTCCACC 1020

CGAAGCTTCA TACGTTGTTG GGTCAACCTG TAGTCCATGT ACATCTCCAA TATAAGCATC 1080 

TGGTTTATGT TCCATTGCAT GTCCATGTGC AATCAATGCT AATATTGTAG ATTGTGAAAA 1140

TTGAGGCTCC CATTCAATGC GATTAGGATG GCTACTATAA ATTCTAGGTT CATCTATAGC 12 00 

CTGCTGAATA TCCATGCCAA ACACTAATAC ATTGATTAAT GTTTGCGCAA C AC TAG C AAT 12 60 

GATACTTATG GCACCAGGTG CACCTACTGT TAATATTGGC TTCCCGTGAT AC AT C AC AAT 132 0 

CGTTGGAGCC ATGTTACTTA GTGGTCGTTT ATATGGTGCA ATTTCGTTAA TACCACCATC 1380 

TACTACATCA AAGCCATCCA TTGTCGTATT CAATAACACA CCGTAGCCTG GAATCGTGAT 14 4 0 

ACCTGAACCA TAAATCATAC CAATTGATGT CGTAAATGAA GCAAT AT T AC CTTCCTTATC 15 00 

AATCACACAA AAATGAGTTG TTTCGGTATG ATTTTCTTCA ACATCTCEAT GACTAATGCA 15 60 

AT C AGAC AC A ACACCATGCT CTATATCAAT ATTTGCTTTA TTGCTATCAA TGAGCGTACT 162 0 

GCGTGCTTTT AAATAATCAT CATCAATTAA TGACTGTACA GGCACCTCAT GAAAATTATC 1680 

ATCCGCCAAG TATTGCGCAC GATCACTATA TGCTAAATGC ATCGCTTGTA TCAAATGATG 17 4 0 

CAAGTAATCA ACAGATCTTG GACCCATAGA TGGTAAATCG ACATGTTCTA ATAACTTCAA 18 00 

TATTTGAATT ACCGTGATAC CGCCAGAACT AGATGGTCCC ATTGaATAAA TGTCATAGTC 18 60 

TTTAAATGTT GCACTGATTG GCGCTTTAAT CTGAATGTCA TATTTGGCTA GATCCTCTAA 192 0 

AGTGATTGTC CCACCACATG CTTTGACAA: ATTGACTAAT TGTTTCGCAA TGTCACCTTT 1980 

ATAAAATGCA TTAAACCCTT GTTCTCTTAA TATTTGAAAT GTCTTACCTA ATTCGGGTTG 204 0 

TACAATCCAA TCACCTTCAC GCCAATATTG ATTTTCATGC GTAAATACTT GTGCCGTTTC 2100 

AT GAT AC T T T GTCAATCGTG CGTGTTGCTG GCGCGAATAT TTTTCAGTAG CCCATTGGC 2160 

TGCATGACCT TCAATGGCTA GTTCAATTGC AGGATTAATT AAATCTTCCA ATGACAATTT 222 0 

AGCATAACGC TTGTGAATAT AATCAAACAG CTTTGGAATT GCTGGCACAG CGACAGTTTT 22 8 0 

ACCATGTGTA GTCATATCAA AAAATGATTT ATATTCGCCT GAATCATCTA GATAAAATTG 234 0 

TTTGTCTACA TGTTCAGGTG CTGTCTCACG TGCATCAAAC GCAGTTATAC TGCCAGTACT 24 00 

TTGCTCATAA TATAGCAAAT ACCCGCCACC ACCAATACCT GATGCAAATG GTTCTACCAC 2 4 60 

ATTCAATGCC AGTTGAATTG CAATCACTGC ATCCATGGCG TTGCCACCTT GATCTAATAC 2520 

ATCCTTACCA ATTTTAGCCG CAAGAGGATG TGATAGGAA ATTAACCCTT CTTTAGATGT 2580 

TTTTGTCTGT TTGTCATTTA AGTTAATGAC CATACTATAT CCTCCTACTT TCTGTTAAAT 2 64 0 

ATTTAAAACA T TAT T GAT T A ATGGCTTTTT CTACTTTTTC TAAATCTTGA CGTTGCTCGT 2700 

TACCAGTATC GACAAGTGGT GTAATCGGTG ATGCAATTTT AAATTTATCG CCACGATAAA 27 60 
ACTTAATAAA TTGATCCTGA TCTATCGCAT TAACTACTGC TTGTCTCAAG TTTGGATGCG 2820

TCTTAAATAT ACCTTTTTTA ATATTTAGCA T T AAAAAG AC TGACTTGCGT CCATTTTTGC 2880 

GAATAATGCT TAAATTTTTA TCCGACTTAA T T AAAT C AAA ATGTTTTTGA TTCACATCTG 2 94 0 

CCAACATATC AATTGAATGA TTTCTAAGTT CTGACAATGC ATTATTCGGG TCACCATTAA 30 00 

ACTTCAATGT AATATTTTTA ATTTTAGCTG GTCCATAACT ACCTTTTTCT GTTTCGTTGA 3060 

ATCCTGGATT ACGTTGAAAC GTTGCTTGAT ATGCATTTTT CTGTGTCATA ATGTATGCGC 3120 

CACTTGCATA CAGCGCATTT TTCCCATCTG AATTTGCAGG AATGTACTG CTATCCCCAT 3180 

ATCCTTTTGG ATATTCTTGA TTTACTTGAT T AAC AAAT T T TTTAGATAAA ATGCCTGCCG 32 4 0 

AAGAGTGTGT TAAGTAATTT ACCTCTCGAG GCATCGATTG ATCTGTCGTA ATTTTAACAA 3300 

TTTGATAAAT ACCGTCTTTA TTATTTACTT TTTGACCATC TGTCGTTAAC GATTTGACGT 3360 

TATAAGCTTT AATCAACTTA TCATAGATTG ATTTATCGTC CTTGTCTTTC TCTTTACGCA 3420 

ACTGATCGAT GTCCTCATCT TTTAATATCT TGATGTCATT TATATGTTTG TGCATATTGT 34 8 0 

AAGTATTATT GTTAGGCACA GACTTTTTAT CACGTGCTCT ATCTAAAGAA AACTTAACAT 354 0 

CTTCAGCCGA TACACGCTCT CCAGTA.TTAC GTGCTTGTCC ATTGACCACT TTCGCAAAAT 3600 

AAT CAT CAT C TCTTAACAAG AAATAAAATG CTTTATTGTC CTTATTCACA GCATAATCAT 3 660 

G AC T T AAC G A ACCTTTCGTT GTTAAATGAT CATTTTCATC T AAT AAT AAT AACCTTGTGT 3720 

ACATATTCAT ATTAATTGAA TATACTGACG GCGCAATTGA AC G T AT T GG AT C C AAT G T AG 37 8 0 

GAATTTCACC ATCTTGTTGT GTCATCACAA GTGGCCGCGT ATCTCGTTCT CTACTATTGT 384 0 

TGTAATCAAA TTGTTGCCAT AT T AATGCAC GTGAATTTGG CAATCCAACA CTATTTTTAT 3900 

CTAACACTTT ATTGTCATAT ACT AAAT TCT TTTTTGATCC ATATAAAGGC GCCATATACC 3960 

CTTTATCAAA T AC AAC T T C A TCTTCAATTT GCTTATATGT TTGTTTAACA TCTGCTTCAT 4020 

TTTGAGTAGA AGCTTTATTT AACAACTGGT CTACATGTTT ATCTTTCAAT AAACTATTTG 4 080 

ATCCTGTAGA AC T AAAT AAT GCCGTCATAG CATAGTTCGG GTCACCAAAC ACTGTCATCC 414 0 

AGTCATCAAT TTGGATATCA TAATTGCCGG CIT GACGTTG TGTACGATAG CTACCATAAT 4 200 

CTGGTTGGAT ATTCATCTTC ACGTTAAATC CTGCATTTTC C AAT T GAT C T TT AAC GAT AT 4260 

TCATATCATT TTCATAACTT GCTTGTCCTA GGAAATGTAT TGTTGGTCGC TCGCCTTTCA 4 320 

CTTCAACTTT CGATGACTTT TGAGCCACTT CTGATTTCGT AGGGACACCA C AAC C ACTA 4 380 

AT AC C AAC G C TAAAACTATA ATTGCGATAC TAATGATTTT CTTCACATCT ATCCCTACCT 4440 

TTTTAATGAA TTCTTGGATC TAGTGCATCA CGCACTGCAT CACCTATAAA ATTAAATGCT 4500

AAAACGACGA ACATAATACA AACACCAGGT ACAATAGCTA AATTACTGTG CGTTTCCAAG 4560

TAGTTACTAC CGGTACGTAA AATGTTGCCC CATTCAGCTA CATCAGGTGC AACACCAAGT 4620
CCTAGGAAAC TTAAACTACT TGTTGTTAAT ACAACCACAC CTATATTTAA TGAAAAACGT 4680

ACAATCATAG GCGCAATCGC ATTCGGTAAA ATATAACGCC ATATGATATT CCAAGTGTTT 4740

TCACCAGTGA TACGTGCTGC ATCTACATAT TCCATGCGTT TAATTTCTAA AACACTGGCA 4800 

CGCATTGTCC GTGCAAATGA TGGTATATTA CCGATACTTA AAGCAATAAT TAAATTTGGA 4860

ATACTTGCTC CAAATGATGC AATAATTGCC ACCGCTAACA ATAATGATGG AATTGCAAAC 4920

ACTACATCTA AAATTCGCAT TATTAAATTA TCAATATGAT TAAAATAACC TGCGATAGTG 4980

CCTAGTAACA CACCAAAAAT AACTGCAATA ACTACTGAAA TAATTGAAAT TGAAAATGTC 5040

AGCTTCGTTC CTACAACTAC GCGTGTAAAT AAGTCTCTAC CGAAATCATC AGTACCAAAC 5100

GGATAGGCTA GACTCGGTCC ATGTAACAGT GCATTGAACT GATTTTTAGT AGCCAATGTC 5160 

GTATCAAATG TAAATTGTGA CR.CAATTGAT AATGTCAGCA TGTAGACTAA AATAAGTAAC 5220

CCGATAATCG CAATACGATG TCTAGTAGTT TTTCGTATAA ACGATTCCCA CCCGTTATAA 5280

CTATGTATTT GCGATGTACG TTGGTAACGT CTAATACTTA CAAACATTAA TAATGTAAAT 5340

ACGTTGCCTG TTAATGTCAT CAACAATAAC AACACTTCGA CGATACffiCG CCATAGGTCA 5400 

TGATGCTTCC ATGTTTGTTC CGTTGTTAAA ATAATAATTA AAATGATGGT TAAAACGATT 5460

AGCAATGTTT CAGCAATATA GAACGTATCG GCCACATAAC CTTTAAAAAG ATTTAATGCA 5520 

CTCGTTAATA TAACTAAAAT ATAAGTTGCT ATGGCGTAAC TTGCGAATAA TTTTAAGGAA 5580

GCTATCTTTG AATTAAGTTG TGCCATATGC CTCACTTCCT TTCGTTGATT TCACTACGTA 5640

ATTTTGGATC GATTAAAGCA TAAAATATAT CAATAATTAA GTTTGCTAAA GATATTACAA 5700

TTGATATATA TACGACCCCA CCCATGACTG CTGGAATATC AGGTATTAGT TGTTTTTGGA 5760

CGATATAACG CCCGATACCA TTAATGTTm ATACTTGTTC CGTCACTGCT GAACCGCCTA 5820

GTAACTCTGC CACTAGAAGA CCAACTAACG TTACAATTGG AATAATGGCA TTTTTCAAAA 5880 

TATGTTTAAT AACAACTTGT GTCGTCGATA ATCCTTTTGC ATAAGCAGTT AAAACATAAT 5940

CGctGCGCAT TACTTCAAGT ACAGAAGACC TTGTCATACG CGTGATAGAA GCAGAATAC 6000

TTGTTCCAAT GACAAGTACA GGTAAAATCA ACGATATTGG ATGTTCTGGC ATATAAGATG 6060

GTGGCAAAAT ATCCAATTTC AATGAGAACG CTAAAATGAA TAATAGCCCT TGCCAGAAAC 6120 

TTGGAATAGA TAAACCAATT AATGCAATTA TCATTAACGT GATATCAAGC CAACTATTTC 6180 

GCTTCATCGC ACTGATAATA CCAATTGGTA TTGCAATAAT TAATGCCACC ATTAGCGCTA 6240

ATACTGCGAC AATTATTGTA ATTGGAATTC TTTCGCCAAC TGCTTTAGTC ACAACCTCAT 6300 

TCCCTTTGTA AGTCGTACCT AAGTCAAAGG TAAAAACACC CTTGATGGTA TCCCACAATT 6360 

GAATAAAATA AGGTTCGTTA AGATGATGTA ATACATGAA TTGATGTATC TGTGCCTTTG 6420
TTGCATTTTG TCCCAGTATG CTATAAGCCG CATCAAGCGG TGAAAAATAC AGAATGGTAA 6480

ACACACTGAC AATAACACCA ATGATGACAA TCACAGCCAT GACAATTCGT TCAAAAATAT 6540

ATCTAACTAA TGGCTGTAAA TAAAAAGTCA ATAAGATGAA CATCGGCAAG GCCAATATCA 6600 

CTTTGATCAT GATGAACTTA TGAAATAATA CATTTTCAAA GTATGTTGAA AAATGTGCTT 6660 

GTTCAATATT CTTTGAACTC GTATTAGAAC TTTGTGCCTT GAATATTTTT AATGCTTCTT 6720

TATGTATTTG TGTGGATGAC TTTTGCTGCG ATAAATATTT ATATTTTTGA TGTAACGCCT 6780

GTTCAATTTC TGAAATTTCA GAATTATTAG CGTAAAAATT TTTCCTCTTA GCAGAAAAGA 6840

AAAACTTTAT CACTGCATAT AAAAATATTG GCAAGCTTAA TACCGATAAT ACAAACTTGT 6900

AAAATGCCTT GTGATACAAC GACTTAAAAG ATTCTAAATA AGTAATTATG ATAATCACTC 6960 

CTTGTAAAAT AATCTTGAGT AGATTACTAT GATATACAAA AG»TAGAAT AAATTTACAC 7020

ATTTGTGaAT AGGGAGGCAC AACATCATGT CAAATTTATT AGAAGTCAAC AGTCTGAATG 7080

TACAATTCAA TTATGATGAA ACTACAGTTC AAGCGGTAAA AAACGTCTCT TTCGAATTAC 7140

GAAAAAAACA TATCCTAGGT ATTGTTGGTG AATCAGGATC AGGAAAAAGT ATTACCGCTA 7200

AATCTATTTT AGGGCTACTA CCAGATTATC CAGATCACAC ATTAACAGGA GAAATTATTT 7260

TTAATGGGCA ATCGTTAAAT AATTTATCAA CTTCAGCGTT ACAACAAATT CGAGGTAAGG 7320

ATATTTCAAT GATTTTTCAA GATCCACTCT CTTCGTTGAA TCCAAGATTA ACGATTGGCA 7380 

AACAAATTAC AGAAGTAATA TTTCi&CATA AACGTGTATC TAAATCTGAA GCAAAGTCGA 7440

TGACAATAGA CATTTTAGAA AAAGTAGGTA TAAAACATGC AACTCGACAA TTTGATGCTT 7500

ATCCACATGA ACTTTCTGGT GGTATGCGTC AACGTGTCAT GATAGCAATG GCATTGATTT 7560

TAAAGCCACA AATTTTAATC GCAGATGAaC CAACAACGGC ATTAGATGCC AGTACACAAA 7620

ATCAATTACT GCAGTTAATG AAGTCCCTTT ATGAGTACAC AGAAACATCT ATTATTTTTA 7680

TCACTCACGA TTTAGGCGCT GTGTATCAAT TTTGCGACGA TGTGATTGTA ATGAAAGATG 7740

GAAGTGTCGT TGAAAGTGGC ACGGTTGAAA GTATTTTTAA ATCGCCACAA CATACCTATA 7800 

CAAAACGCTT AATAGATGCG ATTCCTGATA TTCATCAAAC GCGTCCGCCA AGACCGTTAA 7860

ACAATGATAT TTTATTAAAA TTCGATCGCG TGAGyGgGAT TACACATCAC CGAGTGGCAG 7920

CCTATACCGA GCAGTTAATG ATATTAACTT GGCTATTAGA AAAGGCGAAA CATTAGGCAT 7980

TGTCGGTGAA TCAGGGTCAG GGAAATCGAC ATAGCTAAG ACGGTCGTCG GTCTAAAGGA 8040

AGTGTCAGAA GGCTTTATTT GGTATAACGA ATTACCATTA AGTTTATTTA AAGATGATGA 8100

ATTGAAATCT TTACGACAAG AGATACAAAT GATTTTTCAA GATCCATTCG CATCTATTAA 8160

TCCAAGATTT AAAGTCATTG ATGTGATTAA ACGACCACTA ATCATTCATG GGAAAGTEA 8220

AGATAATGAT GACATTATTA AAACTGTCGT ATCGTTGTTA GAAAAGGTTG GCCTAGATCA 8280
AACTTTCTTA TATCGCTATC CACACGAATT ATCTGGTGGG CAACGTCAGC GTGTAAGTAT 8340

CGCGAGAGCA CTTGCTGTTG AACCTAAAGT GATTGTTTGC GACGAGGCAG TGTCCGCTTT 8400 

AGACGTTTCA ATTCAAAAAG ATATCATCGA GTTATTAAAA CAATTACAGT TAGACTTCGG 8460

CATCACTTAT TTATTCATCA CACATGACAT GGGTGTTATC AATGAAATAT GTGATCGCGT 8520 

TGCAGTTATG AAAAATGGCG AAATCGTTGA ACTGAATAAC ACAGAAGATA TTATCAAACA 8580 

TCCGCAGTCA GACTATGCAA AGCAACTTAT TTCAGAAGTA GCAGTTATTG CTAAATAAAA 8640

GTCATGCGTT GTGCAACTTT ATCACTGTAT GGTCTGAAAT AAATTGCGCG ACTTCTGATG 8700 

TGTTATGTTA GATGACTTTG TCATACTAAC TATATTGTGC TAAGTTAGGG CTCGGCACCA 8760

TATCAAGTTT TAGGTGCTTT GCCATGATTT AAGAGTCACC CCCATACTTT GGGCATTTTA 8820

ACGCCAGAAT AAATCCCCCG CCACTATGTG AAGTGTGGGG GATTATTTAT ATTTTATTAG 8880

AATATTCAGA TTTTTGAGTG TGTCAACTTA GCTTAGTCAA TGTATATTTA ACGTCACTTA 8940

CTCTTTTTCT TTCATAATTA ACACATTCAA ATAAACTTTG ATCAAAAAAC ACAAAGTTAA 9000 

AAGTACCATC TTGTAATATG CTCTCATACA TTATCCCGTC ATATTTAAGG CTTCGAATAT 9060

AATCAGCTAA ATATTGAAAT GGCAAATAAT CTATTCCTTG TTCATCGCTT GGATTTGTTA 9120 

TTCCTTTATG AATCTTTTTT AATGTTTGGT AATTTACAAA ATACTTTCTA AATCCATCAT 9180

CGCCAGCTTT GATTGCATTA CTAGTTAAAT TAGTTAAATT CGCAATTTC AATTTCTCTT 9240

TTGTCACGTT TTTTTGTAAC TTAACCTTAC CTATATAAAT AATGTCATTA TGCTTAGGTT 9300

TAACTTCTTC TATACTGACC TGTTCTTTTG TACTAAGGTA TAATACGCTT ATCCATTTAG 9360

AATTCAATCT TCCTGCCGTT GCAAATCCCT TTGGTGGTGA CATTAGTTCA CTTTTCTCTG 9420 

TAATGAACTT AACTATTCTA GATCTATATA ATGGTTCAAA TCTTTCTCTA AATTCCTCAA 9480

TACTATAGTA ATTAGTAGTG ATATCGAGAA AGAACGCTAA ATTCTCTAAA TTGATCATAT 9540

TTTTATGAAA TCTATTTTTA TACTTCAAGC TCTCACAAAA TCCATCCCAG TCATTATTTG 9600 

CTACAATTAG ATTTTTATTT GTATATTT1T TATCGTTTAT GATTTTAGCG CCTACTAAAT 9660 

CTTCCAACAC TCGTCTATCT AAATTTTCAT CATCTTTAAA AAGTTCATTT AAAATACAAC 9720

TTATTTGAGC TTCCTCAACA TTAAATATAC TCCAGTCGTC TTTTAATGCT ATTTCAATCT 9780

TTTTACCTTC TTTTGGGCTA AAAGTATCTG GTAAATTTAT ACTAATATCA TATATTCTA 9840

ATGCTGGTCT TAAATAATCT CTAATAAGTT CTAATTTATC TATGTCCTTA GTCGTATCAA 9900 

ATATTTTAAC ACCAAGATGA TTGTTATCAA TATCACAATT GTCAAATTTG CTATTTATCA 9960

TTTGCAATGA TTTCTACGAT TTCAGTATTA TTAAAACATT TTTCACATAT TTTCATTTTG 10020 

AGACTCCAAG TATCTATTCA TAATTTCTAG GTGATGCATG ATAGATAACC TTTTAATTAA 10080 
ACCTAATCCT GGATaCTTAT TATTTTCATT TAATTCTTCA AATTGTCCCA AGCGCATAAG 10140

ATCTATTTTT AATATCTAAG TTTTTTGACC ATGTTACTAA TT 10182 
(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3491 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

AACTCAGGCA ATTGAAACAG CATTAGGTGC TCATTACAA CATGTCATTG TAGATTCAGA 60

AAAAGATGGA CGCCAGGCTA TTCAATTTTT AAAAGAACGT AATTTAGGTC GTGCGACGTT 120 

TTTACCATTA AATGTTATAC AGAGTAGAGT GGTAGCGACT GATATTAAAT CTATTGCTAA 180

AGAGGCAAAC GGATTTATTA GTATCGCTTC GGAAGCAGTT AAAGTAGCAC CAGAATACA 240 

AAATATTATC GGGAATTTAT TAGGTAATAC GATTATCGTT GATCATTTAA AGCATGCAAA 300

TGAATTGGCA CGTGCGATTA AATATCGAAC TCGTATTGTT ACTTTGGAAG GTGATATTGT 360

AAATCCTGGT GGtTCTATGA CTGGTGGTGG CGCTCGTAAG TCAAAAAGTA TTCTGTCTCA 420

AAAAGACGAG TTGACAACAA TGAGACACCA ATTAGAAGAT TACTTGCGTC AAACAGAATC 480

ATTTGAACAA CAATTTAAAG AGTTGAAGAT AAAAAGTGAT CAATTAAGTG AACTGTATTT 540

TGAAAAAAGT CAAAAGCATA ATACACTTAA AGAGCAAGTG CATCATTTTG AAATGGAGCT 600

CGATAGATTA ACTACACAAG AAACACAAAT AAAAAATGff CATGaAGAAT TCGAATTTGA 660

AAAAAATGAT GGTTATACGA GTGACAAAAG TCGACAAACT TTGAGTGAAA AAGAAACTTA 720

TCTAGAAAGT ATTAAAGCAT CTTTAAAACG ACTAGAAGAT GAAATTGAAC GCTACACAAA 780

ACTTTCTAAA GAAGGTAAGG AAAGCGTTAC TAAAACACAA CAAACCTTAC ATCAGAAACA 840

ATCTGATCTT GCTGTGGTTA AAGAGCGTAT TAAAACACAA CAACAGACAA TAGATCGATT 900 

AAATAATCAA AATCAACAAA CTAAACATCA ATTAAAAGAT GTTAAAGAAA AAATTGCATT 960

CTTTAATTCG GATGAAGTGA TGGGCGAACA AGCTTTTCAA AATATTAAAG ATCAAATTAA 1020

TGGTCAACAA GAAACGAGAA CACGCTTATC AGATGAATTA GATAAATTGA AACAACAACG 1080

TATTGAGTTG AATGAACAAA TCGATGCGCA AGAAGCTAAA CTACAAGTTT GTCACCAAGA 1140

TATTTTAGCT ATCGAAAATC ACTACCAAGA TATTAAAGCT GAACAATCAA AGCTAGATGT 1200

ATTAATTCAT CATGCGATAG ATCATTaAAT GATGrATATC AATTGETGT TGAACGTGCG 1260

ArATCTGAAT ATACGaGTGA TGrATCGATg ACGCATTACG TAAAAAAGTT AAGTTAATGr 1320 

AGaTGyCGAT TGATGrACTA GGTCCTGTAA ACTTAAATGC AATTGAACAA TTTGAAGAGT 1380
TAAATGAACG TTATACATTT TTAAGTGAAC AACGTACAGA TCTTCGTAAA GCTAAAGAAA 1440

CATTAGAGCA AATTATAAGT GAAATGGATC AAGAGGTTAC TGAAAGATTT AAAGAAACTT 1500 

TCCATGCTAT TCAAGGACAT TTTACAGCTG TGTTCAAACA ATTGTTTGGT GGAGGCGATG 1560

CAGAATTGCA ATTAACTGAA GCCGATTATT TAACAGCTGG TATTGATATT GTGGtACAAC 1620

CACCGGGTAA AAAGTTGCAA CATTTATGST TACTGAGTGG TGGTGAGCGT GCATTAACTG 1680

CTATTGCTTT ACTATTTGCA ATTTTAAAAG TAAGATCTGC ACCTTTTGTT ATATTAGrTG 1740

rGGkTGrAGC TGCACTAGAT GAAGCAAATG TTATTAGATA CGCAAAATAT TTAAATGAGT 1800 

TATCAGACGA AACACAATTC ATTGTTATTA CACACCGTAA AGGAACAATG GAATTGCAG 1860

ATAGGTTATA CGGTGTAACA ATGCAAGAAT CAGGTGTTAC TAAACTTGTG AGTGTGAATT 1920 

TAAATACAAT AGATGATGTG TTGAAGGAGG AGCAATAATG AGCTTTTTTA AACGCTTAAA 1980 

AGATAAGTTT GCAACAAATA AAGAAAATGA AGAAGTTAAA TCCTTAACAG AAGAACAAGG 2040

TCAAGACAAA TTAGAAGATA CACATTCTGA AGGTTCAACG CAGGACGCAA ATGATTTAGC 2100

AGAAAATGCT GAAGTGAAAA AGAAGCCACG CAAGTTGAGT GAAGCGGATT TTGATGACGA 2160

TGGCTTAATA TCAATTGAAG ATTTTGAAGA AATTGAAGCT CAAAAAATGG GTGCTAAATT 2220 

TAAAGCAGGA CTCGAAAAAT CTCGTCAAAA TTTCffiAGAA CAATTAAATA ATTTGATAGC 2280

GAGATATCGT AAAGTAGATG AAGACTTTTT TGAAGCTTTA GAAGAAATGT TAATCACTGC 2340

AGACGTCGGT TTTAATACAG TGATGACGTT AACTGAAGAA TTACGTATGG AAGCACAACG 2400

ACGTAATATT CAAGATACTG AAGATTTGCG TGAAGTCATT GTTGAAAAGA TCGTAGAGAT 2460

TTACCATCAA GAAGATkATA ATTCAGAAGC TATGAACTTA GAAGATGGTC GTTTAAATGT 2520 

CATTTTAATG GTTGGTGTGA ATGGTGTTGG TAAAACAACA ACAATTGGAA AATTAGCTTA 2580 

CCGATATAAA ATGGAAGGTA AAAAAGTAAT GTTAGCTGCG GGCGATACTT TTAGAGCGGG 2640

TGCTATTGAT CAATTGAAAG TTTGGGGCGA ACGTGTTGGT GTAGACGTAA TTAGCCAAAG 2700

TGAAGGTTCT GATCCAGCTG CTGTTATGTA TGATGCgATT AATGCCGCTA AAAACAAAGG 2760

TGTTGATATT TTAATCTGTG ATACCGCTGG ACGTTTACAA AATAAmACAA ATCTAATGCm 2820 

AGAATTAGAA AAAGTTAAGC GTGTAATTAA TCGAGCAGTG C&GATGCGC CTCATGAAGC 2880

ATTACTATGT TTAGATGCTA CAACTGGTCA GAATGCGTTG TCACAAGCTA GAAACTTTAA 2940

AGAAGTAACA AATGTTACAG GTATTGTATT AACGAAATTA GATGGTACAG CCAAAGGTGG 3000

TATCGTATTA GCCATTCGTA ATGAATTGCA CATCCCAGTT AAATATGTAG GTTTAGGTGA 3060

GCAATTAGAT GACTTACAAC CATTTAACCC TGAAAGTTAT GTCTACGGCT TATTCGCTGA 3120

TATGATTGAA CAAAATGAAG AAATAACAAC AGTTGAAAAT GATCAAATTG TAACAGAAGA 3180
AAAGGACGAT AATCATGGGT CAAAATGATT TAGTtAAAAC GTTACGAATG AATTATTTGT 3240

TTGATTTTaT CAATCCTTAT TGACEAATAA ACAACGTaAT TATTTGGAAT TATTTTATCT 3300

TGAAGATTAT TCTTTAAGTG AAATCGCAGa TACTTTTAAT GTGAGTAGaC AAGCAGTTTA 3360

TGATAATATA AGAAGAACTG GCGATTTAGT TGAAGATTAT GAAAAGAAAT TGGAATTATA 3420

CCAGAAATTT GAGCAACGCC GAGAAATATA TGATGAAATG AAACCACATT TAAGTAATCC 3480

AGAACAAATA C 3491



(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4253 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

AGTACGTTTT ATAATTATAA GTACGTAATT AACATATTAA CATATCGCAA GTATGTATTT 60 

AAATAAgATT GTTATAATTT CAAAGTTCAT CCAAGaTTAT GGCGTTTGCA TTTACCTATT 120 

AAAAACGTTA TTATATCAAA GATGCGAAAG ATAATACGGG TTTATTTTAT GAAAGTGAGA 180 

AGGATAAAAT GGATAATGAG CAACGCTTAA AAAGAAGAGA GAATATAAGG AATTTCTCGA 240

TTATAGCACA TATTGACCAC GGAAAATCTA CATTGGCTGA TAGAATTTTA GAAAATACCA 300 

AATCAGTTGA AACAAGAGAT ATGCAAG\TC AGTTACTAGA TTCAATGGAT TTAGAAAGAG 360

AACGTGGTAT TACAATCAAA TTAAACGCgT ACGTTTAAAG TACGAAGCTA AAGATGGAAA 420

TACTTATACA TTCCATTTAA TCGATACGCC TGGACACGTC GATTTTACAT ATGAAGTGTC 480

ACGTTcTTTG GCAGCTTGTG AGGGCGCGAT TTTAGTAGTA GATGCGGCTC AflGTATCGA 540 

AGCACAAACA TTAGCAAATG TTTATTTAGC ATTAGATAAT GAGTTAGAGT TATTGCCTGT 600 

TATTAACAAA ATTGATTTAC CTGCTGCAGA ACCTGAACGC GTGAAACAAG AAATTGAAGA 660

TATGATAGGT TTAGACCAAG ACGATGTTGT TTTAGCAAGT GCTAAATCTA ACATTGGAAT 720 

TGAAGAGATA CTAGAGAAAA TAGTTGAAGT TGTGCCAGCT CCAGATGGTG ACCCAGAAGC 780

ACCACTAAAA GCGTTAATAT TTGATTCTGA GTATGATCCA TATAGAGGGG TAATTTCATC 840

GATAAGAATT GTGGACGGTG TTGTTAAAGC CGGAGATAAA ATTCGAATGA TGGCCACTGG 900 

TAAAGAGTTC GAAGTAACAG AAGTTGGAAT TAAACACCT AAGCAGCTTC CAGTTGATGA 960

ATTAACAGTT GGTGATGTTG GTTATATTAT TGCAAGTATT AAAAATGTTG ATGATTCTAG 1020 

GGTTGGTGAC ACCATCACAT TAGCTAGTAG ACCTGCATCA GAACCATTGC AAGGTTATAA 1080

GAAAATGAAT CCAATGGTAT ATTGCGGACT GTTCCCAATA GATAACAAAA ATTATAATGA 1140
TTTAAGAGAA GCATTAGAAA AATTACAATT GAATGATGCA TCATTAGAAT TTGAGCCTGA 1200

ATCGTCACAA GCATTAGGTT TTGGTTATAG AACTGGTTTC TTAGGTATGT TACACATGGA 1260

AATAATTCAA GAAAGAATTG AAAGAGAATT TGGTATTGAA TTAATTGCAA CTGCACCATC 1320

TGTAATTTAT CAATG1GTTT TAAGGGACGG TTCAGAAGTG ACGGTTGATA ACCCAGCACA 1380

AATGCCAGAT CGTGATAAAA TTGATAAAAT ATTTGAGCCA TATGTTCGTG CAaCTATGAT 1440

GGTTCCAAAT GACTATGTCG GTGCAGTAAT GGAATTATGT CAACGTAAAC GTGGACAATT 1500

TATAAATATG GACTATTTAG ATGATATTCG TGTAAATATT (PTTATGAAT TACCTTTAGC 1560

TGAAGTTGTA TTTGATTTCT TCGATCAACT TAAATCTAAT ACTAAAGGAT ATGCATCATT 1620

TGATTATGAA TTCATCGAAA ATAAAGAAAG TAATTTAGTC AAGATGGATA TTTTATTAAA 1680

TGGTGATAAA GTGGATGCGC TAAGCTTCAT AGTTCATAGA GATTTTGCAT ATGAACGTGG 1740

TAAAGCATTA GTTGAAAAAC TTAAAACGTT AATTCCAAGA CAGCAATTTG AAGTACCTGT 1800

ACAGGCTGCA ATAGGACAAA AAATTGTAGC GCGTACAAAT ATTAAATCAA TGGGTAAAAA 1860

CGTTTTAGCT AAATGTTATG GCGGTGACAT AAGCCGTAAA CGTAAATTAC TTGAAAAACA 1920

AAAAGCAGGT AAAGCTAAGA TGHRAGCAGT TGGTAATGTT GAAATTCCAC AAGATGCTTT 1980

CTTGGCTGTA TTGAAAATGG ATGATGAATA ATTTTAAAAA ATCAATTAAC AATTTACAAT 2040

GAATAAAGTT TAATAACTAA AAAGAGGGAG CCTAGGATAA ATTAACGTCC TGGGCTTTAC 2100 

AATGTTATAT TGGCAGCCAT CGACAGAGTT AAAATGAGCT TATAACAACE GGGCCCCAAC 2160

ACAGAAGCTG ACGAAAAGTC AGCTTACTAT AATGTGCAAG TTGGGGTGGG GCCCCAACAT 2220

AGAGAATTTC GAAAAGAAAT TCTACAGGCA ATGCAAGTTG GGGTGGGACG ACGAAATAAA 2280 

TTTTGCGAAA ATATCATTTC TGTCCCACTC CCTTATGCAT GAGTTTTACT CATGTAATTT 2340

TATTTTTAAG GACATATTAC ATCTGGCTAA TGTGTAAGAG CCACTACATA ATAAATCATT 2400

AGTGGTTCTT TATTATTTCT ATCTCACTCC CTCTAAACAA GAATAAATAT TAAAATGAAT 2460

CGATATATTA GACAATCATT GATTAAACGT TAAAGTTAAA AGTAAGAATA ATTGCAGATA 2520

GTCCAACAGG ATATAGCCGA TTGGATAAAA AGTCTGAGAA GCGGGGCATT AAAATGACGG 2580 

TACAAAGTGC ATATATACAT ATTCCATTTT GTGTAAGAAT ATGTACATAT TGTGATTTCA 2640

ATAAATATTT TATACAGAAT CAACCTGTAG ATGAGTACTT AGATGCACTA ATCACAGAAA 2700

TGTCTACAGC AAAATATAGG ATCTTAAAGA CCATGTATGT AGGTGGCGGC ACACC&CGG 2760

CCCTTTCTAT TAATCaGTTG GAAAGATTAC TTAAAGCAAT ACGTGATACG TTTACAATCA 2820

CAGGCGAGTA TACATTTGAA GCAAATCCTG ATGAGTTAAC TAAAGAGAAA GTCCAACTAT 2880

TAGAGAAATA TGGAGTAAAA AGGATTTCAA TGGGCGTTCA AACATTCAAG CCGGAGTTAT 2940
TGTCTGTTTT AGGTAGAACG CACAATACTG AAGATATTTA CACTTCGGTG TTAAATGCTA 3000

AAAACGCAGG TATTAAATCA ATCAGTTTAG ATTTAATGTA TCATTTACCG AAACAGACGA 3060

TTGAAGATTT TGAACAAAGT TTAGATCTAG CTTTAGATAT GGATATTCAA CATATTTCGA 3120

GTTACGGCTT AATACTTGAA CCTAAAACCC AATTTTA'AA TATGTATAGA AAAGGCTTGC 3180

TCAAACTTGC TAATGAGGAT TTAGGTGCTG ACATGTATCA GTTGCTGATG TCTAAGATAG 3240

AACAATCTCC TTTCCATCAA TACGAAATAT CTAATTTTGC ATTAGATGGC CATGAATCAG 3300 

AACATAATAA GGTTTACTGG TTTAATGAGG AATATTATGG ATTTGGAGCA GGTGCAAGTG 3360 

GTTATGTAGA TGGTGTGCGT TATACGAATA TCAATCCAGT GAATCATTAT ATCAAAGCTA 3420

TAAATAAAGA AAGTAAAGCA ATTTTAGTAT CAAATAAACC TTCTTTGACT GAGAGAATGG 3480

AAGAAGAAAT GTTTCTTGGG TTGCGTTTAA ATGAAGGTGT GAGTAGTAGT AGGTTCAAAA 3540 

AGAAGTTTGA CCAATCTATT GAAAGTGTCT TTGGTCAAAC AATAAATAAT TTAAAAGAGA 3600

AGGAATTAAT TGTAGAAAAG AACGATGTGA TTGCACTTAC AAATAGAGGG AAAGTCATAG 3660

GTAATGAGGT TTTTGAAGCT TTCCTAATAA ATGATTAAAA AAAATTGAAA TTTCGAGTCT 3720 

TTAACATTGA CTTACTTTGA CCAATTTGAT AAATTATAAT TAGCSTTGA GATAAGTGAG 3780

TGCTAATGAG GTGAAAACAT GATTACAGAT AGGCAATTGA GTATATTAAA CGCAATTGTT 3840

GAGGATTATG TTGATTTTGG ACAACCCGTT GGTTCTAAAA CACTAATTGA GCGACATAAC 3900 

TTGAATGTTA GTCCTGCTAC AATTAGAAAT GAGATGAAAC AGCTTGAAGA TTTAAACTAT 3960

ATCGAGAAGA CACATAGTTC TTCAGGGCGT TCGCCATCAC AATTAGGTTT TAGGTATTAT 4020

GTCAATCGTT TACTTGAACA AACATCTCAT CAAAAAACAA ATAAATTAAG ACGATTAAAT 4080

CAATTGTTAG TTGAGAATCA ATATGATGTA TCATCAGCAT TGACATATTT TGCAGATGAA 4140

TTATCAAATA TATCTCAATA TACAACTf TA GTTGTTCATC CTAATCATAA ACAAGATATT 4200 

ATCAATAATG TACACTTGAT TCGTGCTAAT CCTAATTTAG TTATAATGGT TAT 4253
(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3395 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

TCCCTAATCG AACAAAATTA TGCGCATAAA CAAAGTAGAT TGATATAAAA TTCTTAATTA 60 

TCAGAATATA TTTACAAATC TGAATTTTAT TAGTATATTG GrTAGTrTC ATAGAGGCAT 120

GACGGTaTTT GAGCAGGATT TTAAATCGGg ATTTTATAAT CGATTTAAGA GAGGCCACtT 180
TGCTTGcACA TTAATACTGT cAATGGGAGG GGAATGTATA TGAGTrAAGC ACATCAATTA 240

ATTCAAGAGG ATGAACATTA TTTTGCGAAA TCAGGACGTA TTAAATATTA TCCGTTAGTG 300 

ATTGATCATG GATATGGAGC AACATTGGTT GATATTGAGG GGAAGACATA TATCGATTTG 360

TTATCGAGTG CGAGTTCTCA AAACGTAGGT CATGCACCTA GAGAAGTAAC AGAAGCGATA 420

AAGGCACAAG TTGATAAATT TATTCATTAT ACACCAGCAT ATATGTATCA TGAACCGTTG 480

GTACGTTTAG CTAAGAAGCT TTGTGAGATT GCACCTGGAG ATTTTGAAAA AAGAGTGACC 540 

TTCGGATTAA CCGGATCAGA CGCAAATGAT GGCATCATTA AATTTGCCAG AGCATATACA 600

GGGCGTCCTT ATATCATTAG TTTCACTAAT GCATATCATG GTTCAACTTT TGGCTCATTG 660

TCTATGTCAG CTATTAGTTT AAATATGCGC AAACATTATG GTCCGTTATT GAATETTTT 720

TATCATATTC CGTTTCCAGA TAAATATCGT GGTATGTACG AGCAGCCACA AGCTAATTCA 780

GTAGAAGAAT ATTTAGCACC CTTAAAAGAA ATGTTTGCGA AGTATGTACC TGCTGACGAA 840

GTAGCATGTA TTGTTATTGA AACGATACAA GGCGATGGTG GACTTTTAGA ACCAGTTCCA 900

GGGTATTTTG AAGCGTTAGA AAAGATTTGT CGTGAACATG GTATTTTAAT CGCTGTCGAT 960

GATATTCAAC AAGGTTTTGG GAGAACAGGT ACATGGAGTT CAGTCTCGCA TTTTAATTTT 1020 

ACGCCTGATT TAATCACTTT CGGAAAATCC TTAGCAGGTG GTATGCCTAT GTCAGCAATT 1080

GTTGGACGCA AAGAGATTAT GAATTGTTTA GAAGCACAG CACATTTATT TACAACAGGT 1140

GCTAATCCAG TTAGTTGTGA AGCTGCATTA GCCACAATTC AAATGATTGA AGATCAGTCG 1200 

CTTCTTCAGG CTAGTGCGGA AAAAGGGGAA TATGTTAGGA AACGAATGGA TCAATGGGTA 1260

TCTAAATACA ATAGTGTAGG CGATGTTAGA GGTAAAGGTC TGAGCATTGG TATTGATATT 1320 

GTTTCCGACA AAAAACTCAA AACACGTGAT GCCAGTGCGG CACTTAAAAT TTGTAATTAC 1380 

TGCTTTGAGC ATGGCGTAGT TATTATAGCT GTAGCAGGAA ATGTGTTGCG ATTCCAACCG 1440

CCATTGGTAA TAACATATGA GCAATTAGAC ACGGCGTTAA ACACTATAGA AGATGCACTG 1500

ACTGCTTTGG AAGCAGGTA^ CTTAGATCAA TATGACATAT CTGGACAAGG TTGGTAATAG 1560

CGATTATCTT AATATAAAAT AAAAAATCAT TTCCACATCT GGATGTTAAT CAGATGGGAA 1620

ATGATTTTTT TTATTTTTTA TTTTGGTGGG TGGTATTCAG CTACGTCATT TTTCTTAGAA 1680 

TGTCTAAGTC CATAACTTAA ATATAGGATG ATACCAACAA TAAECAAAT TAAAGTGTAT 1740

AATTTCGCTT CGAATCCTAA TCCCCAGAAT ACTAGCAATA CTAAAACAAA TGTAATTGCT 1800

GGTAACACAG GATATAAAGG TAATTTAAAT GCAGGAATTG GTAGATCTTT ACCTTcACGC 1860

TTTCTCAAAC GATACATTGC TAATGAAACG AACATAAATG CAACAAGTGT ACCTGCTGAA 1920 

ATTAATTGTG CTAAAAATGC GAATGGGAAC ATAGAACCAA TTAAAACACC AATAATAGTA 1980 
AGTATAACTA GTGCGCGATT AGGTAAATGT TTGTCGTTTA AGTGGCTTAA CCATGAAGGT 2040

AATAAGCCGT CACGTCCAAA TGAATAAAGT AAACGTGAGC CTGCTAACAT CATACCAATT 2100

AATGCTGTAA ACATACCGAT AACAGA5ATA GCTTGAACAA TAGCTGCTAC AACACCATGA 2160 

CCACTTTGAC GTAAAGCCCA ACCAACAGGT TCAGCATTGT TTGCGTATTG TGAGTAATGG 2220

AACATACCAA CTAACACAAG TGCTACAGCA ATAAATAATA CGATAGCAAC GCTTAAAGAA 2280

CCAAGAATAC CTCTAGGCAT TGTCTTTTGA GGATCAAGTG CTTCTGCTGA ETTGCTGCG 2340 

ATAGAATCGA AACCGATATA CGCTAAGAAA ATCATTGAAA CACCAGCATA TATGCCTTGC 2400

CATCCACCAA AGTCACCTGT AGCAGTTACT TTGTGTTCTG GAATAAATGG CACATAGTTA 2460

CTAACATTTA TTGCTGTTAA ACCTACGATG ACAAATAAAA TAATAGCTAA TACTTTTAAA 2520

ATAACTAAAA TATTTTCCAT ACGAGCTGCT TCCGACATAC CACGTGATAG TAATAATGCA 2580

GTTAATAAAA TAACGATAGC AGCAATAATA TCGATAAAAC CGCCATTTGT ACCAAATGGA 2640

TTTGATAATG CTGCAGGTAA TTCGATGCCA ATTGGTTTCA CAAGTCCGCG TAAATTCGCT 2700 

GAGAATCCTG ATGCAACAAA GGCTACGGCG ATAAATATT CAGCTAATAG AGCCCAACCG 2760

GCAACCCATC CAAAAAATTC ACCAAATAAT ACATTGACCC AAGAATAGGC TGAACCTGCA 2820

AATGGCATAG CGGCAGCCAT TTCTGCATAA GTAAATGCAA CTAAACCAGC AACAATAGCA 2880 

GCGAGTAAGA ATGATAACGC AACGGCCGGT CCTGCATGTT CTGCAGCAAC AATGCCAGE 2940

AGCGTAAAGA TAGATGTCGA TACAATTGTT CCTACACCTA AAGCTAAGAA ATCACGCACC 3000 

CGAAGTGTAC GCTTTAAATG ACCATCTTTA TTTTGATAGA TAGCCGGATC CTCTTTTCGT 3060

GCTATTTTAT TGAAAAAACT TCCCATAAAC TTTCCTCCCA AACATTCATA AACAATTCTA 3120 

TACGGTGTTT TTTAATATGT TATATCATAG CACAAATAAT CAATATTTTG TCTAAAAATT 3180

CTGAAAAATC ACAACTTTAT GTTACGTATT AATGACTTGT CTTGATAACA TCCATAGATT 3240

TTTTAAATGA TAAAACTGAT TATAACAGAT ATTAAATGAA TAAGTACTAT TTTTTGCnAA 3300 

TTTTCTAACA ATTTTGCACA TTATATGTTT AAAATCAATT TCATGTTTAT GGTCTGATTG 3360

GCTAGTGTGT ATGAAATGTA AnTCTTTGAC TnnGA 3395 
(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

ATCAGGTAAT GCCATGCGTT TAGCTGAAAA TTTTTTCAGA ACGTTTAAGT GATATCGGAC 60 
ATCAAGTTGT TTTGATGTCA ATGGATGAAT ATGATACGAC AAACATCGCG CAGTTAGAAG 120

ATTTATTTAT TATTACGTCT ACTCATGGTG AAGGAGAACC GCCTGATAAT GCATGGGATT 180 

TCTTTGAATT TTTAGAAGAC GATAACGCAC CTAATTTAAA TCATGTGAGA TATTCAGTAC 240

TAGCTTTAGG TGATCAAACA TATGAATTTT TCTGTCAAGC CGGTAAAGAT GTAGATGTTT 300 

TACTAGAAAA TCTAGGG3CT GAGCGTATAT GTAAGCGTGT AGATTGTGAT ATTGATTATG 360

AAGAAGACGC AGAAAAGTGG ATGGCAGACA TCATTAATAT TATTGATACC ACATCAGAAG 420

GTATTCAAAG TGAATCGGTG ATAAGTGAAT CAATTAAGTC TGCCAAAGAA AAGAAATATT 480

CTAAATCAAA TCCATACCAA GCAGAAGTAT TAGCGAATAT CATTTAAAT GGTACCGATT 540

CAAATAAAGA AACACGACAT ATAGAATTTT TACTTGATGA TTTTAGTGAA TCATATGAAC 600

CAGGAGATTG TATAGTAGCA TTACCGCAAA ACGACCCTGA ATTGGTTGAA AAACTAATAT 660 

CCATGTTAGG TTGGGATCCG CAATCTCCGG TGCCAATTAA TGATCATGGT GATACAGTTC 720 

CTATTGTTGA AGCACTAACA TCACATTTTG AATTTACTAA ATTAACATTG CCATTATTGA 780 

AAAATGCAGA TATCTATTTT GACAATGAAG AATTATCTGA ACGTATTCAA GATGAGTCAT 840

GGGCGCGTGA ATATGTTATA AATCGGGACT TTATAGATTT AATAACAGAT TTTCCAACTA 900 

TAGAATTACA ACCTGAGAAT ATGTATCAAA TCCTTAGAAA ATTACCACCA AGAGAGTATT 960

CGATTTCTAG TAGTTTTATG GCAACGCcAG ATGAAGTGCA TATTACCGTT GGTACGGTTC 1020

GTTATCAAGC ACATGGACGT GAGAGAAAAG GTGTATGCTC GGTTCATTTT GCTGAGCGAA 1080

TTAAACCAGG CGATATAGTA CCAATTTATT TGAAGAAAAA TCCGAACTTC AAATTTCCGA 1140

TGAAGCAAGA TATACCGGTT ATTATGATTG GACCAGGTAC TGrAATTGCT CCTTTTAGAG 1200 

CATATTTACA AGARCGTGAA GAACTTGGTA TGACTGGAAA AACATGGTTG TTCTTTGGTG 1260

ATCAACACCG TAGTTCTGAC TTTTTATATG AAGAAGAAAT AGAAGAATGG CTTGAAAATG 1320 

GAAACTTAAC ACGCGTAGAT TTAGCATTTT CAAGAGACCA AGAACACAAA GAATATGTAC 1380 

AGCATCGTAT AATGGAAGAA AGTAAACGTT TCAATGAATG GATTGAGCAA GGCGCACAAT 1440

CTATATTTGT GGCGATGAAA AATGTATGGC GAAAGATGTC CATCAAGCCA TTAAAGATGT 1500 

ATTGGTAAAA GAACGTCATA TTTCTCAAGA JGAAGCAGAG TTATTATTGC GACAAATGAA 1560

ACAACAACAA CGCTATCAAC GTGATGTTTA TTAGCGATTG GTGTTAAATA TTTTAAGGTG 1620

TAATGATGTA AAAAGATATA AAGGATGTTG CTCAACATGA ATATGCCATT AATGATAGAT 1680

TTAACAAATA AAAATGTCGT CATAGTTGGT GGAGGCGTCG TTGCAAGTCG TCGGGOSAA 1740

ACATTAAATC AATACGTTGA ACATATGACG GTCATCAGTC CGACAATCAC TGAAAAACTT 1800

CAAAATATGG TAGATAACGG TGTCGTCATA TGGAAAGAAA AAGAATTTGA ACCAAGCGAT 1860
ATTGTAGACG CGTATCTAGT TATTGCAGCA ACCAATGAGC CACGTGTCAA TGAAGCGGTA 1920

AAAAAAGCCT TAGCTGAGCA TGCCCTTTTT AATAATGTTG GAGATGCATC AAATGGCAAT 1980 

GTTGTATTTC CAAGTGCACT ACACCGCGAC AAGCTAACTA TCAGTGTATC AACTGATGGT 2040

GCGAGTCCTA AGTTGACAAA ATCAATTATG GCAGAGCTTG AGGCGTTATA TCCACCATCA 2100

TACAGTTCGT ATATCGACTT TTTATATACT TGCCGACAO. AAATAAAAGT ACTTGATATA 2160 

ACATATAACG AAAAGCAACA GTTACTGTCA CAAATTGTGT CACAAGAATA TTTAAATCAT 2220

GACAAACAAG CTCAATTTTT AGCGTGGTTG GATGTAAGAT AATAATAGCG GACCGTCTAA 2280 

CCGTCTAAGG TAAGTCTTCT TATTTTAACT TTAACGCTTA ATCATTGAAA TTAAGACATG 2340

GGCGGCTTTG TGAATAGTCT AATAATGAAG GATTTAAGCG ATAATGATAT GCGTTTTAAA 2400

TATGAATATT ACAATAGAGA AAAAGATACG TAGAACAAAC TTAATAAAAT AGGTGGATAA 2460

ATTGAAATCT GGTTGAAGTC GTTACTATCA TAGCGACCTT TAGCCAGATT TTTTGTGCAA 2520 

TAGAAAGCAA TAATAAAAAT GATAGATCAA AATGAAATAC AGGACAGGAT ATACAAGGAT 2580

TAGTCATGCC ATGTTATCAA GTAGGAAAAT CAAACTTCAC TATTGATAGT TACGCAAAAA 2640

AGATTTTTTT GATAAAATGA GATAACTTAA ATATAAAAAA TTATATTAAT TATAATATTT 2700

AAGTTAAAGA GGGGGATTAT GTAAATTGTA TTAAAAGTGG AGGGASAAA TAATATGAAT 2760

AGTGATAATA TGTGGTTAAC AGTAATGGGG CTCATTATTA TTATTTCAAT TGTAGGTTTA 2820

CTCATTGCCA AAAAGATAAA TCCAGTTGTA GGTATGACAA TCATACCTTG CTTAGGGGCA 2880

ATGATTTTAG GATATAGTGT GACAGATTTG GTTGGATTTT TTGCTAAAGG GTTAGATCAA 2940

GTCATCAACG TTGTTATTAT GTTTATCTTT GCCATTATTT TCTTTGGCAT CATGAACGAT 3000 

AGTGGTTTAT TCAAGCCGCT TGTCAAACGC TTAATATTAA TGACACGAGG CAATGTCGTC 3060 

ATTGTCTGTG CAATGACAGC TTTAATTGGC ACAATAGCCC AATTAGATGG GGCCGGTGCG 3120 

GTAACATTTT TGCTTTCTAT TCCTGCATTA TTACCTTTAT ATAAAGCGTT AAATATGAAT 3180

AAATATTTAT TGATTTTACT ATTAGCATTA AGCGCGGCGA TTATGAACAT GGTACCTTGG 3240

GGAGGTCCAA TGGCTCGTGT AGCTGCAGTG TTAAAAGCCA AAAGTGTCAA TGAATTATGG 3300 

TATGGATTAA TACCTATTCA AATAATAGGT TTCATTCTTG TTATGTTGTT TGGGTATAT 3360 

CTTGGATTTA AAGAACAGAA ACGTATCAAA AAAGCAATAG AGAGAAATGA ATTACCGCAA 3420

ACACAAGATA TAGATGTACA TAAATTAGTT GAAGTATATG AACGAGATCA AGATGTAAGG 3480

TTTCCTGTAA AAGGACGTGC AAGAACAAAA TCATGGATAA AATGGGTGAA TACAGCTTTA 3540

ACTTTAGCTG TTATTCTATC GATGTTAATA AATATTGCGC CACCTGAATT TGCATTCATG 3600

ATAGGTGTTy CGTTGGCACT TGTTATTAAT TTTAAATCAG TGGATGAACA AATGGAACGA 3660

TTAAGAGCgC ATGCGCCGAA TGCATTAATG ATGGCTGCAG TGATTATTGC AGCAGGTATG 3720
TTTTTAGGTG TACTAAATGA AACCGGTATG CTTAAGCGA TTGCGACCAA TTTAATCAAA 3780

GTGATTCCTG CAGAAGTAGG ACCATACTTG CATATTATTG TAGGTTTACT TGGCGTACCA 3840

TTAGATTTAC TAACTAGCAC AGACGCTTAT TATTTTGCTG TGTTACCGAT TGTCGAACAA 3900 

ACAGCAGGGC AATTTGGTGT ACCGTCTGTA TCAACAGCTT ATTCAATGGT CATAGGGAAT 3960 

ATTATAGGTA CATTTGTCAG CCCATTTTCA CCAGCCTTAT GGTTGGCAAT TGGTTTAGCA 4020

GAGGCAAACA TGGGCACGTA TATTAAGTAT GCATTCTTTT GGATTTGGGG ATTCGCTATC 4080

GTTATGTTAG TAATTGCAAT GTTGATGGGC ATTGTGACGA TTTAAGTATG AAAAAATAGA 4140

AACTATGGTC ACGTTGCAAA ATGAAATAAT AGTTGCATAA ACATGTCGAA ATGACGGACG 4200

AATCTTTAAA CAATTTTAAA AATTAATGAA ATAATTGTGT AGAAATATGA ATTTCACTAA 4260

ATGTTAATAA CTTTGTGACG TTTTAGTTAA CAGACTAATA AAAATTTGAA AATACTATAT 4320

ATAGTGGTAT AACGTAATGA GTAGACACAA TATATAGGAA GAGGGGTAA AATGAATCAA 4380

ATCGAAGAAG CATTAACGGG TTTGATTTCT AAAGATCCTG CTATTGTTAA CGAAAATGCT 4440

AACAAAGATA GTGATACATT TTCAACAATG AGAGATTTAA CAGCAGGTAT CGTTTCTAAA 4500

TCTTACGCAT TAAATCATTT ATTACCAAAG CACGTTGCAG ATGCACATCA AAGAGGGGAC 4560

ATACATTTTC ACGACTTAGA TTATCATCCA TTCCAACCGT TAACTAACTG TTGTTTAATA 4620

GATGCTAAAA ATATGCTACA TAATGGATTT GAAATAGGCA ACGCGAATGT AACTTCACCA 4680

AAATCAATAC AAACTGCATC AGCGCAGCTT GTACAAATTA TAGCCAATGT TTCTAGCAGT 4740

CAATATGGTG GCTGTAcGGT TGACCgCGTT GACGAATTAC tTAGTACATA TGCACGACcA 4800

TAATGAAGAA CAACATAGGA ATATsCGCAA AGCAATTTGT CAAAGAATCT GAAATTGATC 4860

GTTATGTTGA TCAACAAGTC ACTAAAGACA TCAATGATGC GATTGAAAGT TTAGAATATG 4920

AAATTAATAC CTTATATACA TCTAATGGAC AGACACCTTT TGTAACATTA GGATTCGGCT 4980

TAGGTACAGA TCATTTAAGT CGCAAAATTC AACAAGCTAT CTTAAATACT CGTATCAAAG 5040

GCTTAGGAAA AGACCGCACG ACAGCGATTT TCCCGAAACT TGTATTTTCA ATTAAAAAAG 5100 

GAACCAACTT TAGTCCGCAA GATCCGAACT ATGACATTAA ACAACTAGCA TTAAAGTGTT 5160

CAACGAAACG TATGTATCCA GATATTTTAA ATTATGACAA ACTCGTAGAA ATATTAGGTG 5220

ATTTCAAAGC GCCAATGGGT TGTCGTTCAT TTTTACCAAG TTGGAAAGAT GCGGAAGGTC 5280

ATTTTGAAAA TAATGGTCGT TGTAATCTTG GTGTTGTTAC ACTTAATTTA CCTAGAATGG 5340

CATTAGAATC TGCCGGTAAT ATGACGAAAT CTGGGAAAT CTTTTATGAA CGTATCGATG 5400

TGTTACATGA TGCATTACTT TATCGTATAA ATCGTTTGAA AGATGCTGTA CCGAATAACG 5460

CACCGATTTT ATATAAAAGT GGCGCATTTA ACTATAAATT AAAAGAAACA GATGATGTTG 5520 
CTGAGTTATT TAAAAATAAA CGTGCAACGA TTTCAATGGG CTATATAGGG TTGTATfiAA 5580

CAGCTACTGT TTTCTATGGT CCAGACTGGG AAACATCTCA AGAAGCAAAA GCATTTACGC 5640

TTGAAATTCT TAAAGAAATG AAACGTTATC AAACGAAATG GACAGAATTA TATGACATTT 5700

GGTTCAGTAT TTmCAGTACG CCGAGTGAAT CGCTAcGGAT CGTTTTTGTC GTTTAGACCA 5760

AGAGAGATTT GGAGATATTA AAGACATTAC AGATAAAGGA TATTATCAAA ACTCTTTCCA 5820

TTATGATGTA CGTAAAGATG TTACACCTTT TGAAAAGTTA GATTTTGAAA AAGATTATCC 5880

TTATTATGCG AGTGGTGGTT TCATTCACTA TTGTGAGTAT CCGAAATTGC AACACAATTT 5940

GAAAGCACTA GAAGCGGTAT GGGACTACTC TTATGACAA GTTGGTTACT TAGGTACAAA 6000 

TATTCCGATT GATCATTGTT ATGAATGTGA TTACGATGGA GATTTTGAAG CAACTGAAAA 6060

AGGATTTAAA TGCCCGAACT GTGGCAATGA TAATCCTAAA ACAGTTGATG TCGTTAAACG 6120

AACATGTGGT TACCTAGGCA ATCCAGTTCA ACGTCCAGTA ATTAAAGGCC GTCATAAAGA 6180

AATTTGCGCA CGAGTAAAAC ATATGAAAGC GCCTAAAGAA TGATACTTTT AGACATTAAA 6240

CAAGGACAAG GTTATATTGC TAAAATAGAA TCAAATAGCT TTGTTGACGG TGAAGGAGTA 6300

AGATGCAGTG TTTATGTATC AGGATGTCCA TTTAATTGTG TTGGATGTTA TAACAAAGCC 6360

TCACAAAAGT TCAGATATGG CGAGAAATAC ACTGATGAAA TATTAGCAGA AATATTAGAT 6420

GATTGCGATC ATGATTATAT ATCTGGGCTA AGTCTATTAG GTGGCGAACC ATTTTGTAAT 6480

TTGGATATTA CATTAAATCT TGTCAAAGCA TTTCGAGCAC GTTTTGGAAA TACAAAGACA 6540

ATTTGGGTAT GGACTGGATT TTTATATGAA TATTTAGCAA ATGATCETAC AGAACGTCGA 6600

GAGTTATTAT CATACATTGA CGTTTTAGTA GATGGTCTAT TTATACAACA CTTATTCAAA 6660

CCTGATTTAC CATATAAAGG TTCTTTAAAT CAACGCATTA TAGATGTACA ACAATCACTC 6720

TCGCATGCGC GTATGATTGA ATATATAGTT AGTTGAATAT GTATTAGAAG TCAAGGTAAC 6780

ATTCGTTGCC TTGGCTTCTT TTTAGGTTAG GTACATAATT GAAAGTTAAT AAAAGCAATT 6840

CTTTATAAAA ATATATTGAT AGAATATGAC CTAACAATCA TTTTGATACC AATACTAAAA 6900

GTTGCATATC CGTTTTTTAA AAAAGTTGAA AGAGAAAAGT GGTATTTTAG TGGGAAGGAA 6960

GTCTAACTTT TTGGTAGCGT TTTACAATAA ATAAATATTC GTTAATAACG TATAAATATT 7020

CTTAAATGCC ATTCTAGTAA AATTTGTTAA ATTCGTTAAA TCGTAACTTA ACACTGTTAT 7080

TTTAGCGCTA TTAAGGTTTT GTTTATTACG GGAAAAATTA TATAAATATT CAATAATTGC 7140 

CAAGTTTCAA ATTGTATGAA ATTTGCATTA TTATTAAATG TTAGTTATTG TCATTTTGT 7200

GAATCAATAT AATTATTACA TTTTGAGATA AATCGAAACA GGATTCATAA AATTAATAAT 7260

TAGGGGGAGC ACAATTGAAA AAAGAGAAAG TTATGGACTG GACGACCTTT ATAGGGACAG 7320

TAGCTGTACT TCTTTTTGCA GTTATACCTA TGATGGCTTT TCCAAAAGCA AGTGAAGATA 7380
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TCATCACTGG TAT TAATAGT GCCATTTCTG ATTCAATTGG TTCGATATAT TTATTTATGG 74 40 

GGCTGGCGAT TTTTTGCTTT GTAATGTATA TCGCATTTGG TAAATATGGT AATGTCACAC 7500 

TTGGTAAAGC AAGTGATAAA CCAGAATTTA ATACATTTAC ATGGGCGGCA ATGCTGTTTT 7560

GTGCAGGCAT AGGCTCTGAT ATTTTATACT GGGGffSTTAT TGAATGGGCT TTTTACTATC 7620

AAGTTCCACC AAATGGCGCG AAAAGTATGA GTGATGAAGC ACTCCAATAT GCGACGCAAT 7680

ATGGTATGTT CCACTGGGGG CCAATTGCTT GGGCTATTTA TGTTCTACCA GCATTACCAA 7740

TTGGTTATTT AGTATTTGTT AAAAAACAAC CGGTGTATAA AATTAGTCAA GCTTGTCGTC 7800

CGATTTTAAA AGGTCAAACA GATAAATTTG TAGGTAAAGT TGTAGATATC TTATTTATCT 7860

TTGGATTGCT AGGTGGTGCG GCAACATCAC TAGCGTTAGG TGTGCCATTA ATTTCTGCAG 7920

GCATAGAAAG ATTAACTGGT TTAGATGGTA AAAATATGAT TTTACGTTCG GCCATTTTAT 7980

TAACAATCAC GGTTATATTT GCCATTAGTT CATATACAGG ATTGAAAAAA GGTATTCAAA 8040

AGTTAAGTGA TATCAACGTT TGGCTATCCT TTGTACTTTT AGCCTTTATA TTTATTATTG 8100 

GACCGACTGT TTTTATTATG GAAACGACAG TGACAGGGTT CGGAAATATG TTGAGAGATT 8160

TCTTTCATAT GGCAACATGG TTAGAACCAT TCGGTGGTAT TAAGGTCGA AAAGAAACGA 8220

ATTTCCCACA AGACTGGACA ATATTCTACT GGTCATGGTG GTTAGTATAT GCGCCATTTA 8280

TCGGTTTATT TATCGCTAGA ATTTCAAAAG GTCGACGCCT TAAAGAAGTC GTGCTAGGAA 8340

CAATTATTTA TGGAACGCTT GGATGCGTAT TATTCTTTGG TATTTTTGGT AACTATGCTG 8400

TGTATTTACA AATTTCTGGA CAGTTTAATG TAACACAATA TTTAAATACA CATGGTACAG 8460

AGGCAACCAT TATTGAAGTG GTGCATCATT TACCATTCCC ATCATTGATG ATTGTACTAT 8520

TCTTAGTATC TGCTTTCTTA TTCTTAGCAA CAACATTTGA TTCGGGTTCA TATATTTTAG 8580 

CGGCAGCATC TCAGAAAAAA GTGGTAGGCG AACCATTACG TGCCAATCGT TTATTCTGGG 8640

CATTTGCATT GTGCTTATTG CCATTTTCAT TGATGCTAGT TGGTGGTGAA CGTGCATTAG 8700

AAGTATTGAA AACTGCTTCA ATACTGGCAA GTGTGCCATT AATTGTTATT TTTATTTTCA 8760

TGATGATATC ATTTTTAATC ATTTTAGGGC GCGATAGAAT TAAACTTGAA ACGCGTGCTG 8820

AAAAATTAAA AGAAGTTGAA CGTCGTTCAT TGCGAATCGT TCAAGTATCa GAAGAAGAAC 8880 

AAGACGATAA TTTATAATTC AAAGCGGGTC TGGGACGACG AAATGaATTT TGTGAAAATA 8940

TCATTTCTGT TCCaTTCCCC TTTTTTTAGT AGCATTGTAG GATGAACTTT TAGGTTTTCA 9000 

TTAATGTTGT ACTAAAAGAT TTAATTTTTT AGTGCTCCAA GTACTTATTT ATTGTATGAA 9060

GCATATTCTA AATCGAAGTT TGAAAGACTC TCATTGATTA TTAAATTAAA TAAAGGGTAT 9120

GCGTATGTAC AATTCAAATT AATCGAAGGA TGAAATAAAA TGACTAATCA ATTTAAAAAT 9180

AAACAGTCCA AATTACATGA CAGTTTAGAA TJCATCACAA AAAACTTATA TGCGACACCT 9240

ACTTCGGAAC TGCCTTTTGA TAATCGATTT CTATTTAAAT CTTTTATTTT AAAACGTGAA 9300 

ACAGAATATT GTTATCTATC ATTCCGGACA CTTAGGTGAC TCCCAACAAG ACATTGCATC 9360 

ATTAGGTGGT GTTTCAAAAG TATTGATGAA TCATGATCAT GAATCTATAG GAGGTTEAA 9420

TCAAGTTGAA GCCCCTTACT TTATACATGA AAATGATGTG GCTGCACTGA AACATAAGAT 9480

TTCTGTTCAA AAACAATTTA GTAATCGTGT AATGTTGGAT AAGGATTTAG AAGTTATTCC 9540

CGCGCCTGGA CATACACCAG GGACGACACT ATTTTTATGG GATGATGGTC ATCACCGTTA 9600 

CTTATTTACT GGAGATTTTA TATGTTTTGA AGGGAAGAGA TGGCGTACAG TTATATTAGG 9660 

TTCAAGTGAT AGAGAAAAAT CTATTCAAAG TTTAGAGATG GTTAAAGAAT TAGATTTTGA 9720

TGTACTTGTA CCTTGGGTTA CTATCAAAGA TGAACCGTTA GTTTATTTTG TAGAAAATGA 9780

ATATGAAAAA CGTGAACAAA TACAAAATAT TATTGATA& GTACGTGAGG GCGAGAATAG 9840

CTAATTGAAA TATATTGGCG AAgCAATGTA ACGAATCTAA GAAAGCCCTA GAAAATACCT 9900

CCATAATTGA TTGTCATATA AAACAAAAAC GGTAATTTCT ATTTATTGAG ATAGAAATTA 9960

CCGTTTATTT CGTGGACCTA TTGCATTGTT TTTATCATGC ATAATCATCA TTGTCGTTGT 10020

TTGAGTCAAT TTTAATTTTC AGAATCAGAA GGCTGTTCTG GAATTGGGAA ATATTTGAAA 10080

ATTTCACCGC TTTCAATCGC TTCGGTTAAC TGTTCTAACC ATTCGTAATA AACATGTGTA 10140

TGATCAAGCT GAGCTTTAAT TTTTTGTGCC TCTTGTGTTT CAGCTTCAGT TAAATCACTG 10200 

CTTTCAAGTA ATGGATTGAT AATAGCTTGA GCATCTTTTA CTGCTTCGAC ATTGATGTCA 10260 

ATTTCACGCT GGAATTTTTT AGTGAAAAAG TTTCGGAAAA AGATGAAAAA GTCTTTCTCG 10320

GCGATAAAAT GTTGTTTGCG GCTTCCTCTC GTAAATTGTT GTTTAACAAT ATCAAATTCC 10380

TGCAATTTCT TAACGCCAGC ACTCATACTT GGTTTGCTCA TTTGCATTG ATGACGCATT 10440

TCATCAAGCG TCATACTGCC TTCAAACACC ATTGTGCCAT ATAAGTTTCC TACACTTCTA 10500 

TTAGTGCCAT ACAAATCCAT TGTCTGTCCA ATTGAATTAA TTACAATATC TTTTGCTTGT 10560 

TCTAATTGTT GCTGTTTGTT CTGAGAACGA GTCATCATTG CACCTCCGTA CATCATTTTG 10620

GTCACGTTAA AATAAATACT AATACATTAT AAAACCTTTT CTAAAAAAAG ACATTAAAAA 10680

TATTTAAAGC ATTAAAGTTA AATGTTTCGT TAAATAAAAA TCTAACGAAC TTACAAAACT 10740

TAATTCTTGA GTTGTTTTGT AAATTGACAC ATTTTTCATT TCTATGCTAA CATAAGTnTG 10800

TAAAATTcGT TAAATAAAAA TTTAACAERC TTAACGGrGG TTGTTGAAkG GrACTTTTAA 10860

aACATTTATC TCAGCGTCAA TATATTGATG GTGAGTGGGT TGAAAGCGCG AATAAAAATA 10920

CAAGAGATAT TATCAATCCT TACAATCAAG AAGTGATATT TACGGTTTCT GAAGGGACAA 10980

AAGAGGATGC AGAACGTGCA ATCTTAGCTG CAAGACGTGC GTTTGAGTCT GGGAATGGT 11040






CGCAAGAAAC TGCTGAAACA AGAGGTAAAA AAGTACGTGC CATCGCGGAC AAGATTAAAG 11100 

AACATCgCGA AgCgTTAGCA CGATTAGAAA CATTAGATAC TGGAAAAACG TTAGAAGAAT 11160

CATATGCAGA TATGGATGAT ATTCATAATG TGTTTATGTA TTTTGCTGGA TTAGCAGATA 11220

AAGACGGTGG CGAAATGATT GATTCACCAA TTCCAGATAC AGAAAGCAAA ATTGTTAAAG 11280

AACCAGTAGG TGTAGTTACA CAAATTACAC CTTGGAATTA TCCGTTATTA CAAGCATCAT 11340

GGAAAATTGC GCCAGCGCTT GCTACGGGTT GTTCACTAGT TATGAAACCA AGTGAAATTA 11400

CACCATTAAC AACAATACGT GTTTTTGAAT TAATGAAGA AGTTGGTTTC CCTAAAGGAA 11460

CAATTAATCT TATTCTAGGT GCAGGTTCTG AAGTTGGTGA CGTAATGTCA GGTCATAAAG 11520

AGGTTGACCT TGTATCATTT ACAGGTGGCA TTGAGACTGG TAAGCATATT ATGAAAAATG 11580

CTGCTAATAA TGTTACGAAT ATTGCCTTGG AACTTGGCGG TAAAAATCCA AACATTATCT 11640 

TTGATGATGC TGATTTTGAA TTGGCAGTAG ACCAAGCGTT AAATGGTGGA TATTTCCATG 11700

CAGGTCAAGT TTGTTCAGCA GGATCAAGAA TATTAGTACA AAACAGTATT AAAGACAAAT 11760

TTGAGCAAGC ACTTATTGAT CGCGTGAAAA AAATCAAATT AGGTAATGGT TTTGATGCTG 11820

ATACTGAAAT GGGACCAGTG ATTTCAACAG AACATCGTAA TAAGATCGAA TCTTATATGG 11880

ATGTAGcTAA AGCAGAAGGC GCAACAATTG CTGTTGGTGG TAAACGTCCA GATAGAGATG 11940

ATTTAAAAGA TGGTCTATTC TTCGAGCCAA CAGTCATTAC AAATTGTGAT ACGTCAATGC 12000

GTATTGTACA AGAAGAGGTT TTCGGACCTG TCGTTACTGT A&AGGCTTT GAAACTGAAC 12060

AAGAAGCGAT TCAATTAGCG AATGATTCTA TATATGGTTT AGCAGGTGCT GTATTTTCTA 12120

AAGATATTGG AAAAGCACAA CGCGTTGCTA ACAAGTTGAA ACTTGGAACG GTGTGGATTA 12180 

ATGATTTCCA TCCATATTTT GCACAAGCGC CATGGGGTGG ATACAAACAA TCAGGTATCG 12240 

GTAGAGAATT AGGCAAAGAA GGCTTAGAAG AGTACCTTGT TTCAAAACAC ATTTTAACAA 12300 

ATACAAATCC ACAATTAGTG AATTGGTTTA GCAAATAAAA ATTAGATAAG GTGAGTGCCA 12360

TTGTAAGAAC ACAAGACACT CACTTTGTTT TGTATAAGTG GCGAAATGTT GATTGATAAT 12420 

TTGGACTAAA CGCAAAATGA ATCATAGATT ATTTCATTAC TGTTAGTAAC AATCGTAAAA 12480

GGAAAAGCGA GTGTTTTGGT TAGCTAAGTT TAGCAATTCA ACGATAACCA ATCAGCCACT 12540

AACAAATATT TCATGCAATA CTCACTTTGA AATACAACAA ACTTTGGAGG TCATAACGAT 12600

GAGTAACAAA AACAAATCAT ATGATTATGT CATCATTGGA GGAGGCAGTG CAGGTTCTGT 12660

ACTAGGTAAT CGTCTGAGTG AAGATAAAGA TAAAGAAGTC TTAGTATTAG AAGCGGGTCG 12720

CAGTGATTAT TTTTGGGATT TATTTATCCA AATGCCTGCT GCGTTAATGT TCCCTTCAGG 12780

CAATAAATTT TACGATTGGA TTTATTCAAC AGATGAAGAA CCACATATGG GCGGTCGTAA 12840

AGTAGCACAT GCCCGAGGTA AAGTTTTAGG TGGATCAAGT TCGATTAATG GCATGATTTA 12900

TCAACGTGGT AATCCAATGG ACTATGAAGG CTGGGCAGAA CCAGAAGGTA TGGAAACTTG 12960

GGATTTTGCG CACTGTTTAC CGTATTTTAA AAAATTAGAA AAAACATACG GTGCAGCGCC 13020

TTATGATAAA TTTAGAGGCC ATGATGGACC mTTAAGTTA AAACGAGGGC CAGCAACGAA 13080

TCCTTTATTC CAGTCATTCT TTGATGCAGG TGTTGAAGCA GGCTATCATA AAACACCTGA 13140

TGTGAATGGA TTTAGACAAG AAGGTTTTGG ACCGTTCGAT AGTCAAGTAC ATCGTGGTCG 13200

CCGAATGTCA GCTTCAAGAG CATATTTACA TCCAGCGATG AAGCGTAAAA ACTTAAOGT 13260

TGAAACACGT GCCTTTGTAA CTGAAATTCA TTATGAAGGT AGAAGAGCAA CTGGTGTTAC 13320 

GTATAAGAAA AATGGCAAAC TACATACCAT CGATGCTAAT GAAGTCATTT TGTCTGGTGG 13380 

GGCATTCAAT ACGCCACAAT TACTACAATT ATCTGGTATC GGTGATTCAG AGTTCCTAAA 13440 

ATCAAAAGGC ATTCAGCCAC GTGTTCATTT ACCTGGTGTG GGTGAAAACT TTGAAGATCA 13500 

CTTAGAGG 13508 
(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7646 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

GTAAGTATTG TCTTGATTTC CTAATAAAGT TATATCTTGT AATTCATCTT GTTGACGGCC 60 

ATGTGCCATA TAAAGCGCTC CTTTAAATTT ATTTTTTTAT TATTTTGGCG TCTCGGCGTG 120

CTTTTTCAAA CATGTAATAA CTTGCACCGA TAATAACGAC GTAACCTAAT GTTGCATAGA 180 

AATCTGGAGA TTCTCCGAAT AGAATAAATC CAAGTATTGC TGTGAAAATT ATAGATGCAT 240 

ACGTAAAAAT AGAAATATCT TTTGCTGCTG CAAAACTATA TGCTAAAGTA ACACCAATTT 300 

GACCCACAGC GGCAgCTAAG CCAGCCCCTA ATAGATAAAG TATTTGCATC TGACTCATTG 360

GTTCATAAGT ATATGCAGTG AAAGGTATTA AAACGATGAC AGAAAATAAG GAGAAGTAAA 420

ATACTATAGT ATATGGTGCT TyTCTTGTAC TAAGTGCTCG AACACATGTA TATGCTGATG 480

CTGCAAAAAT ACCTGA3AAT AAGCCAGCTA ATGATGGAAT CATAGATGAT GAAAATTCAG 540

GTTTCACTAT TAAnAGCAaC CTAAAATAGC AATTATCATT GCTGTAATTT GaTACTTCCT 600

TACCTTTTCA TGtAAGAaaA CAATGCTTaA TAAAATCGTC CAGAAAGGAT TGAGTTTCAT 660

TAATGAATCG GCATCACTAA GTACCATATG ATCAATGGCA AAATATTTA ACAATACACC 720

AATAAGTCCA AGTGTTGATC GTGTTATTAA TAAGGGTTGA CTTGAAAGTC TGCCAAACAT 780

TGGCTGATGG TATTTATATA TAAAAAATAA TGGAATAAAC ATTGCTACTA AGTTTCGTGC 840

TAATGATTTT TGAAAAACAG GAAGGTCACC TGCAAGTCTG AAAAACACTG ACATAAAACT 900

GAAACCAATA GCCGAAATTA AAATGGCAAT GATACCTTTT ACTTTAGGAT TCAATTTTAT 960

CGCCTCTTTT ATATAAAATT AACGTATTTA TATTAGCATA AAACAACATG TTGTGCATAA 1020

ATAGTTGAAA TTTACTATAA AAAGACTATA ATAGACTGTA GCGAACAAAC GTTCTGTGTT 1080

TATTTGTCGG AATAATAGGG CATTACACTT TTATGAATGT TTGTGTTATT ACATAAAACA 1140

AATATCAATT CAGTATCAAG CTAATAAGCT TTTTCTTGAT TTCTGTTGAT ACAATTGAGA 1200 

TTGACACAGA TTTAAAAAAA TCAAGTGATA TCTACTAAAA AATTTTTTTA AATTTGTTCA 1260 

AGTTTTTCTA ATTTAGTATT GGTGCCTAGT TGGAACGTTT TACGAACAT C GAT TAG AAA 1320 

ATGGCACTTT AAATCATAGT GTGTCTTATG TATAATGAAA CACATAATAT AGTGTTGGTG 1380

AAACGAAAAA gACACAATAT CTTGTGTTTT GTATGCAAAT GCTTTATTTA TGAAGAAATT 1440

ACATTTAAAA GTAATTTAAC ACAGAAATTT AATAGTTATT ATCAATTAAT AGTCATATTT 1500

TTAGAAAATG TACTGAGCAA ATGGAAGATA TCCAATGATG TAAACACTAC ATATAGTGAT 1560

TTTTATACAT TCAACCCATA TAAGCTACTA TTTTCTCAAA TATAAATCTA TGCAATTGGT 1620

TTACATTTGA GAAAATAAGT AGCTTCATTA TAGTTAATAC AATGCTGAGA TAACCATAGT 1680

AACCATGTTG TTAAAGCATT TTTTAATTGG AATGACTACT TTATTTAAAA GGGTTGAAGA 1740

AAGAAGGTGA TCCAATGAAA ATAATATATT TTTCATTTAC TGGAAATGTC CGTCGTTTTA 1800

TTAAGAGAAC AGAACTTGAA AATACGCTTG AGATTACAGC AGAAAATTGT ATGGAACCAG 1860

TTCATGAACC GTTTATTATC GTTACTGGCA CTATTGGATT TGGAGAAGTA CCAGAfflCCG 1920 

TTCAATCTTT TTTAGAAGTT AATCATCAAT ACATCAGAGG TGTGGCAGCT AGCGGTAATC 1980

GAAATTGGGG ACTAAATTTC GCAAAAGCGG GTCGCACGAT ATCAGAAGAG TATAATGTCC 2040

CTTTATTAAT GAAGTTTGAG TTACATGGAA AAAACAAAGA CGTTATTGAA TTTAAGAACA 2100

AGGTGGGTAA TTTTAATGAA AACCATGGAA GAGAAAAAGT ACAATCATAT TGAATTAAAT 2160

AATGAGGTCA CTAAACGAaG AGAAGATGGA TTCTTTAGTT TAGAAAAAGA CCAAGAAGCT 2220

TTAGTAGCTT ATTTAGAAGA AGTAAAAGAC AAAACAATCT TCTTCGACAC TGAAATCGAG 2280

CGTTTACGTT ATTTAGTAGA CAACGATTTT TATTTCAffG TGTTTGATAT TTATAGTGAA 2340

GCGGATCTAA TTGAAATCAC TGATTATGCA AAATCAATCC CGTTTAATTT TGCAAGTTAT 2400

ATGTCAGCTA GTAAATTTTT CAAAGATTAC GCTTTGAAAA CAAATGATAA AAGTCAATAC 2460

TTAGAAGACT ATAATCAACA CGTTGCCATT GTTGCTTTAT ACCTAGCAAA TGGTAATAAA 2520

GCACAAGCTA AACAATTTAT TTCTGCTATG GTTGAACAAA GATATCAACC AGCGACACCA 2580

ACATTTTTAA ACGCAGGCCG TGCGCGTCGT GGTGAGCTAG TGTCATGTTT CTTATTAGAA 2640

GTGGATGACA GCTTAAATTC AATTAACTTT ATTGATTCAA CTGCAAAACA ATTAAGTAAA 2700

ATTGGGGGCG GCGTTGCAAT TAACTTATCT AAATTGCGTG CACGTGGTGA AGCAATTAAA 2760

GGAATTAAAG GCGTAgCGAA AGGCGTTTTA CCTATTGCTA AGTCACTTGA AGGTGGCTTT 2820

AGCTATGCAG ATCAACTTGG TCAACGCCCT GGTGCTGGTG CTGTGTACTT AAATATCTTC 2880

CATTATGATG TAGAAGAATT TTTAGATACT AAAAAAGTAA ATGCGATGA AGATTTACGT 2940

TTATCTACAA TATCAACTGG TTTAATTGTT CCATCTAAAT TCTTCGATTT AGCTAAAGAA 3000 

GGTAAGGACT TTTATATGTT TGCACCTCAT ACAGTTAAAG AAGAATATGG TGTGACATTA 3060

GACGATATCG ATTTAGAAAA ATATTATGAT GACATGGTTG CAAACCCAAA TGTTGAGAAA 3120

AAGAAAAAGA ATGCGCGTGA AATGTTGAAT TTAATTGCGC AAACACAATT ACAATCAGGT 3180

TATCCATATT TAATGTTTAA AGATAATGCT AACAGAGTGC ATCCGAATTC AAACATTGGA 3240

CAAATTAAAA TGAGTAACTT ATGTACGGAA ATTTTCCAAC TACAAGAAAC TTCAATTATT 3300 

AATGACTATG GTATTGAAGA CGAAATTVAA CGTGATATTT CTTGTAACTT GGGCTCATTA 3360 

AATATTGTTA ATGTAATGGA AAGCGGAAAA TTCAGAGATT CAGTTCACTC TGGTATGGAC 3420

GCATTAACTG TTGTGAGTGA TGTAGCAAAT ATTCAAAATG CACCAGGAGT TAGAAAAGCT 3480

AACAGTGAAT TACATTCAGT TGGTCTTGGT GTGATGAATT TACACGGTTA CEAGCAAAA 3540

AATAAAATTG GTTATGAGTC AGAAGAAGCA AAAGATTTTG CAAATATCTT CTTTATGATG 3600

ATGAATTTCT ACTCAATCGA ACGTTCAATG GAAATCGCTA AAGAGCGTGG TATCAAATAT 3660 

CAAGACTTTG AAAAGTCTGA TTATGCTAAT GGCAAATATT TCGAGTTCTA TACAACTCAA 3720 

GAATTTGAAC CTCAATTCGA AAAAGTACGT GAATTATTCG ATGGTATGGC TATTCCTACT 3780

TCTGAGGATT GGAAGAAACT ACAACAAGAT GTTGAACAAT ATGGTTTATA TCATGCATAT 3840

AGATTAGCAA TTGCTCCAAC ACAAAGTATT TCTTATGTTC AAAATGCAAC AAGTTCTGTA 3900

ATGCCAATCG TTGACCAAAT TGAACGTCGT ACT ATGGTA ATGCGGAAAC ATTTTACCCT 3960

ATGCCATTCT TATCACCACA AACAATGTGG TACTACAAAT CAGCATTCAA TACTGATCAG 4020

ATGAAATTAA TCGATTTAAT TGCGACAATT CAAACGCATA TTGACCAAGG TATCTCAACG 4080

ATCCTTTATG TTAATTCTGA AATTTCTACA CGTGAGTTAG CAAGATTATA TGTATATGCG 4140

CACTATAAAG GATTAAAATC ACTTTACTAT ACTAGAAATA AATTATTAAG TGTAGAAGAA 4200

TGTACAAGTT GTTCTATCTA ACAATTAAAT GTTGAAAATG ACAAACAGCT AATCATCTGG 4260

TCTGAATTAG CAGATGATTA GACTGCTATG TCTGTATTTG TCAATTATTG AGTAACATTA 4320

CAGGAGGAAA TTATATTCAT GATAGCTGTT AATTGGAACA CACAAGAAGA TATGACGAAT 4380

ATGTTTTGGA GACAAAATAT ATCTCAAATG TGGGTTGAAA CAGAATTTAA AGTATCAAAA 4440

GACATTGCAA GTTGGAAGAC TTTATCTGAA GCTGAACAAG ACACATTTAA AAAAGCATTA 4500

GCTGGTTTAA CAGGCTTAGA TACACATCAA GCAGATGATG GATGCCTTT AGTTATGCTA 4560

CATACGACTG ACTTAAGGAA AAAAGCAGTT TATTCATTTA TGGCGATGAT GGAGCAAATA 4620

CACGCGAAAA GCTATTCACA TATTTTCACA ACACTATTAC CATCTAGTGA AaCAAACTAC 4680

CTATTAGATG AATGGGTTTT AGAGGAACCC CATTTAAAAT ATAAATCTGA TAAAATTGTT 4740

GCTAATTATC ACAAACTTTG GGGTAAAGAA GCTTCGATAT ACGACCAATA TATGGCCAGA 4800

GTTACGAGTG TATTTTTAGA AACATTCTTA TTCTTCTCAG GTTTCTATTA TCCACTATAT 4860

CTTGCTGGTC AAGGGAAAAT GACGACATCA GGTGAAATCA TTCGTAAAAT TCTTTTAGAT 4920

GAATCTATTC ATGGTGTATT TACCGGTTTA GATGCACAGC ATTTACGAAA TGAACTATCT 4980

GAAAGTGAGA AACAAAAAGC AGATCAAGAA ATGTATAAAT TGCTAAATGA CTTGTATTTA 5040

AATGAAGAGT CATACACAAA AATGTTATAC GATGATCTTG GAATCACTGA AGATGTGCTA 5100 

AACTATGTTA AATATAATGG AAACAAAGCA CTTTCAAACT TAGGCTTTS ACCTTATTTT 5160 

GAGGAACGTG AATTTAACCC AATCATTGAG AATGCCTTAG ATACAACAAC TAAAAACCAT 5220

GACTTCTTCT CAGTAAAAGG TGATGGTTAT GTATTAGCAT TAAACGTAGA AGCATTACAA 5280

GATGATGACT TTGTATTTGA CAACAAATAA CAATTAAATT AAAAGACCTT CACATGTAAA 5340

GGGAAATAGC GATTCGTTTC GTCTTGTCTC CTACATGTTG AAGGTCTTTT TTTATGTGTA 5400

TCTAACTCAT TATGAGTCTG AGTAAGAAAT CAATGCTCTA AGATGTACAA TGCTATTTAT 5460

ATTGGCAGTA GTTGGCGGGG CCCCAACACA GAAGCAGGCG GAAAGTCAGC TAACAATATT 5520

GTGCAAGTTG GCGGGGCCCC AACATAGAAG CAGGCGGAAA GTCAGCTAAC AATAATGTGC 5580 

AAGTTGGCGG GGCCCCAACA TAAAAGCAGG CGGAAAGTCA GCTAACAATA TTGTGCAAGT 5640

TCGGgCGGGG CCCCAACATA AAGAAAAACT TTTTCCTTTA GAAATTATCA CTTCCaCaTG 5700

AGTTTTACTC ATGTATTCCT ATTTTTAAGT ACACATTAGC TGAGGCTAAT GTTAAEACC 5760

ACTACTTAAT CAATCATTAG TAGTTTTTAT CATTTCCACT ATTCCCaGAC ATCaAAATCT 5820

TAAGTGTTCT ATTTTACTTT AAGTAAACAA AATACACATT CCGAAAAATT AAATTTCAGT 5880

TTAATTGCAA ATATCAATAA AATTGACACT AAATTATTTG AAAGGCTATT GAAATTATGG 5940

TCAAAAAACG CTACTATTAA TGAGAAATAT TATCAATGAT AATGATTATC ATTAATTTAA 6000

AGGGAGAAAA ATTTGTAATG AAGTATTTAT TAAAGGGAAA TATTTTGCTT CTATTACTAA 6060

TATTGTTGAC AATTATTTCG TTGTTCATAG GTGTGAGTGA ACTATCAATT AAAGATTTAC 6120

TACATTTAAC TGAGTCACAG CGGAATATTT TATTCTCffi.G CCGAATACCA AGGACGATGA 6180 

GTATTTTAAT TGCTGGAAGT TCGTTGGCTT TAGCAGGCTT GATAATGCAA CAAATGATGC 6240

AAAATAAGTT TGTTAGTCCG ACTACAGCTG GAACGATGGA ATGGGCTAAA CTAGGTATTT 6300

TAATTGCTTT ATTGTTCTTT CCAACCGGTC ATATTTTATT AAAACTAGTA TTTGCTGTTA 6360 

TTTGCAGTAT TTGCGGTACG TTTTTATTTG TTAAAATCAT TGATTTTATA AAAGTGAAAG 6420

ATGTCATTTT TGTACCGCTT TTAGGAATTA TGATGGGTGG GATTGTTGCA AGTTcACAAC 6480

CTTCATCTCA TTGCGCACGA ATGCTGTTCA AAGCATTGGT AACTGGCTTA ACGGGAACTT 6540

TGCCATTATC ACAAGTGGAC GCTATGAAAT TTTATATTTA AGTATTCCTC TTTTAGCATT 6600

GACATATCTT TTTGCTAATC ATTTCACGAT TGTAGGAATG GGTAAAGACT TTACTAATAA 6660

TTTAGGTTTG AGTTACGAAA AATTAATTAA CATCGCATTG TTTATTACTG CAACTATTAC 6720 

AGCATTGGTA GTGGTGACTG TTGGAACATT ACCGTTCTTA GGACKGTAA TACCAAATAT 6780

TATTTCAATT TATCGAGGTG ATCATTTGAA AAATGCTATC CCTCATACGA TGATGTTAGG 6840

TGCCATCTTT GTATTATTTT CTGATATAGT TGGCAGAATT GTTGTTTATC CATATGAAAT 6900

AAATATTGGT TTAACAATAG GTGTATTTGG AACAATCATT TTCCTTATCT TGCTTATGAA 6960

AGGTAGGAAA AATTATGCGC aACAATAATA AAAAAATAAT GCTTTTAATT GCAGTAACGT 7020

TATTAATTAG TATGCTGTAC TTATTTGTAG GTATTGATTT TGAAATATTT GAATATCAAT 7080

TTTCAAGTCG TTTAAGAAAG TTCATATTAA TTATTTTAGT AGGTGCTGCC ATTGCAACTT 7140

CAGTGGTGAT TTTTCAAGCG ATTACAZATA ACCGTCTATT GACACCATCA ATAATGGGGT 7200

TAGATGCAGT TTATTTATTT ATCAAAGTAT TGCCAGTCTT TTTATTTGGA ATTCAATCGG 7260

TATGGGTTAC TAATGTATAT TTGAACTTTA TATTAACACT TATAACGATG GTGTTATTCG 7320

CACTAATCCT ATTCCAAGGT ATCTTTAAAA TCGGACATTT TTCAATTTAT TTATCTTAC 7380

TTATTGGTGT CCTTTTAGGA ACATTTTTTA GAAGCATAAC AGGTTTTATT CAACTGATTA 7440

TGGATCCTGA GTCATTTTTA GCAATACAAA GTAGTATGTT TGCTAATTTT AATGCTTCTA 7500

ATTCGAATTT AGTTACTTTC TCAGCAGTGC TATTAGTAAT CTTATTAGTC ATTACAATTT 7560

TACTATTGCC TTATTTAGAT GTATTGCTTT TAGGTCGTGC TGAAGCAATT AATCTTGGGA 7620

TATCGTATGA AAAATTAACG CGAATT 7646
(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1194 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

ATGAATATAT TTnnAAATAA ATTATTATGG ATTGCACCAA TnGCCACTAT GATTATCTTG 60

GTAATCTTTT CTTTAGCTTT TTATCCTGCA TATAATCCTA AACCAAAAGA TTTACCAATT 120

GGTATATTAA ACGAGGATAA AGGTACAACG ATTCAAGATA AAAATGTTAA CATTGGTAAA 180

AAATTAGAGG ATAAATTATT AGATAGTGAT TCTAATAAAA TTAAATGGGT TAAGGTTGAT 240

AGTGAAAAAG ACCTTGAAAA AGATTTGAAA GATCAAAAAA TCTTTGGAGT AGCTATATT 300

GATAAAGACT TTTCAAAAGA TGCTATGAGT AAAACACAAA AAGTAGTTAT GGATAGTAAA 360

AAAGAAGAAA TGCAACAAAA AGTTGCTTCA GGTGAAATTC CGCCACAAGT GGTTCAACAA 420 

ATGAAACAAA AAATGGGGAA TCAACAAGTA GAGGTTAAGC AGGCTAAATT TAAAACGATT 480

GTAAGTGAAG GATCAAGCTT ACAAGGTTCA CAAATTGCAT CAGCTGTGTT AACTGGTATG 540

GGTGATAATA TTAATGCTCA AATTACGAAG CAAAGTTTGG AAACATTAAC GAGTCAAAAT 600

GTTAAAGTCA ATGCCGCGGA CATCAATGGT TTGACGAATC CAGTAAAAGT GGATAATGAA 660

AAACTTAATA AAGTTAAAGA TCACCAAGCA GGTGGT.&.TG CACCATTCCT AATGTTTATG 720 

CCAATTTGGA TAGGTTCAAT CGTAACGTCT ATCTTATTGT TCTTTGCATT TAGAACTAGT 780

AACAATATCG TCGTGCAACA TCGTATCaTT GCtTCAATTG GACAGATGAT ATTTGCAGTT 840

GTTGCAGCAT TTGCAGGTAG CTTTGTTTAT ATTTATTTCA TGCAAGGCGT TCAAAGATTT 900

GATTTTGACC ATCCAAATCG TATCGCAATT TTTGTAGCAT TTGCGATTCT TGGTTTCGTG 960

GGCCTTATTT TAGGTGTTAT GGTATGGCTA GGTATGAAGT CAGTTCCAAT TTTCTTCATT 1020

TTAATGTTCT TTAGTATGCA ACTTGTAACG TTACCTAAAC AAATGTTGCC TGAAAGTTAT 1080

CAAAAATATG TATATGATTG GAATCCATTC ACACACTATG CAACAAGTGT AAGAGAcTAT 1140

TATACTTGAA TCATCATATT GAATTAAATA GTACAATGTG GATGTTTATA GGGT 1194 



(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 558 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

GACCGACCTA TACATCCGTA TAAGTATTTC TTGATATAAG TCTTCTAAAT CATAATGATT 60 

AAATCCAAAT GTTTTGATGC GTCGAATAAT TAATGGTTGT AGATCCATTA CTAACTTTTC 120 

GAAGTCTTTA TCATTTATTG CATTGTTACG TTTGATTGTA CTGTCTTGAG TTGTCAAATC 180 

GTATTTCAAA TATTAAACTA ACCCCTTCTA TCTAAAATTT AAGGTTAGTT TAATATTGTT 240

ACATTCAAAA TTTCAAGATG ACGGAAATGT CATTTCTTAT GATGTCCTCT TCGTATTTTT 300

TCAAATTCTG CAAGGATTTC AGAAGATAAC GGAATTCGAG TTCTTGGCTT GTTTTCACTT 360

ATATCATCTA ATGATTTACT CACATCAATT TCATTTTCTT TTAAATCTCT CCACATTTCG 420

CGAGATGATA TTCTATATGC ACCTGATCCA AAGATAGCAT GTTGcTCACT CaTATCACTT 480

GTTACAACTG TAATATGcTT AGtATGCTTG tCaTAAAGtT CaTAAACCAT AACGGTTCTA 540

ATGGAAACCA ATCAGCTG 558



(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

GCTTCAGACA TnTGATGATA TAATCTCTCA TCATCGATTA ATTCTTTTGC AGCTTGATAC 60 

ACATnTTGCT TATTTGTTCC AATGACTTTT AATGTGCCAG CTCAACACC TTCAGGACGT 120 

TCTGTAACAC TTCGCCAAAA CTAAAACTGG CTTATTAAAT GATGGCGCTT CTTCCTGAAT 180

TCCACCTGAA TCTGTCAAAA TAAAATAAGA TTTTnTAGCA AAATTATGGA AATCTATACG 240

TCCAAAGGTT CAATCAATTC AATTCTGTCA TGACTACCTA AAATCTTTTG AGCCACCTCT 300

CGAACTTTCG GGTTTTTATG CATTGGATAT ACCAGTGCTA AATCAGTATA CTCATCTATT 360

AAGCGTCTAA CCGCTTTAAA TATATTTTCC aTGGGTTTCC CGATATTTTC TCGTCGGTGT 420

GCTGTCATrA GAATGAATTT kTtGTCATGG TATTTATCCA TGATGTTAGA TTTATAATTG 480

TCATCAACTG TATATTTCAT AGCAXAATC GCAGTATTAC CAGTGACAAC AACACTTTCT 540

GAATATTTCC CTTCACTTAA CAAATGCGAT GCAGCATTTT TAGTAGGTGC AAAATGTAAG 600 

TCAGCTAATA CACCAACTAA TTGTCTATTC ACCTCTTCTG GAAAAGGTGA ATATTTATCA 660 

TAACTTCTAA GCCCTGCTTC AACGTGTCCA ATCGGCACTT GGTTATAAAA TGCCGCTAAA 720

CCACCTGCAA ATGTCGTCAT CGTATCACCA TGTACAAGTA CCATGTCTGG TTTTTCTAAT 780

TGAATCACTT GTTCTAATTG AGTGATTGAT TTAGAAGTTA TCTCAGAAAG TGTCTGTCCT 840

GATTTCATAA TATTCAAATC GTATTTTGGT TTGATTTCAA AGGTACTTAA TACTGAATCA 900

AGCATTTCTC TATGCTGTGC TGTAACAACA ACAATTGGCT CGAGCATTTT TTCTTGTTCC 960

AAAGCTTTAA CTAAAGGAGC CATCTTTATG GCTTCGGGTC TTGTTCCAAA TATGGTCATA 1020 

ATCTTTTTCA TCAAACTACT TATCTCCGAT TCTTCTATTT AGTACCAAAC AATCTATCTC 1080 

CAGCGTCGCC TAACCCTGGT GTGATATATG CTTTGTCATT aGCTTTTCAT CAAGTGCAGC 1140

AATATAAATA TCTACATCTG GATGTGCTTC ATGCATCTTT TCTACGCCTT CTGGTGCTGC 1200

AATTAAACAC ATGAAGCGAA TATTTTTAGC GCCACGTTTC TTCAATGAAG TAATAGCTTC 1260

AATTGCTGAT GCGCCTGTTG CTAACATAGG ATCAACAACA ATGATTTGTC TTTCAGTffiT 1320

ATCTTGAGGT AACTTAGCAA AATACTCTAC AGCCTTTAAT GTTTCGGGAT CTCGATATAA 1380

ACCGATATGT CCAACTCTGG CTGCAGGTAC TAAACTTAAA ATACCATCAG TCATACCTAA 1440

ACCAGCTCTT AAAATTGGAA CGATAGCTAA TTTTTTACCA GCTAATCGTT TAGCCGTCAT 1500 

TTTAGTTACA GGCGTTTCAA TATCAACATC CTGAAGCTCT AAGTCTCTAG TTACTTCATA 1560 

TGCCATCAAC ATACCAACTT CGTCTACAAG TTCTCTAAAT TCTTTAGTAC CTGTATTTAC 1620 

ATCTCTAATA TAGCTTAGTT TGTGTTGAAT TAATGGATGA TCGAAAACGT GTACTTTACT 1680 

CATAAAAATT ACTCCTATCT TTGTGTATGT TTATTGATAT AGAGGATATT CAGCTGTTAA 1740 

TTTCGCAACG CGTTCTTTAG CTTGTTGTAA TTTTTCTTCA TCTTTACTAT TTTTCAATGC 1800 

TAAACTGATG ATTTTTGCAA CTTCCTCAAA AGCTTTTTCA TCAAATCCAC GCGTTGTTGC 1860

AGCAGGTGTA CCTAAACGTA TACCACTCGT TACAAAAGGT TTTTCTTGAT CGAACGGAAT 1920

GGTATTTTTG TTACATGTGA TACCAACTGA ATCTAAAGTC TCTTCAGCTT CTTTACCAGT 1980

AAGTCCTATA GACCCTTTTA CATCAACAGC TACTAAGTGA TTATCTGTAC CGCCAGAAAC 2040

AATTCTAAAT CCTTCATTAA TTAATGCTTC TGCAAGAACT TTTGCGTTTT TAACCACTTG 2100

TTGTTGATAC GTTTTGAAAT TATTTTCTAA CGCTTCTCCA AAAGCAACTG CTTTtGCTgC 2160

AATAACATGC TCAAGAGGTC CACCTTGAAT ACCAGGGAAA ATTGTTTTAT CTATGTCTTT 2220

TTTATATTCT TCCTTACATA AAATCATACC ACCACGtGGT CCGcGTAATG TTTTGTGTGT 2280

TGTAGTTGTT ACAAAATCAG CATATTCTAC TGGATTTGGA TGTAAACTG CCGCTACTAA 2340 

TCCTGCAATA TGTGCCATGT CTACCATTAA CTTAGCGTTT ACTTCATCTG CGATTTCTTT 2400 

AAACTTTTTG AAGTCAATTG TTCTTGAATA TGCTGATGCT CCTGCCACAA TAAGCTTAGG 2460

CTTATGCTCT AACGCTAATT TACGAACTTC ATCATAATTG ATTCGTTCTG TGTCTTTATC 2520

TACTCCATAT TCAACGAAAT TGTAGAATTT ACCACTAAAA TTAACAGGCG CTCCATGTGT 2580

CAAGTGACCA CCATGACTCA AATTCATACC TAAAACTGTG TCGCCCATTT CTAATGCAAC 2640

TAAGTAAACA GCCATGTTCG CTTGTGAACC TGAATGTGGT TGAACATTGA CATGTTCAGC 2700

TCCAAACAAT GCTTTAGCAC GATCAATTCE GATGCTTTCA GTAACATCTA CAAACTCACA 2760

GCCACCATAA TATCGGCGTC CTGGATAGCC TTcAGCATAC TTATTAGTCA ACACTGAACC 2820 

TTGTGCTTCC ATAACCGCTT CCGATACAAA ATTTTCCGAT GCGATTAACT CTATGTTGCT 2880 

ATTTTGTCTC TGAAATTCTC TCTCGATTGC TTCTGCGATA ACTTTATCTT GCTGTGTGAT 2940

ATAAGACATA AAATCTCCCC TTCTTTCAAA AAAACTTATT GGTATTTAGC ACGTTCGCCA 3000

CCAATCTTTT TCGGCCTAGA TGTGGCAATA GTTACAATTG CCTGTCCTAC TTGCTTTACT 3060

GAGGTCCTTA CAGGTACACA TACATGTTTA ATATGCATGC CTATTAACGT TTGACCAATA 3120

TCAATTCCAC AAGGAACAGT AATATGTTCG ACCACGATCG GATCCTTCAT ATGCTGAAAA 3180

GCGTATGTTG CCAAACTCCC TCCAGCATGT ACATCTGGAA CGACGGAAAC TTCTTCCATT 3240

GTTAATGGAT TATACTGAGA TTTTTCTATT GTTATCGCTC TGTTGATATG TTCACATCCT 3300

TGAAAAGCAA AAGTAACGCC TGTCTCTTTA CTCAC^CAT CTAATGCATT AAAAATAGTT 3360 

TCTGCAACTT CCaTCGAACC GACAGTCCCT ATTTTTTCGC CAATGACTTC CGATGTTGAA 3420 

CATCCAATTA AACATATATC TCCTTTATTA AAAAAGGACA TATCTTTTAA TTCGTCTAAT 3480

AACATTGTCA AATCTTTCAT AAAAGCCCAC CCTTCCTAAA AATAAAAAAG GAATATAGCA 3540

AAGTGCTACA CTCCTCTATT ATAACTTATT TAACTGTTAA CATATACTAA TTATACAGAA 3600

TTCCTACTAG CAAATAATAT CTTTTAATTT TAAAATTAAA CTTACAAGTT CTTCATAGGT 3660

ATGTACATAG ATTTCTTTTG TTCCACCGTA TGGATCTATA ACTTCTCCTG CTTCTTTtAC 3720

ATATTCATGC AATGTGAfiAA CATGATTTTG CAAACCAAAG TGTGCCTCTA TTAATTCTTT 3780

GTGCGAATAC GACATCGTCA AAATAATATC TGCTTTCAAA TCTGCTTCAG TAAATTGTTG 3840

CGATAAGGTC GTTTCAGCTA AATGATGTTC TTCAACTAAG TCTTCAACAT AATTCGAAAC 3900 

ACCTTGATTG TTCACAGCGA ATATACCTCT TGATTCAAAT TGffiGATTTG GCATAACCTC 3960 

TTTTGCAATA CTTTCCGCTA ATGGGCTACG ACATGTGTTA CCTGTACAAA CGAATAAAAT 4020 

CTTCATAGTT CACATCCTTT AATAATGTGA TTACCTGCAG CTTTTAACAT GCGATTCATA 4080

ATTGCTTCTG TATTATCATT CAGCTCAAAG CCGTATATAT ACGCCGCTGA AATATTTTCA 4140

TTTTCATCAA GTGAATGTAA CACATCATAA AGATTATGAC TTGCTTGTTT AACATCATTG 4200

TCATCCTGAC ATAATTGAAT GAATTGCGCT TCACTTGGTA TAAACGCCAC CTTATTACTC 4260

GGCACAATAA AAGCTATAGA AGACCAATCT TTACCGTCAT TTCCAATTTT GCTCTCAATA 4320

TCTGTAATAA TTGTAAGTGG TGTATTGGGT GAGTAATGCT TATACTTCAT ACCTGGTGCA 4380

ATTGGCTGTT CAGTATCATT ATAATCAGCA TGGGCGATAC TATTCGGAAG TATTTCTGTA 4440

ATCATTGCTG CTGTTATAGA ACCAGGTCTT GCAATTTTAT AAGGAAAAGA TGTGCAATCT 4500 

AAAACCGTAC TTTCTAATCC TTCTTCACTT TGTTCAGCTT GAACAATACC ATCGATACGG 4560

CCATTCAAAT CTTGATATAC ATGATTGAAA GTTGTTGGTG AAGGTCTACC ACTTAAATTA 4620

GCACTTGGAG CAGCTAGAGG TTCATTTATG ATTTGTAATA ATTGTCTACC TACAGAATGG 4680

CTTGGCATTC TAACAGCAAC TGATGATAAA CCTCCAGAAA CTTTTCGACA TAGATAGCCT 4740

AGCTTTAACG GCAATATAAA CGAAATAGGG CCCGGCCAGA ATGCCTGCAT TAACTTTTCT 4800

ACGCGTGGAT CCAAAGTATA TGTAAAATCT TTTAATTGAC CTTTACTGTG TATATGAACA 4860

ATAAGCGGAT TGTCAGATGG ACGGCCTTTA GCTTCATATA TTTTAGCTAC AGCTTCTTCA 4920

TCTGTCGCAT TTGCTGCAAG TCCATAAACT GTTCAGTTG GTAAACCTAT TAAACCACCG 4980

TTTAAAACAA TGTCTTTTAT TTCATTAATT TTAGGATATT GCTGTAAATC TTCATTATAT 5040

TCTCTAACAT CCCAAATTTT AGTATCCAAC TTAATCACGC CTTTCTTATT TATCATAATA 5100 

TAAAGCAAAA AGCTATGCAC TTAACTAATC ATAGCAAAGG CATAACTTCT AATTACCAT 5160 

TAAATGAGAC GATTCGATCG TGGCCATTTA TATCTTTAAT AATGTCGATT TTTTTGTCAG 5220

GAAATTTATT TAAAATTATT GATTTAAGTG CCTCACCTTG ATTGTAACCA ATTTCAAAAA 5280

CAACTGGGCT GCCTTTTTCC ATAACGTGAG GTAAATCTTC AATGATTGAT TCATAAATAG 5340

CATATCCATG GTTATCTGCA AACAATGCCT GATGTGGTTC GAATCTCGTA ACCGTTGGAG 5400

ACATCGTAAC CATATCTTTT TCATCTATAT ATGGTGGATT AGATATCAAG CCGTTCAACT 5460

TGATACCTTC ATTAATTAAG GGCTTTAATG CATCCCCTGT TAAAAATTGT ATTTGTGATT 5520

GATGCTTCTC AGCATTATTA CGAGCCATAT TCATTGCTTC AAGTGAAATA TCAGTAGCAA 5580

TAACATTTAA ATCCGGCTTT TCACATTTCA AAGTAATTGC AAGTACACCA CTACCCGTTC 5640

CGATATCTAC GATTGTTGCA TCATCTTCTA ACTGTTGTAA GAAATGCAAC ATTACTTCTT 5700

CAGTTTCAGG TCTTGGTATC AAACAATTTG AGTTTACATC AAACGTTCTA CCATAAAATG 5760

AGGCAAAGCC AACTATATAC TGTATAGGCT CTCCTAATAA CATACGTTGT AATGCTAAGT 5820

CGAACTTCAT AATCATCGCT TTCGGCATAT CATCATGCAT GTGGACTACA AAGTCCGTAC 5880

GCGTCCATTG AAATACATCT AACATTAACC ATTCAGCTCG TGTTTGTTCA AACCCTTTTT 5940

GTTGTGTTAA ATGAATTGCT TCATCTAACT TTTCTTTATA ATTCACCATT ATTAAGTTCT 6000 

TTCAATTTAT CTGTCTGCTC TGATAAAGTC AGTGCATCTA TAATTTCTTC TAAATGGCCT 6060

TCCATAATTT GCCCTAATTT TTGAAGCGTT AGACCTATAC GATGGTCTGT TACACGGCTT 6120

TGTGGATAAT TATAAGTTCG AATACGTTCT GAACGATCAC CAGTACGAC TGCTGATTTA 6180

CGTTGTGACG CATACTTTTG TTGTTCTTCT TGAACTTTCA TATCGTATAA ACGTGCTTTT 6240

AACACTTTCA TTGCTTTTTC ACGGTTTTGA ATTTGAGACT TCTcAGAAGA TGTTGCAATG 6300

ACACCAGTTG GTAAATGGGT AATACGTACT GCAGAGTCAG TTGTGTTTAC GTGCTGACCA 6360

CCTGCACCAC TTGAACGATA CGTGTCGATT TTTAAATCTT CATTTCTAAT TTCAATTTCT 6420

ACATCTTCAA CTTCTGGTAA AACTGCCACT GTAGCTGTTG AAGTATGAAT ACGTCCACCT 6480

GATTCTGTTT CAGGCACACG TTGAACGCGG TGCGCACCAT TTTCAAATTT CAATTTACTA 6540

TACGCGCCAT TACCAGAAAC TGAGAAACTA ATTTCTTTGT AACCACCATG GTCACTTTCA 6600

GACGCTTCTA CTATTTCAGT TTTGAATCCT TGTGATTCAG CATACTTTGA ATACATACGC 6660

ATTAAATCAC CAGCAAAAAT CGCAGCCTCA TCACCACCTG CTGCTGCTCT TATTTCTACA 6720

ATAACGTCTT TGTCATCATT AGGATCTTTA GGAATCAATA ATATTTTAAG CTCTCTTCA 6780

AGATTTGGAA GTTCAGCTTT AATACCATTA CTCTCCTCTT TTAACATTTC TACTTCTTCT 6840

TTATCATCAG TCTCACTTAA CATTTCTTCA ATATCAGCTA ATTCTTCTTT TTTAGCTTTA 6900

TAGTTACGAT AAACATCTAC AGTTTTTTGT AAATCAGCTT GCTCTTTAGA ATATTTACGT 6960

AATTTATCTG AATCATTTAC AACATCTGGG TCACTTAACA GTTCATTTAA CTGTTCGTAT 7020

CTTTCTTCTA CAATATCTAA TTGATCAAAC ACTTATAATT CCTCCTTATT ATTATCACTA 7080

GGTGCTACGA TATGGTGCGC GCGACAACGT GGCTCATAAC TTTCATTGGC ACCTACTAAG 7140

ATAATCGGAT CATCGATTTT AGCTGGTTTA CCATTATTA ATCGTTGCGT TCTACTAGAT 7200 

GAAGAACCAC AAACAGCACA AACTGCTTGA AGTTTCGTTA CTTGTTCACT GACAGCCATC 7260

AATTTAGGCA TTGGTTCGAA CGGTTCGCCC CTAAAATCCA TATCTAATCC AGCAACAATA 7320

ACACGGTGTC CATCTGCTGA TAGTTTTTCT ACTATACTTA CAATTTCATC GTCAAAAAAT 7380

TGCAcTTCGT CTATTCCTAT AACATCAACA TTAGTTAAGT CGTGCGTCAT AATTTCACTT 7440

GCTTTAGAAA TATTAATCGC TTCAATGGCA TTACCATTAT GAGAGACCAC TTTTTCTTTA 7500

TGATATCGAT CATCAATCGC CGGTTTAAAT ACAACGACTT TTTGTTTAGC GTATATACCC 7560

CTTCTTAGAC GTCTTATTAG TTCTTCGGAT TTACCGCTAA ACATACTACC TGTAATACAT 7620

TCTATCCAAC CGGAATGGTA AGTTTCATAC ATTGAGAGTn CCACCTTTTT CAAAACATAA 7680

TCGCTTTATT ATATCATATT TCAAATATTC ATAAATGTCT TTnTCATAAT TATATCGATA 7740

TTGTACATGA ACAATTATTT TA 7762
(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2583 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

AGCGTATGCT TATCTCATTT TCGTACAAGA AACATGGACG ATGTTGCATG AAAACATGAC 60

TAAAAAAATT ATTATCAATG ATGAACTAGA ATTGACTGAA TTCCACCAAG AACTTACTTA 120

TATTTTAGAC AACATAnAAG GGAATAATAA TTATGGTAAG GAATTTGTTG CAACCGTTGA 180

AGAAACATTC GACATTGAAT AaAGCGGGGT GgaAGCACTA TGAATCAATG GGATCAGTTC 240

TTAACACCTT ATAAGCAAGC GGTTGATGAG TTGAAAGkGA AcTTaAAGGC ATGCGCAAAC 300

AATATGAAGT TGGTGAACAA GCGTCGCCAA TAGAATTTGT TACTGGTCGT GTTAAACCAA 360

TCGCTAGTAT TATAGATAAG GCAAACAAAC GACAAATACC ATTTGATAGG TTAAGAGAAG 420

AAATGTACGA TATCGCTGGT TTAAGAATGA TGTGCCAATT TGTTGAAGAT ATTGATGTTG 480

TCGTCAATAT TTTAAGACAA AGAmAAGATT TTAAAGTAAT TGAAGAACGA GATTATATTC 540

GTAACACTAA AGAAAGTGGT TACCGCTCGT ATCATGTCAT TATTGATAT CCAATTGAAA 600 

CATTACAAGG CCAAAAATTT ATATTGGCTG AGATTCAGAT TCGTACATTA GCAATGAATT 660

TCTGGGCAAC GATTGAACAT ACTTTACGAT ATAAATATGA TGGTGCTTAT CCGGATGAAA 720 

TTCAACATCG TTTGGAAAGA GCGGCAGAAG CAGCGTATTT ACTTGATGAA GAGATGTCTG 780

AAATTAAAGA TGAAATTCAG GAAGCTCAAA AATATTACAC GCAAAAACGT TCTAAAAAAC 840

ATGAAAATGA TTAACGAGGT GTTATAAATC ATGCGTTATA CAATTTTAAC TAAAGGTGAC 900

TCCAAGTCTA ATGCCTTAAA GCATAAAATG ATGAACTATA TGAAAGrTTT TcGCATGaTT 960

GaGGATrGTG AAAaTCCTGA AATTGTTiTT yCAGTTGGTG GTGACGGTAC ATTACTACAA 1020

GCATTCCATC AGTATAGCCA CATGTTATCA AAAGTGGCAT TTGTTGGAGT TCATACAGGT 1080

CATTTAGGAT TTTATGCGGA TTGGTTACCT CATGAAGTTG AAAAATTAAT CATCGAAATT 1140

AATAATTCAG AGTTTCAGGT CATTGAATAT CCATTGCTTG AAATTATTAT GASTACAAC 1200 

GACAACGGCT ATGAAACAAG GTATTTAGCA TTAAATGAAG CAACGATGAA AACTGAAAAT 12 60 

GGCTCAACAC TTGTTGTGGA TGTTAACTTA AGAGGGAAAC ACTTTGAGCG ATTTAGAGGC 1320 

GAT GGAT TAT GTGTATCAAC ACCTTCGGGT TCAACGGCTT ATAACAAAGC GCTAGGTGGC 138 0 

GCACTGATAC ATCCTTCACT TGAAGCAATG CAAATTACAG AAATTGCCTC GATAAATAAT 14 4 0 

CGTGTGTTTA GAACGGTAGG ATCACCACTT GTATTACCAA AGCATCATAC ATGTTTAATA 1500 

TCACCAGTTA ATCATGATAC CATTAGAATG AC G AT AG AT C ATGTTAGTAT CAAACATAAA 15 60 

AATGTTAATT CAATACAATA CCGTGTAGCA AATG&AAAG TGAGGTTTGC ACGTTTTAGA 162 0 

CCATTCCCAT TCTGGAAACG TGTGCACGAT TCTTTCATAT CAAGTGATGA AGAACGATGA 168 0 

AATTTAAGTA TCATATATCA CAACAAGAAA CTGTTAAAAC TTTTTTAGCA CGACATGATT 1740 

TTTCTAAGAA GACAGTGAGC GCCATTAAAA ATAATGGCGC TTTAATTGTT AATGATGAAC 1800 

CAGTCACAGT GCGTAAgcAA TTAATGACAA AT GAT AT ATT AGAAATTCAT TTACCGCGAG 18 60 

AAAT AC C GAG TGTTAATTTA ATACCTTATG CTCGTAAGCT AGAAGTATTG TATGAAGATG 1920 

CTTTTATCAT CATAGTTACT AAACCAAACA AT C AAAAT T G TACGCCTTCG AGAGAACATC 198 0 

C T CAT G AAAG T T T AATCGAA CAAGTACTAT ATCATTGTCA GGAACATGGT G AAAAT AT T A 20 4 0 

ACCCACATAT TGTTACGCGT CTAGATCGTA ATACAACTGG TATTGTGATA TTCGCTAAAT 2100 

ATGGACATAT CCATCATTTA TTTTCTAAAG TAAACTTGAA AAAAATATAT ACTTGCCTTG 2160 
TATATGGTAA AACCCATACA TCTGGTATTA TTGAAGCTAA TffiTAGACGG TCAAAGGATA 2220

GGATTATAAC TAGAGAAGTT GCCTCGGATG GTAAATACGC TAAAACATCT TATGAAGTAA 2280

TAAATCAGAA TGATAAATAC AGTTTATGCA AAGTTCATTT GCATACGGGA CGTACACATC 2340

AAATTCGTGT ACATTTTCAA CATATTGGGC ATCCAATTGT GGGAGATTCT TTGTATGATG 2400

GTTTTCATGA CAAAATTCAT GGTCAAGTAC TGCAATGTAC GCAAATATAT TTTGTTCATC 2460

CAATCAATAA GAACAATATT TATATTACAA TTGATTATAA GCAATTACTT AAATTATnCA 2520

ATCAACTCTA ATnCACACAG GGGGTGTAAG TATGTCAATG AnCACAGATG AAAAAGAGCG 2580

TGT 2583
(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1818 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

ATCAAGTGAT ACATTTAACT GGTAAAGGAT TAAnAGATGC TCAAGTTAAA AAATCnGGAT 60 

ATATACAATA TGAATTTGTT AAAGAGGATT TnACAGATTT ATTnGCAATT ACGGATACAG 120 

TAATAAGTAG AGCTGGATCA AATGCGATTT ATGAGTTCTT AACATACGT ATACCAATGT 180 

TAT TAG T AC C ATTAGGTTTA GATCAATCCC GAGGCGACCA AATTGACAAT GCAAATCATT 240 

TTGCTGATAA AGGATATGCT AAAGCGATTG AT G AAGAAC A AT T AACAGCA CAAATTTTAT 300 

TACAAGAACT AAATGAAATG GAACAGGAAA GAACTCGAAT TATCAATAAT ATGAAATCGT 3 60 

ATGAACAAAG T T AT ACGAAA GAAGCTTTAT TTGATAAGAT GAT T AAAGAC GCATTGAATT 4 20 

AATGGGGGGT AATGCTTTAT GAGTCAATGG AAACGTATCT CTTTGCTCAT CGTTTTTACA 4 80 

TTGGTTTTTG GAATTATCGC GTTTTTCCAC GAATCAAGAC TTGGGAAATG GATTGATAAT 54 0 

GAAGTTTATG AGTTTGTATA TTCATC3GAG AGCTTTATTA CGACATCTAT CATGCTTGGG 600 

GCTACTAAAG TAGGTGAAGT CTGGGCAATG TTATGTATTT CATTACTTCT TGTGGCATAT 6 60 

CTCATGTTAA AGCGCCACAA AATTGAAGCA TTATTTTTTG CATTAACAAT GGCATTATCT 720 

GGAATTTTGA ATCCAGCATT AAAAAATATA TTCGATAGAG AAAGACCTAC ATGCTGCGT 780 

TTAATTGATA TAACAGGATT TAGTTTTCCT AGCGGTCATG CTATGGGATC AACTGCATAT 84 0 

TTTGGAAGTG GTATCTATCT ATTAAATCGA TTAAATCAAG GTAATTCAAA AGGTATTCTT 900 

AT AGGGT TAT GTGCAGCTAT GATTTTATTG ATTTCCATAT CACGTGTATA TCTAGGTGTA 9 60 

CATTATCCAA CAGATATTAT TGCCGGCATT ATTGGTGGAT TATTTTGcAT TATTTTATCA 1020 
ACGTTATTAC TTAGAAATAA ATTAATAAAT TAAATAGTAA AAAAACAAAA GCAGTAAACC 1080

TAAAGTGTCG TAAGGGTTTA CTGCTTTTAT AAAACGTTGT TATAACGTAT ATTGTCTTTT 114 0 

ACGGGCATAT AAnAGGGGAA TATTTGAnAA TGAtCAATCC AACAAGAACG AAACGTTGTG 12 00 

GGGGGGATGT TCTATGTGGT ATTGATAATC ATTTTCAACT AC TAT TAT AC AT TAG T GAGA 12 60 

ATCATTGTCA ATTAGAAACT AAAACTTTTT TTGAATATTT TTTAAGAATA GTAAATAAAA 132 0 

CGCATGATTA CGCTATTTTA GAAAATAAAA AAATTTGTAT TTCTCATTAG AATTAGAATA 138 0 

TTTAAAAGTG ATGAGGTTTA AACATTATAT TGTTTACATA CTCCTTTTGA ATTCATACAT 14 4 0 

TATGAAATGT t ACTTCCAAG TTCAAAATCG CACATTGAAA TGATGTGTGA AATGTTTAAA 1500 

CTACGGTCAT tTTGTGmAAA TAAAG r TAAT AACTATTCAT TTTACAATAG TGAAAAGTCA 1560 

GTATATGACA ACAATTAATA TTGCGGTAAG GCCTTGTGTT ACAGTATTCT ATATTTAAGT 162 0 

ACTGCAATCA GAATTAACAG AATGCCATTA ACTGATTATT AAATATTTGA GTTAATAAAT 168 0 

AATTAATGAT TGTAGCTTGA AAAATTTAAA ACATGGTTAT TGATTTGTGA TAAAATTTAA 17 4 0 

ACGTAAACAA ACTAATTTAA AAAGCAAC T A TTGTATAGAA AATACAAAA TTTAAAATAT 18 00 

ATTACCTTAT TAGAAAAA 1818 
(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12658 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

TGTTTAAACA ATAGGGGGAA TCTTATGATT GAAAAATTAG TAACCTTTTT AAATGAGGTT 60 

GTTTGGAGTA AGCCATTAGT TTATGGTTTG CTAATTACTG GTGTGCTATT TACATTGCGT 120 

ATgCGATTTT TTCAAGTTAG AC AT T T T AAA GAAATGATTC GAT TAAT GTT TCAAGGAGAG 180 

AAGTCTCCTA ATGGTATTTC AAGTTTTCAA GCGATAgcCA TGTCTTTAGC AGGCAGGGTT 24 0 

GGTACAGGTA ATATTGTCGG TGTATCTACT GCAATATTTA TAGGAGGACC TGGTGCAGTA 300 

TTTTGGATGT GGATTACTGC GTTTTTAGGT GCAAGTAGTG CTTTTATTGA ATCTACACTT 360 

GGTCAAATAT TCAAGAGAGT TGAAAATAAT GAATACCGTG GTGGACCAGC GTATTATATT 4 20 

GAATATGGTA TTGGTGGTAA ATTTGGTAAA ATTTACGGAA TTATCTTTGC TATTGTTACG 4 80 

ATTATCTCAG TAGGTCTATT GCTTCCTGGT GTGCAATCTA ACGTATAGC AAGTTCTATG 54 0 

CATAATGCGA TTCATGTTCC ACAATGGTTA ATGGGTGGTA TTGTTGTAGT TATTTTGGGA 600 
TTAATTATTT TTGGTGGTGT ACGTATTATT GCCAATGTTG CAACAGCCGT TGTACCATTT 660

ATGGCAATTA T T T AC AT AC T GATGGCTGTC ATTATCATTT GTATCAATAT ACAAGAAGTG 720 

CCAGCGTTAT TTGCATTAAT TTTCAAATCA GCATTTGGAT TACAATCTGC TTTTGGTGGT 7 80 

ATCGTTGGCG CAATGATAGA GATTGGTGTT AAACGTGGAT TATATTCAAA TGAGGCTGGT 840 

CAAGGTACAG GTCCACACGC AGCAGCGGCa gcAGaAGTAT CAC AT CCAAG TAAACAAGGT 900 

CTAGTACAAG CATTTTCAGT TTATATTGAT ACATTATTTG TATGTACTGC AACTGCTCTG 960 

ATTATACTTA TTTCTGGTAC ATATAATGTG ACTGATGGTA CGGTTAATGC GAATGGCACA 1020 

CCGCATTTAA TTAAAGATGG CGGTATTTAT GTTgAAAATG CAACAGGTAA AGATTATTCA 1080 

GGTACTGCGA TGTATGCACA AGCCGGCATt GATAAAGCGT TCCATGGCAG TGGTTATCAA 1140

TTTGATCCTA CTTTCTCTGG CGTAGgTTCG TACTTTATTG cATTTGCTTT ATTCTTCTTT 12 00 

GCATTTACTA CAATTTTGTC GT ACT ACT AC AT T AC AG AAA CAAATGTTGC TTATTTAACG 12 60 

CGTAATCAAA ATAATCAAGT TTCATCGATA TTTATTAATA TTGCTCGTGT GATTATTTTG 1320 

TTCGCTACAT TTTACGGTGC AGTTAAAACA GCTGATGTAG CATGGGCATT CGGTGATTTA 1380 

GGTGTAGGTC TAATGGCTTG GTTAAATATC ATTGCGATTT GGATTTTACA TAAGCCTGCC 14 4 0 

GTAAATGCTT TAAAAGATTA TGAAATTCAA AAGAAACGTT TAGGCAACGG TTATAATGCA 1500 

GTTTATCAAC CTGATCCGAA TAAATTACCT AffGCTGTCT TTTGGTTGAA GACGTATCCA 1560 

GAACGTTTAA AACAAGCACG TGCCAAAAAG TAATCTACTT TTGTTTATAG TATATGTAGT 162 0 

GATCATTTGA TAAAAAAGAA AAGTATTGAG AATTTTAGGt GCTCAGAAAT TTGAATTTTA 1680 

AAAATATAGT GTCTCTTGGT ACAATAACAA TACAACTACT AGGGGCACTT TTTTATGOA 17 4 0 

GAATTTAAAA CTGGTAAGAT TAATAAACAT GTTTTATATA GTAATATTTT AAATAGAGAT 1800 

GTCACGTTAA GTATTTATTT ACCAGAATCT TATAATCAAC TTGTTAAATA TAATGTCATT 1860 

CTTTGCTTTG AC GG AT TAG A TTTTTTACGT TTCGGGAGAA TACAACGTAC ATATGAATCG 192 0 

TTAATCAAAG AAGCGCGTAT TGATGATGCG ATCATTGTTG GATTCCATTA TGAAGACGTT 198 0 

GATAAGCGTA GAGAGGAATT T CAT C C AC AA GGAAGTCGTT CTCATTTAAC TATTCAATCA 204 0 

GTCGGTAAAG AAATATTGCC ATTTATTGAC TCGACGTTTT C T AC AC T G AA AGTAGGTAAT 2100 

GCAAGGTTAT TAGTAGGGGA TAGTTTAGCG GGTAGTATTG CCTTATTAAC GGCGTTGACC 2160

TATCCAACGA TTTTTAGTCG TGTAGCAATG TTAAGTCCAC ATT C AG AT G A AAAAG T AT T A 222 0 

GATAAGCTAA ATCAATGTGC AAATAAAGAA CAATTGACAA TTTGGCATGT CATTGGTCTA 2280 

GATGAAAAAG ATTTTACTTT ACCAACAAAT GGTAAGCGTG CCGATTTCTT AACACCGAAT 2340

AGAGAATTAG C T G AAC AAA T TAAGAAATAT AATATAACTT AT TAT T AC G A TGAATTTGAT 2 4 00 

GGTGGTCACC AATGGAAAGA TTGGAAACCA TTGCTGTCAG ATATATTATT GTATTTTTTA 2 4 60 
AGTAAAAACA CAGATGATCA ACTTTATGAA TAATTTACAT TAGTAGATTT AGTATGAATT 2520

GTCTTCATAT AGTCTGGTCT ATAATATAAT TTATAAAAGA TTTTACTGTT TAATTTAATT 2580 

TAAATTTGAC GAAATTGCAA AAGATGTATA ATGAATTATT TTTAATGTAA CGGTTTTCAA 2 64 0 

AGAAATTTGA TATAATAGCA ATAGGTTAAA CAAAGGAGGA ATTCAGATGA TTTTAGGATT 2700 

AGCATTAATT CCATCAAAGT CATTTCAAGA AGCGGTGGAT TCTTACGTA AAAGATATGA 27 60 

TAAACAGTAT TCACGAATTA AAC C AC AT G T GACAATTAAA GCGCCATTTG AAATTAAAGA 2 820 

TGGTGATTTA GATTCTGTCA TTGAACAGGT TAGAGCTCGT ATTAATGGTA TACCAGCAGT 28 80 

AGAAGTTCAT GCTACAAAAG CTTCTAGCTT CAAACCAACG AACAATGTGA TTTACTTTAA 2 94 0 

AGTTGCGAAG ACGGACGACT TAGAAGAATT GTTTAATCGC TTTAATGGAG AAGATTTCTA 3000 

TGGAGAAGCT GAACATGTTT TTGTGCCACA CTTTACAATA GCACAAGGAC TATCTAGCCA 3060 

AGAATTCGAA GATATTTTTG GTCaAGTAGC ATTAGCTGGG GTAGACCaTA AAGAAAT TAT 312 0 

CGATGAATTA ACTTTGTTAC GTTTTGACCA TGACGAAGAT AAAT GGAAAG TTATTGAAAC 318 0 

GTTTAAATTA GCTTAAGTAA CATAATAGTA TTGTTAATCG TAGTATGTTT GAATTAATAA 324 0 

GAAAATGGTC ATTTTTATTG AATGTAATAA AAATGACCAT TTTCTTTATT TTAAAATACG 3300 

TTTTAACCTT ACTTAGCTTT TTCTCTATTT ACTATAAAGT rGCTTCCATA AAAffiCAGCT 33 60 

AAGACTAAAA AGATTAATGC CGAGAAATAA AATGTATTGT TTAAATTGTT GGTAAATTGT 34 20 

GTAATTAATC CGCCAAATAA TGGCCCTATC ATTGAGCCGA ATCCTTGGAT ACTATTAAAA 34 8 0 

ACACCCCAAG TTTCTTCTTG TTCATCTGAT TTGATAAATC GTGCCATAAA GGTATTCCAT 354 0 

GCTGGTAATA AGATGCCATA CATTAGACCG ATAGCTAAAG CGATAATCCA CAAGATGTGA 3600 

AT AT T AAC AA TCATAGATAG AGTAAAAATT AATATCATGT ATAAAATAAA TCCGCTTAGA 3660 

ATAACACCAT ACATAAAGTT TCTGCTGCGG TTATCTATTA GTTTCGATAA AAAT AGCAT C 3720 

GAAACTGCAC AGCCGATACC ACCAATAATG AT TG CMC AG TATATTCAAT TGTGCTTACG 37 8 0 

TTAATAACCT TAGTAGCATA TGTTGGTAAT ATAGGAACTA GGGCAGCAAT TGCGGCACCT 38 4 0 

TGTAAAAGAA TACCAGGGAA C a AC AAT AAA TGGcGCTTTG T C AC AT C AAC AATTTGTCTC 3900 

AATTGAGCTT TAACTGGACG AGTATTATAA TTTGTTAACT TTACATCGAC AAAATAATAT 3960 

AATATCCATG CAATTAAAAC GACTAAAGAC ATCATGAAGG CAAAGCGTGT TGGGTGCACT 4 020 

TTGATAAGTA GATTCATAAA AACCATACCT ACCAATAGGC CTAACAACCA TGAAAAATAA 4 08 0 

ACATAGCCCA TTTGTTTGCC ACGTTTATCT TCTTCAACAC TGGATAACAT AAT G AC C C AA 414 0 

ATAGGACTAA CTGCAATACC GAGCATCATA GCACTAAATA T GAT T AC AAA AGGTGATGCT 4 200 

GGAAACCAAA TAACTAAAAA TAAACTTGTA AATGCTAAAA TAAATCCAGT CGTTAAAACG 4 2 60 
ATTTTTGTGC CGAATTTTTT CAGTAAAAAT CCTATAACAA AGTTTGTAGA TGCATCAGCA 4320

ATAAAATGTA TTGAAAATGC TAGAGACGTT ATTGCTACAG CAffiGGATGT AACTGTTGGC 4 380 

AAGAAATTAA TATAGCTTAG GAT AT AC AT G CCTCTCGCAA ATTCCATTAA AAATAAGATA 4 4 40 

ATAAGCaTTA AAATGAAATT TTTATGATTA GCGTAATTAT TTAACGAAGA ATCTTGCATA 4 500 

TAAAGGAACC TTTCCATAAA TCTCTTGTGG TTGTGATGAA TGACCGATTA AATCAAGTAA 4 5 60 

GTCTCGACAT ATTGTCTGTG TAGCATACTT AATTTTATCT TGTTCCATTG TACTAATCAT 4 620 

GTTAGTTAAT TGCTCATTAC CGTTAGTTAA ACTTGCTACA ATTTTTATTG CTTCTTCTGG 4680

AGTATCAGCG ATTTTACCAA AACCTTTTTC TTCAAAGTAA AGGGCATTTT CAAGCTCTTG 4740 

ACCAGGTGCA GGATTTAGGA AAATOYTTGG AATACAACGG GCGAAACCTT CAGTTATTGT 4 8 00 

GATACCACCA GGTTTCGTAA TCATAAGTTG ACTTGATGCC ATCCATTCAT TCATGTGTTT 4 8 60 

GGTATAACCT AGAATCAATA CATTCTCGTT AGATTTAAAC TTAGCTGTTA AAGAACGCTT 4 92 0 

TAGCTCTTTG CTCTTACCAC AAAT CAT AAC TACTTGTGCA TTTGCaCTTTtCGCTAATAT 4 980 

ATCAGTAATC ATCGTGTCAA AACCTTTAGA T AC AC C AAAT GCACCAGCTG aCATTAAAAT 504 0 

AGTTTGCTTA TCTGGATCTA AGTTGTTGTC TATTAACCAC TGCTTTTGAT TAATAGGCGT 5100 

TTCAAATTTG TTATCAATAG GAATACCTGT CaCTTTAACT GTTGAAGGAT CAATACCTAC 5160 

GTCTATGAAG TCTTGTTTCG TTTCTTTTGT TGCCACATAA TATCTTGTTG AATACGGCGT 5220 

AATCCAGTTT TTATGTAAGC GATAGTCTGT CATCACTGTA GCAACTGGAA TATTAATGTT 5280 

AAATTGCTCA GTTAGTACCG ACATAACTGG TGTAGGAAAC GTTAATAATA TTAAATCTGG 534 0 

CTTTTCTTTT ATCAATAAAT TAATTAACTT ATTAAGTCCA TAGTATTTGT AAAAAC AT T T 5 4 00 

GTCTAGTTTA TCTGGGCGGC TGTAATAAAA CCCTTTGTAC ATATTTCTAA AAT AT T T AAA 54 60 

GCTATTGATA TACCATTTTT TACAAATAGA AGTCAAAATT GGATGAGCTT CCATAAATAA 5520 

ATCGTGCTCA ATGACGCTTA AATGGTCTAG ATTCATATCA TTAAGTTGAT TAACGATST 5580 

CTGTGTAACT TGcATATGAC CGTTACCGAA TGAGCCAGTA ATAATCAATA TCTTTTTATT 5 64 0 

T T GAG T AAC C ATTAATAGCC ACCCTCCGTT AGTTTGAAAA TTTTATTTAA GTGTAACTTA 5700 

TTTTACGGCA TTATAAAAGA AATAAAGACG CAAAGTCGTT ACATTTATAG CAATTTTAAT 57 60 

CTATAGATGA ATTGA.TACAA AATAAAACGT TATTTTATAA AGCAATTTAT TGTTCTATGT 5820 

TTTATTTGTA TATTTAAAAT TATCCAGTAT ACAATTATAG CATATTTTTG GAAACAATTA 58 8 0 

TGATATTATA CCATGTTACA AGATGGTTTT AAT AAT T T AA GATGAGCCAT AATTGTAAAA 5 94 0 

CTAATTCATA ATACCGTATG TTTTATTTTT AATAGTAGAA ATTAGAAAAT GCTGATTAGT 6000

AGGATATAAC AGTGAAATTA TAAATTTATT AACATCAACA AAACGTGTAT AATAAACATA 60 60 

TTGTAGAAAA AGGAGCGGTT CAGTTTGGAT GCAAGTACGT TGTTTAAGAA AGTAAAAGTA 6120 
AAGCGTGTAT TGGGTTCTTT AGAACAACAA ATAGATGATA TCACTACTGA TTCACGTACA 6180

GCGAGAGAAG GTAGCATTTT TGTCGCTTCA GTTGGATATA CTGTAGACAG TCATAAGTTC 62 4 0 

TGTCAAAATG TAGCTGATCA AGGGTGTAAG TTGGTAGTGG TCAATAAAGA ACAATCATTA 63 00 

CCAGCTAACG TAACACAAGT GGTTGTGCCG G AC AC AT T AA GAGTAGCTAG TATTCTAGCA 63 60 

CACACATTAT ATGATTATCC G^GTCATCAG TTAGTGACAT TTGGTGTAaC GGGTACAAAT 64 2 0 

GGTAAAACTT CTATTGCGAC GATGATTCAT TTAATTCAAA GAAAGTTACA AAAAAATAGT 64 80 

GCATATTTAG GAACTAATGG TTTCCAAATT AATGAAACAA AGACAAAAGG TGCAAATACG 654 0 

AC AC C AG AAA CAGTTTCTTT AACTAAGAAA ATTAAAGAAG CAGTTGSGC AGGCGCTGAA 660 0 

T C TAT G AC AT TAGAAGTATC AAGCCATGGC TTAGTATTAG GACGACTGCG AGGCGTTGAA 6660 

TTTGACGTTG CAATATTTTC AAATTTAACA CAAGACCATT TAGATTTTCA TGGCACAATG 67 2 0 

GAAGCATACG G AC AC G C G AA GTCTTTATTG TTTAGTCAAT TAGGTGAAGA TTTGTCGAAA 67 8 0 

GAAAAGTATG TCGTGTTAAA CAATGACGAT TCATTTTCTG AGTATTTAAG AACAGTGACG 68 4 0 

CCTTATGAAG TATTTAGTTA TGGAATTGAT GAGGAAGCCC AATTTATGGC T AAAAAT AT T 6900 

CAAGAATCTT TACAAGGTGT CAGCTTTGAT TTTGTAACGC CTTTTGGAAC TTACCCAGTA 6 9 60 

AAATCGCCTT ATGTTGGTAA GTTTAATATT TCTAATATTA TGGCGGCAAT GATTGCGGTG 7 02 0 

TGGAGTAAAG GTACATCTTT AGAAACGATT ATTAAAGCTG TTGAAAATTT AGAACCTGTT 70 80 

GAAGGGCGAT TAGAAGTTTT AGATCCTTCG TTACCTATTG ATTTAATTAT CGATTATGCA 714 0 

CATACAGCTG ATGGTATGAA CAAATTAATC GATGCAGTAC AGCCTTTTGT AAAGAAAAG 7200 

TTGATATTTT TAGTTGGTAT GGCAGGCGAA CGTGATTTAA CTAAAACGCC TGAAATGGGG 72 60 

CGAGTTGCCT GTCGTGCAGA TTATGTCATT TTCACACCGG ATAATCCGGC AAATGATGAC 7 32 0 

CCGAAAATGT TAACGGCAGA ATTAGCCAAA GGTGCAACAC ATCAAAACTA TATTGAATTT 7380 

GATGATCGTG CAGAAGGGAT AAAAC AT G C A ATTGACATAG CTGAGCCTGG GGATACTGTC 74 40 

GTTTTAGCAT CAAAAGGAAG AGAACCATAT CAAATCATGC CAGGGCATAT TAAGGTGCCA 7 5 00 

CATCGAGATG ATTTAATTGG CCTTGAAGCA GCTTACAAAA AGTTCGGTGG TGGCCCTGTT 7 5 60 

GAT T AAT AAA AGATTTATTG ATGAAGGTAA AACTATGAT GTTTATTTAT TCGAAGCATT 7 620 

AAATAACCAG ATAATCATTG CTATACCAGA TTGGTTTTGG T CAT AT C AG A T G G C AAT G AC 7680 

AT TAG AT G AA GAAACTTGTT TTGAAGCAAT ACTCATGCAA TTGTTTGTTT TTAAAGAAGA 77 4 0 

GGAAGAGGCA GAATCGATTG CATCACAACT AACAGATTGG ATAGAAACAT ATAAAAAGGA 7 800 

GAAAGACTAA TGAACTTAAA GCAAGAAGTT GAG T C T AG AA AGACTTTTGC GATTATTTCA 7 8 60 

CATCCCGATG CAGGGAAAAC AACGTTAACT G AAAAAC TAT TGTACTTCAG TGGTGCTATT 7 920 
CGTGAAGCGG GTACAGTTAA AGGGAAGAAG ACTGGTAAAT TTGCGACAAG TGACTGGATG 7980

AAAGTTGAAC AAGAGCGTGG TATTTCTGTA ACTAGTTCAG TAATGCAATT TGATTACGAT 804 0 

GATTATAAAA TCAATATCTT AGATACACCA GGACATGAAG ACTTTTCAGA AGATACGTAT 8100 

AGAACATTAA TGGCAGTTGA CAGTGCTGTC ATGGTCATAG ACTGTGCAAA AGGTATTGAA 8160 

C C ACAAAC AT TGAAGTTATT TAAAGTTTGT AAAATGCGTG GTffiTCCAAT CTTTACATTC 8220 

ATTAATAAAT TAG AC C GAG T AGGTAAAGAA CCATTTGAAT TATTAGATGA AATCGAAGAG 82 8 0 

ACATTAAATA TTGAAACATA CCCTATGAAT TGGCCAATTG GTATGGGACA AAGTTTCTTT 8 34 0 

GGCATCATTG AT AG AAAG T C TAAAACAATT GAACCATTTA GAGATGAAGA AAAT AT AT T A 8 4 00 

CATTTGAATG ATGATTTTGA GTTGGAAGAA GATCATGCAA T T AC AAAT GA TAGTGATTTT 84 60 

GAACAAGCGA TTGAAGAATT AATGTTGGTT GAAGAAGCGG GTGAAGCCTT TGATAATGAC 8 52 0 

GCGCTGTTGA GTGGAGACTT AACACCTGTA TTTTTCGGTT CAGCTTTAGC TAACTTTGGT 8580 

GTACAAAATT TCTTAAATGC ATATGTTGAT TTTGCGCCAA TGCCAAATGC GAGACAAACA 8 64 0 

AAAGAAGACG TTGAAGTAAG CCCGTTTGAT GATTCATTTT CAGGATTTAT CTTTAAAATT 8700

CAAGCCAACA TGGACCCTAA ACACCGTGAT AGAATTGCCT TTATGCGTGT CGTTAGTGGT 87 60 

GCATTTGAAC GTGGTATGGA TGTTACTTTG CAACGTACTA ATAAAAAGCA AAAGATCACA 8820

CGTTCAACGT CATTTATGGC AG AC GAT AAA GAAACTGTGA ATCATGCTGT AGCAGGCGAT 88 8 0 

ATCATTGGAC TAT AT GAT AC TGGTAATTAT CAAATTGGAG ATACTTTAGT TGGTGGAAAA 8 94 0 

CAAACCTACA GTTTCCAAGA T T T AC C AC AA TTTACGCCAG AAATTTTTAT GAAAGTTTCT 9000 

GCTAAAAACG TCATGAAACA GAAGCATTTC CATAAAGGTA TTGAAGAATT AGTACAAGAA 9060 

GGTGCGATTC AATACTATAA AACATTACAC AC AAAC C AAA TTATTTTAGG TGCTGTTGGT 912 0 

CAGTTACAAT TTGAAGTTTT CGAACATAGA AT GAAAAACG AATATAATGT TGATGTTGTT 9180 

ATGGAGCCAG TAGGCCGTAA AATTGCACGT TGATTGAAA ATGAAGACCA AATTACAGAT 92 4 0 

AAGATGAACA CATCAAGATC GATTTTAGTG AAAGATAGAT ATGACGATTT AGTATTCTTA 9300 

TTTGAAAATG AATTTGCAAC AAGATGGTTT GAAGAGAAAT TCCCTGAAAT TAAATTGTAT 93 60 

AGTTTACTTT AACAGCTCAA TTGTATAATC GAATTTGTTA CATTAAAAAT AATTGTTOG 94 2 0 

TTGAAGAAAA ATAAATTGTA TATTTTAAAA GAAAAAGGTA TACTATGATG TATCAAATGA 94 8 0 

ATAACCTATG GCATTTTGTC AGAGGGGAGT AACTTAAGAA TCATGACCGT ATAAATGaTT 95 4 0 

CGACACTTTA TCGTCATTAC GArGATATCT TCCGGTAAAG TGGGCAATTT AAATTGCTTA 9600 

GTGAGACCTT TGCTATTTAT TTAGCATAGG TCTTTTTGTT TGTACTTAAC TTATTTATTT 9660 

AAAGGAGTTG TACATGTTAA TGGATCCAAG TTTGATCTTA CCTTATTTAT GGGTACTTGT 9720 

CGTTTTAGTA TTTTTAGAAG GCTTATTAGC AGCAGATAAC GCGATTGTTA TGGCTGTAAT 97 8 0 
GGTTAAGCAC TTACCACCCG AACAACGTAA AAAAGCTTTG TTTTACGGTT TGTTAGGTGC 9840

ATTTGTATTT AGATTTTTAG CATTATTCTT AATTAGTATT ATCGCGAACT TTTGGTTTAT 9900 

TCAAGCTGCA GGAGCGGTTT ACTTAATTTA TATGTCAATC AAAAATCTGT GGCAGTTCTT 9 9 60 

TAAACACCCA GAAATTGAAA GTCCTGAAGC TGGAGATGAT CATCATTATG ATGAATCTGG 10020

TGAAGAGATT AAAGCAAGTA ACAAATCATT CTGGGGAACT GTGTTGAAAA TAGAATTTGC 10080 

AGATATCGCA TTTGCCATTG ATTCTATGCT TGCTGCTTTA gCTATTGCTG TAACACTTCC 1014 0 

TAAAGTTGGT ATTCACTTTG GTGGTATGGA CTTAGGTCAG TTCGTAGTCA TGTTCCTAGG 10200 

TGGAATGATT GGTGTTATTC TAATGCGTTA TGCAGCAACA TGGTTTGTAG AGCTATTAAA 102 60 

CAAATATCCA GGACTTGAAG GTGCAGCCTt CGCGATCGTT GGTTGGGTAG GTGTTAAATT 10320 

AGTTGTCATG GTATTAGCGC ACCCAGACAT CGCTGTATTG CCTGAGCACT TCCCACATGG 10380 

CGTATTATGG CAATCTATTT TCTGGACAGT ACTAATTGGA TTAGTAffiTA TCGGTTGGTT 1044 0 

AGGTTCAGTT GTTAAAAATA AAAAATCGCA TAAATAATTG ATGTGAAGCG GACAATCTTA 105 00 

ATTTAGTTTA AGGTTGTCCT TTTTCATTTA ATTGAGTGAT TTATGAAAAA TGGATTTTGA 105 60 

AGAATGTGAA TCAAAAGATG CGATATAGTA TTAAGAAAAT GTGCCTTTTA TATTTAGCAT 10 620 

TTTTTCAATA GAAAT TAT AT AGAT T T T AAA GC AAAT T AGG TGTTAATGTG TCATAATGAT 10 680 

AAGTGATTTT ATTGAATGGA GTGGACATTA GTGGATATTG GTAAAAAACA TGTAATTCCT 10740 

AAAAGTCAGT nACCsaCGTA AGCGTCGTGA ATTCTTCCAC AACGAAGACA GAGAAGAAAA 10800 

TTTAAATCAA CATCAAGATA AACAAAATM 1 AGATAATACA ACATCAAAAA AAGCAGATAA 10860 

GCAAATACAT AAAGATTCAA TTGATAAGCA CGAACGTTTT AAAAATAGTT TATCATCGCA 10920 

TTTAGAACAG AGAAACCGTG ATGTTAATGA GAATAAAGCT GAAGAAAGTA AAAGTAATCA 10 980 

GGAT AG TAAG TCAGCATATA AC AG AG AT C A TTATTTAACA GACGATGTAT CTAAAAACA 1104 0 

AAATTCATTA GATTCAGTGG ACCAAGATAC AG AG AAAT C A AAATATTATG AGCAAAATTC 11100 

TGAAGCGACT TTATCAACTA AATCAACCGA TAAAGTAGAA TCAACTGAAA TGAGAAAGCT 11160 

AAGTTCAGAT AAAAACAAAG TTGGTCATGA AGAGCAACAT GTACTTTCTA AACCTTCAGA 11220 

AC AT GAT AAA GAG AC T AG AA TTGATTCTGA GTCTTCAAGA ACTGATTCAG ACAGCTCGAT 11280 

GCAGACAGAG AAAATAAAAA AAGACAGTTC AG AT GGAAAT AAAAGTAGTA ATCTGAAATC 1134 0 

TGAAGTAATA T C AG AC AAAT CAAATACAGT ACCAAAATTG TCGGAATCTG AT GAT GAAG T 114 00 

AAATAATCAG AAGCCATTAA CTTTACCGGA AGAAGKAAA TTGAAAAGAC AGCAAAGTCA 114 60 

AAATGAGCAA ACAAAAACCT ATACATATGG TGATAGCGAA CAAAATGACA AGTCTAATCA 11520 

T GAAAAT GAT TTAAGTCATC AT AT AC CATC GATAAGTGAT GATAAAGATA ACGTCATGAG 11580 
AGAAAATCAT ATTGTTGACG ATAATCCTGA TAATGATATC AATACACCAT CATTATCAAA 11640

AACAGATGAC GATCGAAAAC TTGATGAAAA AATTCATGTT GAAGATAAAC ATAAACAAAA 11700 

TGCAGACTCG TCTGAAACGG TGGGATATCA AAGTCAGTCA ACTGCATCTC ATCGTAGCAC 11760 

TGAAAAAAGA AATATTTCTA TTAATGACCA TGATAAATTA AACGGTCAAA AAACAAATAC 11820 

AAAGACATCG GCAAATAATA ATCAAAAAAA GGCTACATCA AAATTGAACA AAGGGCGCGC 118 80 

TACGAATAAT AATTATAGTG ACATTTTGAA AAAGTTTTGG ATGATGTATT GGCCTAAATT 11940 

AGTTATTCTA ATGGGTATTA TTATTCTAAT TGTTATTTTG AATGCCATTT TTAATAATGT 12000 

GAACAAAAAT GATCGCATGA ATGATAATAA TGATGCAGAT GCEaAAAAT ATACGACAAC 12060 

GAT G AAAAAT GCCAATAACA CAGTTAAATC GGTCGTTACA GTTGAAAATG AAACATCAAA 12120 

AGATTCmTCA TTACCTAAAG AT AAAGC AT C TCaAGACGAA GTGGGATCAG GTGTTGTATA 12180 

TAAAAAATCT GGAGATACGT TATATATTGT TACGAATGCA CACGTTGTCG GTGATAAAGA 12240 

AAATCaAAAA ATAACTTTCT CGAATAATAA AAGTGTTGTT GGGAAAGTGC TTGGTAAAGA 12300 

TAAATGGTCA GATTTAGCTG TTGTTAAAGC AACTTCTTCA GACAGTTCAG T G AAAG AG AT 12360 

AGCTATTGGA GATTCAAATA ATTTAGTGTT AGGAGAGCCA ATATTAGTCG TAGGTAATCC 12420 

ACTTGGTGTA GACTTTAAAG GCACETGAC AGAAGGTATT ATTTCAGGTC T G AAC AG AAA 124 80 

TGTTCCTATT GATTTCGATA AAGATAATAA ATATGATATG TTGATGAAAG CTTTCCAAAT 1254 0 

T GAT G CAT C A GTAAATCCAG GTAACTCGGG TGGTGCTGTC GTCAATAGAG AAGGAAAATT 12 600 

AATAGGTGTA GTTGCAGCTA AAATTAGTAT GCCAAACGTT GAAAnTATGT CATTTGCA 12658
(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6048 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

TGAAATnGAA TAGTACTATT GCAAGTGTAA AGAGGTTAAT TTTTGCCnCA CGCGGGACTT 60 

AAAAAGGCAA CCACTGGTTG TGACATATCC TTATTTACAT TTATAAATAT AAG GAG GAG G 12 0 

TAG TAG T G AA AGACTTATTG CAAGCACAGC AAAAGCTTAT ACCGGATCTC ATAGATAAAA 180 

TGTATAAACG TTTTTCTATT CTTACTACTA TCTCAAAAAA TCAGCCTGTC GGACGTCGAA 24 0 

GTTTAAGCGA ACATATGGAT ATGACTGAAC GTGTACTGCG TTCTGAAACA GATATGCTTA 300 

AGAAACAAGA TTTGATAAAA GTTAAGCCTA CCGGAATGGA AATTACAGCT GAAGGTGAGC 3 60 

AACTGATTTC GCAATTGAAA GGTTACTTfG ATATCTATGC AG AT G AT AAT CGTCTGTCAG 4 20 
AAGGTATTAA GAATAAATTT CAAATTAAGG AAGTTCATGT TGTTCCTGGT GATGCTGATA 480

AT AGTCAAT C TGTTAAAACA GAATTAGGTA GACAAGCAGG TCAATTACTT GAAGGCATAT 54 0 

TACAAGAAGA CGCGATAGTT GCTGTAACTG GCGGATCCAC GATGGCATGT GTSGTGAAG 600 

CAATTCATTT ATTACCATAT AATGTATTCT TCGTACCAGC CAGAGGTGGA CTAGGCGAAA 660 

ATGTTGTCTT TCAGGCAAAC ACAATTGCAG CCAGTATGGc aCAACAAGCT GGCGGTTATT 72 0 

ATACGACGAT GTATGTACCT GATAATGTCA GTGAAaCAAC ATATAATACA TTGTTGTTAG 780 

AGCCATCAGT CATAAACACT TTAGACAAAA TTAAACAAGC AAAC G T TATA TTACACGGCA 840 

TTGGTGATGC GCTGAAGATG GCGCATCGAC GTCAATCACC TGAAAAGGTC ATTGAACAAC 900 

TTCAACATCA TCAAGCTGTC GGAGAGGCAT TTGGTTATTA TTTTGATACA CAAGGTCAAA 9 60 

TTGTCCATAA GGTTAAAACA ATTGGACTTC AATTAAAGA CCTTGAATCA AAAGACTTTA 102 0 

TTTTTGCAGT TGCAGGAGGC AAATCGAAAG GTGAAGCAAT TAAAGCATAC TTGACGATTG 108 0 

CACCCAAGAA TACAGTGTTA ATCACTGATG AAGCCGCAGC AAAGATAATA CTTGAATAAG 114 0 

AG AT AAAAA G TTTAATACTT TTTAAATATC ATTTTAAAGG AGGCCATTAT AATGGCAGTA 1200 

AAAGTAGCAA TTAATGGTTT TGGTAGAATT GGTCGTTTAG CAT T C AGAAG AATTCAAGAA 12 60 

GTAGAAGGTC TTGAAGTTGT AGCAGTAAAC GACTTAACAG ATGACGACAT GTTAGCGCAT 1320 

TTATTAAAAT AT G AC AC TAT GCAAGGTCGT TTCACAGGTG AAGTAGAGGT AGTTGATGGT 1380 

GGTTTCCGCG TAAATGGTAA AGAAG T T AAA TCATTCAGTG AACCAGATGC AAGCAAATTA 14 4 0 

CCTTGGAAAG ACTTAAATAT CGATGTAGTA TTAGAATGTA CTGGTTTCTA C AC T GAT AAA 1500 

GATAAAGCAC AAGCTCATAT TGAAGCAGGC GCTAAAAAAG TATTAATCTC AGCACCAGCT 1560 

ACTGGTGACT TAAAAACAAT CGTATTCAAC ACTAACCACC ABAGTTAGA CGGTTCTGAA 1620 

ACAGTTGTTT CAGGTGCTTC ATGTACTACA AACTCATTAG CACCAGTTGC TAAAGTTTTA 168 0 

AACGATGACT TTGGTTTAGT TGAAGGTTTA AT G AC T AC AA TTCACGCTTA CACAGGTGAT 17 4 0 

CAAAATACAC AAGACGCACC TCACAGAAAA GGTGACAAAC GTCGTGCTCG TGCAGCGGCA 18 00 

G AAAAC AT C A TCCCTAACTC AACAGGTGCT GCTAAAGCTA TCGGTAAAGT TATTCCTGAA 18 60 

ATCGATGGTA AATTAGATGG TGGTGCACAA CGTGTTCCTG TAGCTACAGG TTCATTAACT 1920 

GAATTAACAG TAGTATTAGA AAAACAAGAC GTAACAGTTG AACAAGTTAA CGAAGCTATG 198 0 

AAAAATGCTT CAAACGAATC ATTCGGTtAC ACTGAAGACG AAATCGTTTC TTCAGACGTT 204 0 

GTAGGTATGA CTTACGGTTC ATTATTCGAC GC TACAC AAA CTCGTGTAAT GTCAGTTGGC 2100 

GACCGTCAAT TAGTTAAAGT TGCAGCTTGG TATGATAACG AAATGTCATA TACTGCACAA 2160 

TTAGTTCGTA CATTAGCATA CTTAGCTGAA CTTTCTAAAT AATTTTAGTA TAGTTTTTAT 2220



TCAAATACGC TAGTGCTCAG AACTATTTAG CATTAATTAA AGCTTATGAG TAAGCGGGGA 2280 

GCACAAACGC TTCTCCGCTT ATTTTTATAT AAAATTTCCT AATTACAAGG AGGAAACACC 234 0 

ATGGCTAAAA AAATTGTTTC T GAT T TAG AT CTTAAAGGTA AAACAGTCCT AGTACGTGCT 24 00 

GATTTTAACG TACCTTTAAA AGACGGTGAA ATTACTAATG ACAACCGTAT CGTTCAAGCT 2 4 60 

TTACCTACAA TTCAATACAT CATCGAACAA GGTGGTAAAA TCGTACTATT TTCACATTTA 2520 

GGTAAAGTGA AAGAAGAAAG TGATAAAGCA AAATTAACTT TACGTCCAGT TGCTGAAGAC 2580 

TTATCTAAGA AATTAGATAA AGAAGTTGTT TTCGTACCAG AAACACGCGG CGAAAAACTT 2 64 0 

GAAGCTGCTA TTAAAGACCT TAAAGAAGGC GACGTATTAT TAG T T G AAAA TACACGTTAT 27 00 

G AAG AT T TAG ACGGTAAAAA AG AAT C T AAA AATGATCCAG AATTAGGTAA ATACTGGGCA 27 60 

TCTTTAGGTG ATGTGTTTGT AAATGATGCT TTTGGTACTG CGCATCGTGA GCATGCACT 2 820 

AATGTTGGTA TTTCTACACA TTTAGAAACT GCAGCTGGAT TCTTAATGGA TAAAGAAATT 2880 

AAGTTTATTG GCGGCGTAGT TAACGATCCA C AT AAAC C AG TTGTTGCTAT TTTAGGTGGA 2 94 0 

GCAAAAGTAT CTGACAAAAT TAATGTCATC AAAAAC T TAG TTAACATAGC TGATAAAATT 3000 

ATCATCGGCG GAGGTATGGC TTATACTTTC TTAAAAGCGC AAGGTAAAGA AATTGGTATT 30 60 

T CAT TAT TAG AAGAAGATAA AATCGACTTC GCAAAAGATT T AT T AGAAAA ACATGGTGAT 3120 

AAAATTGTAT TACCAGTAGA CACTAAAGTT GCTAAAGAAT TTTCTAATGA TGCCAAAATC 318 0 

ACTGTAGTAC CATCTGATTC AATTCCAGCA GACCAAGAS GTATGGATAT TGGACCAAAC 32 4 0 

ACTGTAAAAT TATTTGCAGA TGAATTAGAA GGTGCGCACA CTGTTGTATG GAATGGACCT 3300 

ATGGGTGTAT TCGAGTTCAG TAACTTTGCA CAAGGTACAA TTGGTGTATG TAAAGCAATT 3360 

GCAAACCTTA AAGATGCAAT TACGATTATC GGTGGCGGTG ATTCAGCTGC AGCAGCAATC 3420 

TCTTTAGGTT TTGAAAATGA CTTCACTCAT ATTTCAACTG GTGGCGGCGC GTCATTAGAG 3480 

TACCTAGAAG GTAAAGAATT GCCTGGTATC AAAGCAATCA ATAATAAATA AT AAAG T GAT 354 0 

AGTTTAAAGT GATGTGGCAT GTTTGTTTAA CATTGTTACG GGAAAACAGT C AC AAG AT G A 3600 

CATCGTGTTT CATCACTTTT CAAAAATATT TACAAAACAA GGAGTGTCTT TAATGAGAAC 3 6 60 

ACCAATTATA GCTGGTAACT GGAAAATGAA CAAAACAGTA CAAGAAGCAA AAGatTCGTC 3720 

AATACATTAC CAACACTACC AGATTCAAAA GAAGTAGAAT CAGTAATTTG TGCACCAGCA 37 80 

ATTCAATTAG ATGCATTAAC TACTGCAGTT AAAG AAG G AA AAGCAEAGG TTTAGAAATC 38 4 0 

GGTGCTCAAA ATACGTATTT CGAAGATAAT GGTGCGTTCA CAGGTGAAAC GTCTCCAGTT 3900 

GCATTAGCAG ATTTAGGCGT TAAATACGTT GTTATCGGTC ATTCTGAACG TCGTGAATTA 3960 

TTCCACGAAA CAGATGAAGA AATTAACAAA AAAGCGCACG CTATTTTCAA AC AT G G AAT G 4 02 0 

ACTCCAATTA TATGTGTTGG TGAAACAGAC G AAG AG C G T G AAAGTGGTAA AGCTAACGAT 4 080 



GTTGTAGGTG AGCAAGTTAA GAAAGCTGTT GCAGGTTTAT CTGAAGATCA ACTTAAATCA 414 0 

GTTGTAATTG CTTATGAACC AATCTGGGCA ATCGGAACTG GTAAATCATC AACATCTGAA 4200 

GATGCAAATG AAATGTGTGC ATTTGTAOST CAAACTATTG CTGACTTATC AAGCAAAGAA 4 260 

GTATCAGAAG CAACTCGTAT TCAATATGGT GGTAGTGTTA AACCTAACAA CAT T AAAGAA 4 320 

TACATGGCAC AAAC T GAT AT TGATGGGGCA TTAGTAGGTG GCGCATCACT TAAAGTTGAA 4 38 0 

GATTTCGTAC AATTGTTAGA AGGTGCAAAA TAATCATGGC TAAGAAACCa AOTCGTTAA 444 0 

TTATTTTAGA TGGTTTTGCG AACCGCGAAA GCGAACATGG TAATGCGGTA AAATTAGCAA 4 500 

ACAAGCCTAA TTTTGATCGT TATTACAACA AATATCCAAC GACTCAAATC GAAGCGAGTG 4 560 

GCTTAGATGT TGGACTACCT G AAG g AC AAA TGGGTAACTC AGAAGTTGGT CATATGAATA 4 620 

TCGGTGCAGG ACGTATCGTT TATCAAAGTT TAACTCGAAT CAATAAATCA ATT G AAG AC G 4 680 

GTGATTTCTT TGAAAATGAT GTTTTAAATA ATGCAATTGC ACACGTGAAT TCACATGATT 4 74 0 

CAGCGTTACA CATCTTTGGT TTATTGTCTG ACGGTGGTGT ACACAGTCAT TACAAACATT 4 800 

TATTTGCTTT GTTAGAACTT GCTAAAAAAC AAGGTSTTGA AAAAGTTTAC GTACACGCAT 4 8 60 

TTTTAGATGG CCGTGACGTA GAT C AAAAAT CCGCTTTGAA ATACATCGAA GAG AC T G AAG 4 92 0 

CTAAATTCAA TGAATTAGGC ATTGGTCAAT TTGCATCTGT GTCTGGTCGT TATTATGCAA 4 98 0 

TGGATCGTGA CAAACGTTGG GAACGTGAAG AAAAAGCTTA CAATGCTATT CGTAATTTTG 5 04 0 

ATGCCCCAAC TTATGCAACT GCCAAAGAAG GTGTAGAAGC AAGCTATAAT GAGGGCTTAA 5100 

CTGACGAATT CGTAGTACCA TTCATCGTTG AGAATCAAAA TGACGGTGTT AATGATGGAG 5160 

ATGCAGTGAT CTTCTATAAT TTCCGACCTG ATAGAGCAGC GCAATTATCG GAAATTTTTG 5220 

CGAACAGAGC ATTCGAAGGC TTTAAAGTTG AAC AAG T T AA AGACTTATTC TATGCAACAT 52 8 0 

TCACTAAGTA TAATGACAAT ATCGATGCGG CTATCGTCTT CGAAAAAGTT GATTTAAATA 534 0 

ATACAATTGG TGAAATTGCA CAAAATAACA ATTTAACTCA ATTACGTATT G C AG AAAC T G 54 00 

AAAAAT ACCC TCACGTTACT TACTTTATGA GTGGTGGACG TACGAGGAA TTTAAAGGTG 54 60 

AACGCCGTCG TTTAATTGAT TCACCTAAAG TTGCAACGTA TGACTTGAAA CCAGAAATGA 5520 

GTGCTTATGA AG T T AAAG AT GCATTATTAG AAGAGTTAAA TAAAGGTGAC TTGGACTTAA 5580 

TTATTTTAAA CTTTGCTAAC CCTGATATGG T T GGAC AT AG TGGTATGCTT GAGCCGACAA 5 64 0 

TCAAAGCAAT CGAAGCGGTT GATGAATGTT TAGGAGAAGT GGTTGATAAG ATTTTAGACA 57 00 

TGGACGGTTA TGCAATTATT ACTGCTGACC ATGGTAACTC TGATCAAGTA TTGACGGaTG 57 60 

AT GAT C AAC C AATGACTACG C Aw AC AAC GA ACCCAGTACC AGTGATTGTA ACAAAAGAAG 58 2 0 

GCGTTACACT TAGAGAAACT GGTCGCTTAG GTGACTTAGC ACCTACATTA TTAGATTTAT 58 8 0 
TAAATGTAGA ACAACCTGAA GATATGACAG GTGAaTCTTT AATTAAACAC TAATATTGTA 5940

AAAGATGTTA AGTAAACGCT TAATGACACT TATTTTTTGA AAATAATAGT AATATCnTTT 6000 

TGTTAAATGA AAGAATAAAG CTATAATAAT TATAGAATAA CTATTTAn 604 8 
(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5602 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

AAACAAGTGC AAG AT AT CAT CGCATTAATT AAGTCGTTAC AAAgTGTAAT TGTAGACaTC 60 

GCTTCCAATA ATGTTGATAC AATTATGCCT GGTTATACTC ATTTACAGCG TGCACAGCCA 120 

ATTTCATTTG CACATCATAT TATGACTTAT TTTTGGATGT TACAACGAGA CCAACAACGA 180 

TTTGAAGATA GTTTAAAACG AATCGATATT AATCCTTTAG GTGCAGCAGC CTTAAGTGGT 24 0 

ACCACATACC CTATCGATAG ACACGAGACA ACAGCATTGT TGAACTTTGG CAGTCTCTAT 300 

GAGAATAGCC TAGATGCTGT TAGTGACAGA GACTATATTA TTGAAACATT GCATAATATT 3 60 

TCTTTAACGA TGGTTCACTT ATCACGCTTT G C AG AG G AAA TTATTTTCTG GTCCACAGAC 42 0 

GAAGCTAAAT TCATTACATT AT C AG AT G C A TTTTCAACTG GCTCATCTAT TATGCCACAA 4 80 

AAGAAAAATC CTGATATGGC AGAATTAATT AGAGGTAAAG TTGGTCGAAC GACTGGTCAT 54 0 

TTAATGAGCA TGCTTATGAC TTTAAAAGGA TTACCTCTAG CATATAACAA AST AT GC AG 600 

GAAGATAAAG AAGGTTTATT CGATGCTGTC CATACAATTA AAGGTTCTTT ACGTATTTTC 660 

GAAGGTATGA TTCAAACGAT GACAATTAAT AAAGAAC G AC TCAATCAAAC TGTTAAAGAA 720 

GATTTTTCAA ATGCAACGGA ACTAGCAGAT TATTTAGTAA CTAAAAATAT TCCATTTAGA 780 

ACTGCACATG AAATTGTAGG AAAAATCGTC TTAGAATGTA TACAACAAGG TCATTATTTA 840 

TTAGATGTTC CTTTAGCAAC ATATCAACAA CATCATTCTA GTATTGATGC C GAT AT T T AC 900 

GATTATTTGC AGCCTGAAAA TTGTTTAAAA CGACGTCAAA GTTACGGTTC AACAGGTCAA 960 

TCATCGGTCA AACAACAACT TGATGTTGCT AAAXAATTAC TATCACAATA AATACGTTAA 1020 

TCTACCTACC CACAATGTCT ATTAAAATTA CATTGTGGGT ATTTTAATGC TCTCTTCGTC 1080 

TTGTTGAACA TCACATTTTT AAGATTCCTA AAATGTTTGA TAATTCTTTT AAATTTATAT 1140 

TACAAAAATG TTATAAATTG TAAAAGAAAT GTGTAAAGCG T T T T C AC AAG CAGGTTTTTG 1200 

TAGTATTTTA AAATTGTTAG ACTACAAATA AAG AG AT G AA AGGATAAAGA C TAT G AC T AA 1260 

CTCTTCGAAA AGCTTCACTA AATTTATGGC TGCTTCTGCT GTTTTTACTA TGGGATTTTT 132 0 
ATCAGTACCT ACTGCTGGCG CTGAACAAAC AAATCAAATT GCAAATAAAC CTCAGGCTAT 1380

TCAATGGCAT ACAAATTTAA CGAATGAGCG ATTCACTACT ATCGCACATC GTGGCGCAAG 14 4 0 

TGGCTATGCA CCCGAGCATA CGTTTCAAGC AT AT G AT AAG AGTCATAATG AGTTAAAAGC 1500 

ATCTTATATC GAAATTGATT TACAACGTAC CAAAGATGGC CATTTAGTTG CTATGCATGA 1560 

TGAAACTGTT AACCGTACAA CAAATGGACA CGGTAAAGTT &GGATTATA CCCTTGATGA 1620 

ATTAAAACAG TTAGATGCAG GAAGTTGGTT TAATAAAAAA TATCCAAAAT ACGCAAGAGC 168 0 

AAG T TAT AAA AATGCTAAAG TACCCACTTT AGATGAAATT TTAGAACGTT ATGGCCCGAA 17 4 0 

TGCAAACTAT TATATTGAAA CAAAGTCACC TGATGTATAC CCAGGAATGG AAGAACAATT 1800

ATTAGCTTCA TTGAAAAAGC ATCACCTTTT AAATAACAAT AAATTAAAAA ATGGACATGT 18 60 

AATGATTCAA TCATTTTCTG ACGAAAGTTT AAAGAAAATT CA.TCGTCAAA- ATAAGCATGT 192 0 

GCCATTAGTA AAATTAGTTG ATAAAGGTGA ACTACAACAA TTTAACGACC AACGCTTAAA 198 0 

AGAGATACGC TCTTATGCGA TTG5ATTAGG TCCTGATTAT ACAGATTTAA CTGAACAAAA 204 0 

TACCCATCAT TTAAAAGACT TAGGATTTAT AGTACATCCT TATACAGTGA ATGAAAAAGC 210 0 

TGATATGTTA CGATTAAATA AATATGGCGT TGATGGTGTC TTTACAAATT TCGCTGATAA 2160 

ATATAAAGAA GTCATTAAGT AGTAATGTTA AACTAGAAAA CATAAATAE AAAATATAGC 222 0 

TATTACTATA AAAAACAGCA GTAAGATATT TCCAAATTGA AATTATCCTA CTGCTGTCTT 22 8 0 

TTTGGGAGTG GGACAGAAAT GATATTTTCG CAAAATTTAT TTCGTCGTCC CACCCCAACT 2 34 0 

TGCACACTAT TGTAAGCTGA CTTTTCGCCA GCTTCTGTGT TGGGGCCCCG CCAACTTGCA 2400 

TTGTCTGTAG AAATTGAGGA GCTAATTTCT CTGTGTCGGG GCTCCACCCC AACTTGCACA 24 60 

CTATTGTAAG CTGACTTTCC GCCAGCCTCT GTGTTGGGGC CCCGCCAACT TGCACACTAT 2 52 0 

TGTAAGCTGA CTTTCCACCA GCCTCTGTGT TGGGGCCCCG ACTATTTTTG AAAAGAGCGT 2580 

GTTACACGGG CATTGTTTTA CAGTCAACTA CTGCTAAAAT AAAATTAACG AGCTTAGGGC 2 64 0 

TTTGTTTTCT GTCCCAAGCT CGTTAAATCA CATATGATAA TTAATTATGC CCAACCACGA 27 00 

TATCTAGCTG CTTCTGCTGT ACGTTTAATA CCTATGATAT ATGCTGCAAG TCTCATATCT 27 60 

ATTTTTCGGT TTTGAGACAA TTCGTAAATC GTATCAAATG CCGCTTCTAA TTTTTJECGT 2820 

AGCTTTTCAT TAACTTCTTC T T C AG AC C AA TAATAACCTT GATTATTTTG TACCCATTCG 2880 

AAGTAAGAAA CCGTtACACC ACCAGCACTT GCTAATACGT CTGGAACTAA TAATATACCA 2 94 0 

CGTTCAGTTA AAATACGTGT TGCTTCTGGT GTTGTAGGTC CATTAGCAGC TTCAACAACG 3000 

ATACTAGCTT TAATATCATG TGCATTGTCT TCTGTAATTT GGTTTGAAAT AGCCGCTGGT 3060 

ACTAAAATGT CACAATCTAA TTCAAACAAT TCTTTATTTG AGATTGTTTC TTCAAATAAA 3120 
TTTGTTACCG TACCAAAACT ATCACGACGG TCTAATAAAT AATCTATATC TAAGCCATTT 3180

GGATCGTGTA ATGCACCGTA AGCATCAGAG ATACCTA&A TTTTTGCACC TAAATCATAT 32 4 0 

AAGAATTTAG CTAAGAAACT TCCGGCATTA CCGAAACCTT GAATAACAAC CTTGGCACCT 3300 

TCAATTTGCA TATTACGACG TTTTGCAGCT TGTTCAATTG CAATAACTAC ACCTAGTGCA 3360 

GTTGATCTGT CGCGTCCATG AGAACCACCC AATACAATTG GTTTACCTGT GATGAAACCT 3420 

GGTGAATTAA ATTTATCTAA TGCACTATAT TCATCCATCA TCCAAGCCAT AATTTGTGAG 34 8 0 

TTTGTAAATA CATCTGGTGC TGGAATATCT TTGTTCGGAC CTACGAATTG TGAAATTGCT 354 0 

CTTACATATC CGCGTGATAA ACGTTCAACT TCATGAATGC TCATTTGACG TGGATCACAA 3 600 

ACGATACCAC CCTTACCACC ACCGTATGGT AAGTTTACAA TGCCACATTT CAAAGTCATC 3 6 60 

CACATTGATA ATGCTTTTAC TTCTTCTTCA TCAACATCTG GGTGGAAACG CACGCCCCCT 3720 

TTTGTTGGTC CAACAGCATC ATTATGTTGC GCACGGTAAC CTGTGAATGT TTTTACTGTG 37 8 0 

CCATCATCCA TTCGTACAGG GAT AC GC ACT TGTAACATTC TTAABGTTC TTTAATTAAA 38 4 0 

TCGTACATTC CTtCGTCAAA TCCCAATTTA TGCAATGCTT CTTTAATAAT TCCTTGAGTA 3900 

GAAGTTACTA AATTATTGTT CTCAGTCATG ATCCTTTTCG CCTCTTCTTT ACCTAATGAT 3960 

TTCGCTTTCA AACATATTGT AACATAACGT ATTCCTTTTT AAAGCCCTTA CAAACTGATT 4020 

GTTACAACTT TTTGACATTA TTGAAATACA TGTCTTATTT TTTCAAGTGC AAGGTCCAAT 4 08 0 

TCTTCTTTAG TAATAATTAA TGGTGGTGCA AAACGAATGA CAGTATCATG CGTTTCTTTA 414 0 

CATAATAAGC CTTCTTCTTT CAAAGCTTCA CAATATGGTC TAGCACTTTC ATTAAGTTCC 4 200 

ACACCTATAA ACAAACC AC G TCCACGG\CT TCTTTAATTG ATGGATGATC AATTTGCTTT 4 2 60 

AATTGTTCTT TAAAATAATC TCCTAATTCT AAAGAGCGGC CTGGTAAATC CTCATCAACG 4 32 0 

ATAACATCTA ATGCAGCAAT TGATGCAGCA CAAGCAAGTG GATTACCACC AAATGTTGAA 4 380 

CCATGTGAGC CAGGTGTAAA GACATCTAAT ACTTCTTTAT CTGCTAATAC ASAGAAATT 4 44 0 

GGGAAGACTC CACCACCTAG TGCTTTACCT AAAATATAGA CATCAGGTTT TACATTATCC 4 500 

CAATCCGTAG CAAATAATTT ACCCGAACGA CCTAATCCTG CTTGGATTTC GTCAGCAATA 4 5 60 

AATAAGACAT TATGTTCATC ACATAATTCT CTAATTGCTT TCAAATATCC TTCTGGCGGT 4 620 

ATATTTATAC CCGCTTCACC TTGAATTGGT TCTACTAAAA CTGCTGCAGT ATTTTCATTA 4 680 

ATTGCAGCTT TCAATGCATC TACATCTCCA AAATCAACTT TTCTAAATCC ATCTAATAAC 4740 

GGACCATAAC CACGTTGGTA TTCTGCTTCT GAAGATAATG AAACTGGCGC CATTGTTCGA 4 800 

CCATGGAAGT TACCATTAAA TGCAATGATT TCTETTTAT TTGGCTCAAT TCCTTTAACA 4 8 60 

TCGTATGCCC AGCGTCGTGC TGCTTTCAAA GCTGTTTCTA CTGCTTCAGC ACCTGTATTC 4 920 

ATTGGTAAAG CTTTATCTTT ACCTGCCAGT TTACAAATTT TTTCGTACCA TTCACCTAAG 4 980 
TTATCACTAT GAAAAGCACG TGAAACTAAA GTCACTTTAT CAGCTTGATC TTTTAATGCT 5040

TGAATAATTT TCGGATGTCT ATGACCTTGG TTAACAGCGG AATATGCAGA TAACATATCC 5100 

ATATATTTAT TGCCTTCAGG ATCTTTAACC CATACCCCTT CAGCTTcTGa AATGaCAATT 5160 

GGcAATGGTA AATAATTATG TGCTCCGTAA TGATTTGTTA ACTCAATAAT TTTTTCAGAT 5220 

TTAGTCATCA TATCTGCCCT TTTCATCATT TATAACTATT AT AC AT GAAA CATTATCCAA 52 8 0 

ATAATTACAT TAGTTTTCAA AGCAGATACT TTTCCACCAA AAAAGATGAA ATAATCACTA 534 0 

AGTTTCATTA AATTTGTCTA TTTTGAAAAC CCTTACATTT ATAATGACAT AATTACTTAA 54 00 

ATGaTTACAA GCAAAAGAAT TGATAATTTT ACACTTAATC AAAGTATAT TTTACTAAGA 54 60 

ATATTTTTAT T T AT AAAT AT TGAAAACCAC TAACAAATTG CATACACAAT ATCATTAGTG 5520 

GTAACAGTTA AACACTTATT TATCTTTACG GGGTAATGGG TTAAAACCCT TnCATTAAAA 5 58 0 

TTGGATGnCC ATAAAATTAG GG 5602



(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5924 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130:

TAACCCCATT TTACCTGGAA AAATCgTTTG CGATGCaATm GCaTTtGaAT ATAaATACAT 60 

TTTACGTATa G AAT TAT AAA AgGTTTCATT CaAATCTTAG GGTCAAAAAT GTTATAATAT 120 

TTTTATGTCA AATTTAAAAC AGTAACACTT ATTTACAAGG TTGCAATATT TTGAAGTAAT 180 

AAAGGAAGTG TCGCGTATTT TAACTTTTTC AGAGCAAAAT GCACTCGCGA AAATAGATGA 24 0 

TTTAATGAAT ACTTATTGCA ATCAATGTCC AATCAAAACT CGTCTGCGTA AATTAGAGGG 300 

G AAAAC GAAG GCGCATCATT TTTGTATCAA TGAGTGTTCA ATAGGGAAAG AAATAAAACA 3 60 

ATTAGGAAAT GAACTTCAAT AGGAGGAAGT CAAATGAAAA TTAETCTAT AT C AG AAAC A 4 20 

CCGAACCACA AC AC AAT GAA GATTACACTT AG TGAAAGC A GAGAAGGTAT GACATCAGAT 4 80 

AC G TAT AC T A AAGTTGATGA TTCACAGCCA GCATTTATTA ATGACATCTT AAAGGTTGAA 54 0 

GGCGTTAAAT CAATTTTCCA TGTTATGGAC TTTATTTCAG TAGATAAAGA AAATGACGCA 600 

AATTGGGAAA CAGTATTGCC AAAAGTAGAG GCTGTATTCG AAT AAAT T T T TCATCAACTA 660 

GTATTCGGGG GGAATAAAGT ATATGGAAAT TTTACGTATA GAGCCAACAC CAAGTCCAAA 72 0 

TACAATGAAA GTTGTTTTGT CATATACAAG AGAAGACAAG TTATCTAATA CTTATAAAAA 780 
AGTAGAAGAA ACACAACCAA GATTTA?AAA TCAGTTGTTA TCTATAGATG GTATCACTTC 840

CATTTTTCAT GTCATGAACT TCTTAGCTGT TGATAAGGCA CCAAAAGCTG ATTGGGAAGT 900 

CATATTACCT GAT AT T AAAG CTGCTTTTTC TGATGCGAAT AAGGTTTTAG AATCTGTAAA 960 

TGAACCTCAA ATTGACAATC ATTTTGGTGA AATTAAAGCT GAATTATTAA ETTTAAGGG 10 20 

TATACCGTAT CAAATTAAGC TAACTTCTGC TGACCAAGAA TTAAGAGAAC AATTACCACA 108 0 

AACATATGTT GACCATATGA CTCAAGCGCA AACAGCACAT GACAATATTG TTTTTATGCG 114 0 

TAAATGGCTA GATTTAGGAA ATCGCTATGG AAATATTCAA GAAGTAATGG ATGGTGTCCT 120 0 

AGAAGAAGTG CTAGCTACCT ATCCAGAATC ACAGTTACCC GTATTGGTAA AACATGCTTT 1260 

AGAAGAAAAT CACGCAACTA ATAATTATCA TTTCTATCGA CATGTCTCTT TGGATGAATA 132 0 

TCATGCAACT GATAATTGGA AGACTCGATT ACGAATGTTA AACCATTTTC CAAAGCCGAC 138 0 

TTTTGAAGAT ATACCGCTGC TTGATTTAGC TTTVTCTGAT G AAAAAG T AC CGGTTAGACG 14 4 0 

TCAAGCGATT GT ATT ATT AG GTATGATTGA AAGTAAAGAA ATTTTACCGT ATT TAT AT AA 1500 

GGGGCTTCGT GATAAAAGTC CTGCTGTAAG AAGAACAGCA GGGGATTGCA TAAGCGATTT 15 60 

AGGGTATCCA GAGGCACTAC CAGAAATGGT GCTACTATTA GATGATCCAC AGAAAATCE 162 0 

TAGGTGGCGT GCTGCTATGT TTATCTTTGA TGAAGGTAAT GCAGAGCAGC TTCCCGCACT 1680 

AAAAGCCCAT ATTAATGACA ATGCGTTTGA AGTTAAATTA C AAAT T G AAA TGGCCATATC 17 4 0 

TCGTATTGAA AACGGAGATG AAGCACTTGG TTCAGTATGG AAACAAATGG CAAACAGAAC 18 00 

AATTTAATTG GAGGAATTAA ATATGAATGC ATATGATGCT TATATGAAAG AAATTGCGCA 18 60 

ACAAATGCGT GGCGAATTAA CTCAAAATGG TTTTACAAGT TTAGAAACGA GCGAACAGct 192 0 

ATCGGAGTAT ATGAACCAAG TAAATGCTGA TGACACTACT TTTGTAGTTA T T AAC T C T AC 198 0 

ATGCGGCTGT GCAGCTGGAT TAGCAAGACC AGCTGCAGTAGCAGTTGCAA CACAAAATGA 20 4 0 

ACATAGACCT ACAAATACAG TTACAGTTTT TGCTGGGCAA GATAAAGAAG CAACTGCTAC 2100 

AATGCGAGAA TTCATTCAGC AAGCACCATC TAGTCCTTCG TATGCTTTAT TCAAAGGTCA 2160 

AGATTTAGTT TATTTTATGC CTAGAGAATT TATCGAAGGT AGAGATATTA ATGACATTGC 2220

AATGGACTTA AAGGATGCCT TTGACGAAAA TTGTAAATAG TACACATAAA T AAAT AT AAA 228 0 

GGTTAACACA TTTTATAATA TTAAAAATGG TGTCTGTCAT T G AAAAT AG A GAATATAGTT 234 0 

GTATTCTATT TGTTAAATAA AGTCCGTTTT TACCaACTAT ATTTTCTAGA AATTTAACTG 24 00 

TTTTAATAGG ACATCAAACA TJATATTCaA ATCaTGTGTT AACCTCTTTT TTAAAATTTT 24 60 

TTAGCATTAA AG T TAT AG AT TTGGGTAAAC AATTACCAAT TGGAAACATA TATCACGTTA 252 0 

CGATGGGGTA GGTACTTAAT CAGCATTTTA T AAAT AAAG T AACGGAATTC ATGATATTAA 2 58 0 

TATCATATTC C T AAAAT GAG T GAT AAC AAA ATGCTACATA AAGTTAAET ATATCAAACT 264 0 
AAATATACAT ACTATAAATA ATGAAAATGA GGTGTTATCG CATATGTTGA ATTCATTTGA 2700

TGCAGCATAT CACAGTCTTT GTGAAGAAGT TTTAGAAATA GGAAATACAC GAAATGATCG 27 60 

CACAAATACA GGTACGATTT CGAAATTTGG TCATCAACTT CGCTTTGACT TATCTAAAGG 2 82 0 

ATTTCCACTA TTAACGACAA AGAAAGTTTC TTTTAAATTA GTAGCAACCG AATTATTATG 2880 
GTTCATTAAA GGAGATACAA ACATCCAATA CTTATTAAAA TATAATAATA ATATATGGAA 2 94 0 

CGAATGGGCT TTTGAAAATT ATATCAAATC AG AC GAG TAT AAAGGTCCAG AT AT G AC AG A 3000 

TTTCGGGCAT CGTGCATTGA GTGATCCTGA ATTTAACGAA CAATATAAAG AACAAATGAA 30 60 
ACAATTTAAG CAACGTATTC TTGAAGATGA TACATTTGCG AAGCAATTCG GGGATTTAGG 3120 
AAATGTTTAT GGTAAACAAT GGCGAGATTG GGTTGATAAA GATGGTAATC ATTTTGATCA 3180 

ACTTAAAACA GTAATTGAAC AAATTAAGCA TAATCCAGAT TCAAGGCGAC ACATGTATC 32 4 0 
TGCATGGAAT CCAACAGAAA TTGATACAAT GGCACTTCCG CCTTGTCATA CCATGTTCCA 3300 
GTTTTATGTC CAAGATGGTA AGTTAAGTTG CCAGTTATAC CAACGTAGCG CAGATATCTT 3360 
TTTAGGTGTG CCATTTAATA TCcGCagctA CGCTTTATTG ACACACCTTA TTGCCAAAGA 3420 

ATGTGGACTT GAAGTGGGTG AATTTGTGCA TACATTTGGA GATGCACATA TTTATTCAAA 34 80 
T CAT AT T GAT GCGATTCAAA CACAATTAGC ACGTGAAAGC TTCAATCCTC C AAC AT T AAA 354 0 

AATTAACAGT GACAAGTCTA TTTTCGACAT AAAT TATGAA GAT T T GG AAA TTGTTGACTA 3600 

TGAATCACAT CCAGCAATAA AAGCTCCAAT AGCAGTCFAG TCATTGCATA GTTAGCTAAC 3 6 60 
CATATAGACA TCAAAATGAC ATCATAGTAT TTTCAAGTGC AAAAAAGTAC TTTTTTGTGT 37 2 0 

TAAACGTTTT CATAAATTAT GCAAAATCAT TATTTCTATC ACACTTTATG ATAAAAATTG 37 8 0 

TGTTAAATTA AAGATAACTT AGTAATAAAA AATGAAATGA TAGAAGAAGG AGGATAATTA 38 4 0 
TGACTTTATC CATTCTAGTt GCACATGACT TGCAACGAGT AATTGGTTTt GAAAATCAAT 3900 
TACCTTGGcA CCTACCAAAT GATTTGAAGC ATGTTAAAAA ATTATCAACA GGTCATACTT 3 9 60 

TAGTAATGGG TCGTAAGACA TTTGAATCGA TTGGTAAACC ACTACCGAAT CGTCGAAATG 4 02 0 

TTGTACTTAC TTCAGATACA AGTTTCAACG TAGAnGGCGT TGATGTAATT CACTCTATTG 4 08 0 
AAGATATTTA CCAACTACCG GGCCATGTTT TCATATTTGG AGGGCAAACA TTATTTGAAG 414 0 

AAAT GAT T G A TAAAGTGGAC GACATGTATA TTACTGTTAT TGAAGGTAAA TTCCGTGGTG 4200 

ATACGTTCTT TCCACCTTAT mCATTkGAgr CTGGGAAGTT GCCtCTTCAG TTGAAGGTAA 4 2 60 
ACTAGATGAG AAAAATACAA TTCCACATAC CTTTCTACAT TTAATTCGTA AAAAATAAGG 4 32 0 

GGGAAAACGA CCATGACAAA ACAGATTATA G T AAC AG AC T C AAC AT C C G A TTTATCTAAA 4 38 0 

G AAT AC T TAG AAGCAAACAA CATTCATGTA ATTCCTTTAA G T T T AAC TAT TGAAGGAGCT 4440 
TCATACGTTG ACCAAGTAGA TATTACATCA GAAGAATTTA TTAATCATAT TGAAAATGAT 4500

GAAGATGTAA AGACAAGTCA GCCAGCCATA GGTGAATTTA TATCTGCTTA TGAAGAACTA 4 5 60 

GGAAAAGATG GCTCTGAAAT CATAAGTATT CATCTTTCTT CAGGATTAAG TGGTACATAT 4 620 

AACACTGCTT ACCAAGCAAG TCAAAEGTA GATGCTAATG TAACTGTTAT TGATTCAAAA 4 68 0 

TCTATTTCTT TTGGTTTAGG GTATCAAATA CAACACCTAG TAGAGCTTGT AAAAgAaGGT 474 0 

GtCTCAACTT CTGAAATAGT T AAAAAGT T A AATCATTTAA GAGAAAACAT TAAATTATTT 4800 

GTAGTTATAG GGCAATTGAA TCAATTAATT AAAGGTGGCA GAATTAGTAA ACAAAAGGT 4 8 60 

TTGATTGGTA ATCTTATGAA AAT TAAACC A ATTGGTACAC TAGATGATGG TCGCTTAGAG 4 92 0 

CTTGTGCmCA ATGCGAGAAC TCaAAATTCk AGTATCCAAT ACTTGAAAAA GGAAATTGCT 4 980 

GAATTTATAG GAG AT CAT G A AATCAAATCC ATTGGTGTCG CACATGCTAA CGTCATTGAA 5040 

TATGTTGATA AATTGAAGAA AGTTTTTAAT GAAGCTTTTC ATGTGAATAA TTACGATATA 5100 

AATGTAACTA C AC C AG T T AT TTCTGCACAT ACTGGTCAAG GTGCGATTGG CCTCGTAGTC 5160 

CTTAAGAAGT AAATTTAATC TTTTCAGTGT TAATTACTTC CATTTCAATC CTTTATAGAC 5220 

TAAATTTATA ATTAGATAGA TAGAGGAGGT AATCATATG ACAAAAGAAT ATGCAACATT 5280 

AGCAGGAGGA TGTTTCTGGT GCATGGTTAA ACCATTTACA TCATATCCAG GCATCAAGTC 534 0 

AGTCGTATCT GGTTATAGTG GCGGTCATGT TGACAACCCA ACTTATGAAC AGGTAT GT AC 54 00 

GAATCAAACC GGCCATGTCG AAGCAGTACA AATTACGTTT GATCCAGAGG TTACTTCCTT 54 60 

TGAAAATATA T TAG AC AT AT ATTTCAAAAC ATTTGACCCA ACT GAT GAT C AAGGGCAATT 5520 

TTTCGATAGA GGCGAAAGCT ATCAACCAGT CATTTTCTAT CATGATGAAC AT C AG AAAAA 558 0 

GGCTGCTGAG TTTAAAAAGC AAC AAT T AAA TGAACAAGGT ATTTTCAAGA AACCAGTGAT 564 0 

TACACCTATT AAACCATATA AAAATTTCTA TCCAGCTGAA GACTACCATC AAGATTATTA 57 00 

CAAAAAGAAC CCGGTACATT ATTACCAATA TCAACGTGGT TCAGGTAGAA AAGCGTTTAT 57 60 

AGAATCACAT TGGGGGAATC AAAATGCTTA AAAAAGATAA AAGTGAACTA ACAGATATAG 5 82 0 

AATATATTGT TACACAAGAn AACGGCACTG AACCACCATT TATGAATGAA TATTGGAATC 58 80 

ATTTTGCTAA AGGATTTATG TAGATAAAnT TCnGGTAAAC CTTG 5 924 
(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9280 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 
GGCCGTTnAA AATCTCCAAA ATAnAAAAAC CCATCTTGTT CCAATGTTTT AAAATCGCCa 60

TCCaACACTT GaTCaATAGC TTGCAACAAC GTTGAACGTG TTTTaCCAAA AGCATCaAAC 12 0 

GCTCCCACTA AAATCAGTGC TTCAAGTAAC TTTCTCGTTT TGACTCTCTT CGGTATACGT 18 0 

CTAGCAAAAT CAAAGAAATC TTTAAATTTG CCGTTCTGAT AACGTTCATC AACAATCACT 240 

TTCACACTTT GATAACCAAC ACCTTTAATT GTACCAATTG ATAAATAAAT GCCTTCTTGG 300 

GAAGGTTTAT AAAACCAATG ACTTTCGTTA ATGTTCGGTG GCAATATAGT GATACCTTGT 3 60 

TTTTTTGCTT CTTCTATCAT TTGAGCAGTT TTCTTCTCAC TTCCAATAAC ATTACTTAAA 4 20 

ATATTTGCGT AAAAATAATT TGGATAATGG ACTTTTAAAA AGCTCATAAT GTATGCAATT 4 80 

TTAGAATAGC TGACAGCATG TGCTCTAGGA AAACCATAAT CAGAAATTT CAGAATCAAA 54 0 

TCAAATATTT GCTTACTAAT GTCTTCGTGA TAACCATTTT GCTTTGCACC TTCTATAAAA 600 

TGTTGACGCT CACTTTCAAG AACAGCTCTA TTTTTTTTAC TCATTGCTCT TCTTAAAATA 660 

TCCGCTTCAC CATAACTGAA GTTTGCAAAT GTGCTCGCTA TTTGCATAAT TTGCTCTTGA 7 20 

TAAATAATAA CACCGTAAGT ATTTTTTAAT ATAGGTTCTA AATGCGGATG TAAATATTGA 7 80 

ACTTTGCTTG GATCATGTCT TCTTGTAATG TAAGTTGGAA TTTCTTCCAT TGGACCTGGT 840 

C T AT AC AAAG AAGTTACAGC AACAATATCT TCAAAGTGTT CCGGCTTTAA TTTTTTTAAT 900 

ACACTTCTTA CACCGTCAGA CTCT7ATTGG AATATGCCAG TCGTATCTCC TTGCGACAAC 960 

AATTCAAACA CTTTTTGATC ATCAAACGGA ATCTTTTCGA TATCAATATT AATACCTAAA 1020 

TCTTTTTTGA CTTGTGTTAA GAT T T GAT G A ATAATCGATA AGTTTCTCAA CCCTAGAAAA 1080 

TCTATTTTTA ATAACCCAAT ACGTTCGGCT TCAGTCATTG TCCATTGCGTTAATAATCCT 114 0 

GTATCCCCTT TCGTTAAAGG GGCATATTCA TATAATGGAT GGTCATTAAT AATAATTCCT 12 00 

GCCGCATGTG TAGATGTATG TCTTGGTAAA CCTTCTAACT TTTTACAAAT ACT G AAC C AG 12 60 

CGTTCATGTC GATGGTTTCG ATGTACAAAC TCTTTAAAAT CGTCAATTTG ATATGCTTCA 1320 

TCAAGTGTAA TTCCTAATTT ATGTGGGATT AAACTTGAAA TTTCATTTAA TGTAACTTCA 1380 

TCAAACCCCA TAATTCTTCC AACATCTCTA GCAACTGCTC TTGCAAGCAG AT GACC GAAA 14 4 0 

GTCACAATTC CAGATACATG TAGCTCGCCA TATTTTTCTT GGACGTACTG AATGACCCTT 1500 

TCTCGGCGTG TATCTTCAAA GTCAATATCA AT^TCAGGCA TTGTTACACG TTCTGGGTTT 15 60 

AAAAAACGTT CAAATAATAG ATTGAATTTA ATAGGATCAA TCGTTGTAAT TCCCAATAAA 1620 

TAACTGACCA GTGAGCCAGC TGAAGAACCA CGACCAGGAC CTACCATCAC ATCATTCGTT 1680 

TTCGCATAAT GGATTAAATC ACTTACTATT AAGAAATAAT CTTCAAAACC CATATTAGA 17 4 0 

ATAACTTTAT ACTCATATTT CAATCGCTCT AAATAGACGT CATAATTAAG TTCTAATTTT 1800 
TTCAATTGTG TAACTAAGAC ACGCCACAAA TATTTTTTAG CTGATTCATC ATTAGGTGTC 1860

TCATATTGAG GAAGTAGAGA TTGATGATAT TTTAATTCTG CATCACACTT TTGAGCTATA 192 0 

ACATCAACCT GCGTTAAATA TTCTTGGTTA ATATCTAATT GATTAATTTC CTTTTCAGTT 198 0 

AAAAAATGTG CACCAAAATC TTCTTGATCA TGAATTAAGT CTAATTTTGT ATTGTCTCTA 204 0 

ATAGCTGCTA ATGCAGAAAT CGTATCGGCA TCTTGACGTG TTTGGTAACA AACATtTTGA 2100 

ATCCAAACAT GTTTTCTACC TTGAATCGAA ATACTAAGGT GGTCCATATA TGTGTCATTA 2160 

TGGGTTTCAA ACACTTGTAC AATATCACGA TGTTGATCAC CGACTTTTTT AAAAAT GAT A 2220 

ATCATATTGT TAGAAAATCG TTTTAATAAT TCAAACGACA CATGTTCTAA TGCATTCATT 2280 

TTTATTTCCG ATGATAGTTG ATACAAATCT TTTAATCCAT CATTATTTTT AGCTAGAACA 2340

ACTGTTTCGA CTGTATTTAA TCCATTTGTC ACATATATTG T CAT AC C AAA AATCGGTTTA 2400 

ATGTTATTTG CTATACATGC ATCATAAAAT TTAGGAAAAC CATACAATAC ATTGGTGTCA 2 4 60 

GTTATGGCAA GTGCATCAAC ATTTTCAGAC ACAGCAAGTC TTACgGCATC TTCTATTTTT 2 52 0 

AAGCTTGAAT TTAACAAATC ATAAGCCGTA TGAATATTTA AATATGCCAC CATGATTGAA 2580 

TGGCCCCTTT CTATTAGTTA AGTTTTGTGC GTAAAGCTGT AGCAAGTTGC TCAAATTCAT 2 64 0 

CCCAGCTGTC CACTGAAACT CCTGACGCAT TCGGATGACC ACCGCCACCA AAATCTTGCG 2 7 00 

CAATATCATT AATAATCAAT TGCCCTTTAG AACGTAATCG ACATCTfiTT TCATTACCTT 27 60 

CATCGACTGC AAATACCCAT ATTTTCAAGC CTTTGATGTC AGCAATTGTA TTAACAAACT 2820 

GAGATGCTTC ATTTGGCTGA ATACCGAATT GCTCCAATAC ATCTTCAGTT ATTTTAACTT 288 0 

GGCAGAATCC ATCATCCATA AGTTCGAAAT GTTGTAAAAC ATAACCTTGA AAC GGCAAC A 2 94 0 

TTTTTGGGTC CTTCTCCATC ATTTTATTTA AAAGCGCATT AT GAT C AAT A TCATGCCCAA 3000 

TTAACTTTCC AGCAATTTCC ATAGTATGTT CTGAGGTATT GTT AAAAAGG AATCGCCCAG 3060 

TATCACCGAC GATACCAAGA TATAAAACGC TCGCGATATC TTTATTAACA ATTGCTTCAT 312 0 

CATTAAAATG T G AGAT T AAA TCGTAAATGV TTTCACTTGT AGATGACGCG TTCGTATTAA 318 0 

CTAAATTAAT ATCACCATAC TGATCAACTG CAGGATGATG ATCTATTTTA ATAAGTTTAC 324 0 

GACCTGTACT ATAACGTTCA TCGTCAATTC GTGGAGCATT GGCAGTATCA CATACAATTA 3300 

CAAGCGCATC TTGATATGTT TTATCATCAA TGTTATCTAA CTCTCCAATA AAAETAATG 3360 

ATGATTCCGC TTCACCCACT GCAAATACTT GCTTTTGCGG AAATTTCTGC T G AAT AT AG T 34 20 

ATTTTAAACC AAGTTGTGAA CCATATGCAT CAGGATCTGG TCTAACATGT CTGTGTATAA 34 8 0 

TAATTGTATC GTTGTCTTCG ATACATTTCA TAATTTCATT CAAAGTACTA ATCATTTTCA 354 0 

TACTCCCTTT TTTAGAAAAG TTGCTTAATT TAAGCATTAG TCTATATCAA AATATCTAAA 3600 

T TAT AAAAAT TGTTACTACC ATATTAAACT ATTTGCCCGT TTTAATTATT TAGATATATA 3660 
TATTTTCATA CTATTTAGTT CAGGGGCCCC AACACAGAGA AATTGGACCC CTAATTTCTA 3720

CAAACAATGC aAGTTGGGGT GGGGCCCCAA CGTTTCTGCG AAATCTATCT TATGCCTATT 37 8 0 

TTCTCTGCTA AGTTCCTATA cTTCGTCAAA CATTTGGCAT ATCACGAGAG CGCTCGCTAC 38 4 0 

TTTGTCGTTT TGACTATGCA TGTTCACTTC TATTTTGGCG AAGTTTCTTC CGACGTCTAG 3900 

TATGCCAAAG CGCACTGTTA TATGTGATTC AATAGGTACT GTTTTAATAT ACACGATATT 3 9 60 

TAAGTTCTCT AT CAT G AC AT TACCTTTTTT AAATTTACGC ATTTCATATT GTATTGTTTC 4020 

TTCTATAATA CTTACAAATG CCGCTTTACT TACTGTTCCG TAATGATTGA TTAAAAGTGG 4 0 80 

TGAAACTTCT ACTGTAATTC CATCTTGATT CAT TGT TATA TATTTGGCGA TTTGATCGTT 414 0 

AATTGTTTCA CCCATCTGAG GCTGTCTTCC TAAAAGTTGC ATAGACTTTA AAACATCTTG 4200

TCTATTAATC ACACCCACTG TCTTTTTATT ACTCGAAACG ACAGGAATCA ATTCAATACC 4 2 60 

TTCCCAAATC ATCATATGCG CACAACTTGC TACTGTACTC ATAGCATTTA CATAAATAGG 4 320 

ATTTCGCGTC ATCACTTTAT CTATTTCGTC GTCGTCCTTT GTffiTAATCA TCTCTCGACT 4 38 0 

TGTTACAATA CCTACTAATT TATACGACTC ATTGACTACC GGAAATCTTG TATGGCCAGT 44 40 

TCGATTCGCC ATACGCTTAT AATCTGCTAT TTTCaTCGTA TCAAACAGCA CAGATAAATC 4 5 00 

ATCTAATGGC G T CAT TAT AT CTTGAACTAT TAAGATATCT TTTCGTATTT TCTGATTAAA 4 5 60 

AAGTGCTTTG TTGATAATAT TTGCAACTAG GAATGTATCA TAACTTGATG ATAGAACAGG 4 620 

TAAATCATGT TCATTCGCAA AATTAATAAC TTTATTAGAT GGCTTAAATC CACCAGTAAT 4 680 

TAATATAGCC GTACCTCTTT TTAAAGCTTC AATCTGCACA TCTTCACGAT TTCCGACAAT 4740 

CAATAATGTC TTTGGACCAA TATACTTTAA AATATCTTTG AGTTCCATTG CTCCAATTGC 4800 

AAATTTAGAT ACCATCTTAG TGATACCTTT GTTGCCACCT AACACTTGGC CATCAATAAT 4 8 60 

ATTGACAATT T CAT T AAAAG TTAAATGTTC AATTTCATTA CGATTACGTT TTTCGATTCG 4 920 

AACCGTACCA ACACGATCTA TCGTTGCGAC CATGCCCATT TTATCAGCATCTTTmATTGc 4 98 0 

ACGATATGCT GTCCCytCaG ATACGTTTAA AAATTTAGCG ATTTTACGCA CCGAAATTTT 5040 

AGAGCCTATA GATAACGATT CAATATAATC TAAAATTTGT TCATGTTTTG TCATTCTTTA 5100 

CCTCTTCTTT TCGAACAGTA TTAACTACAT TATAACTTTA TTTTGGATAA AAAGCATTGA 5160 

AGTGAAATGA AATAATGATC GTTtCACCTA TTTTATTTTT TGAAAATATA CAACAAACAC 5220 

AAAGAT C AC A AAATCTTTAA TTTTAAATGG AAAAATCCAT TATTATTTAT TAGAATGTAA 5280 

GTGAGGAGGG ATGTACTAAT GTATAAAAAT AT AT TACT TG G T G TAG AC AC TCAGTTAAAA 534 0 

AATGAAAAAG C AC T AAAAG A AGTGTCTAAA TTiGCTGGCG AAGGTACAGT CGTAACAGTT 54 00 

TTAAACGCAA TCAGCGAACA AGaTGCTCAA GCATCAATTA AAGCAGGTGT TCATTTAAAC 54 60 
AAACTTACTG AAGAACGAAG CAAGCGATTG GAAAAAACAC GCAAAGCTTT AGAAGATTAT 5520

GGTATTGATT ATGACCAAAT AATTGTTCGT GGTAATGCAA AAGAAGAACT ATTAAAAST 558 0 

GCTAATAGCG GTAAATATGA AATTGTTGTT TTAAGTAACC GTAAAGCAGA AGACAAAAAG 5 64 0 

AAATTTGTAC TTGGAAGTGT CAGCCACAAA GTAGCAAAAC GTGCGACTAT CCCTGTATTA 57 00 

ATCGTTAAAT AAAATTTTTA TCCAGAATCA CAAATAATCT TTCAATCATG ATGCAGTCTC 57 60 

AAACGACTGA GTAAATACAA GAAACGATTA TGACTGTGGT TCTGGATTTT TTATATCGTA 5820

GTAAATTTAT AATCAATGTC TAATTGTATA AAACTAAAAT TACGAGAGTA GGTCAGAAAT 5880 

GATAAAGAAC CACTGATGTC CCCCGTCCAC GTCGTAACTG AATCAGTAGA ATATAAAAAC 5 94 0 

AC C C AC T AAA AAT AT GCAGA CGATAACTTC CACATAGATT AGCGAGGTGT TTTTTAGTGT 6000 

AAAATCTATA TTCTATTTAA AACTGAACAG ATTCACCTGG TTTTAAAATT TGCACGTCCC 6060 

CTACATTAAC AGCATCTTTA AATTGTTGTG GATCTTGTTC GATTAATGGG AATGTATCAT 6120 

AATGAATCGG TACAGAAATT TTTGGTTTAA TAAATTCATT AATAGCATAA CTTGCATCAT 6180

CAATACCCAT CGTAAAATTA TCTCCAATTG G T AC AAAAC A TACATCAACT GGATGACGTT 624 0 

TCGCAATAwG TGACATGTCA tAAAcAGACC TGTATCACCA GTATGATAAA TTGTTTTTCC 6300 

TTCAACTTCA AACACGATAC CCATTGGCAT ACCTAAATAA ACTGGgAATA CCATTTTCAT 63 60 

GTGTAAAACT TGAACTATGA ffiTGCTTGAA CAAATTTAAC GCTTCCGAAA TCAAaGTTTG 64 20 

CTTTACCACC AaTATTCATA CCATGAACAT TTTCAACACC GTGATATGAA GAAAG AT AG T 64 8 0 

CAGCCATTTC TGCACTTCCA ATTACTGTTG CTCCTGTTTT CTTTGCTAGT TCCACAACAT 654 0 

CACCAAAATG ATCAAAATGA CCGTGCGTTA AAACGATATA GTCTACEGC ACTGTTTCAA 6600 

TATTCAAATC ACACTTAGGG TTATTTGAAA TAAACGGATC TACGATAACC TTTTTGTTGT 6660 

TCCCTTCTAA ATAAATCGTT GATTGACCAT GAAATGATAA CTTCATTTGA GCATCCTCCT 67 2 0 

ATCAATTACT ATATAAATTT AGTACCCTTT TGCCACTTAA TTATAACAAA TTCTCAAATT 67 8 0 

TTAAAAATTG AAAATCTAGT TAATGTATTA GCTCGATTTT GAAATCTAAT AATAATTGGC 68 4 0 

ATAAAATGGA AGTAATATTA TGTTGAGGAG TGTTTATAAA ATGACAAAAA TATCAAAAAT 6900

AATAGACGAA TTGAACAATC AACAAGCTGA TGCAGCATGG ATTACAACAC CGTTGAATGT 6960 

ATATTATTTT ACTGGATACC GTAGCGAACC CCATGAAAGA TTATTTGCAT TATTGATTAA 7 02 0 

GAAAG AT G G T AAACAAGTAC TATTTTGTCC AAAAATGGAA GTCGAAGAAG TCAAAGCATC 7 080 

ACCTTTCACA GGTGAAATCG TTGGATATTT AGACACTGAA AACCCTTTTT CACTTTATCC 714 0 

TCAAACAATC AATAAATTAC T AAT T GAAAG CGAGCACTTA ACAGTAGCAC GCCAAAACA 7200 

ATTAATCTCT GGTTTCAATG T C AAT T CAT T CGGAGATGTT GATTTAACAA TCAAACAATT 72 60 

GAGAAATATT AAATCCGAAG ATGAAAT TAG CAAAATACGT AAAGCTGCTG AGTTAGCAGA 7 32 0 

TAAGTGTATC GAAATAGGTG TTTCTTATTT AAAAGAAGGT GTGACTGAAT GTGAAGTAGT 7 38 0 

CAACCATATT GAGCAAACTA TCAAACAATA TGGCGTCAAT GAAATGAGTT TTGATACGAT 74 4 0 

GGTTTTATTT GGAGATCATG CCGCATCACC TCATGGCACA CCAGGAGATC GCAGATTAAA 7 50 0 

AAGCAATGAA TATGTACTAT TTGATTTAGG TGTAATTTAT GAGCATTATT GTAGCGATAT 7 5 60 

GACACGTACT ATTAAATTTG GTGAACCTAG CAAAGAGCA CAAGAAATTT ATAATATTGT 7 620 

AT T AG AAGC A GAAACATCTG CAATCCAAGC AATTAAACCT GGAATACCAT TAAAAGATAT 7 680 

CGATCATATC GCTAGAAATA T TAT T T C AG A AAAAGGTTAT GGTGAATATT TCCCTCATCG 774 0 

CTTAGGTCAT GGCCTAGGAT TACAAGAACA TGAATATCAA GATGTTTCAA GTACTAATTC 7 800 

TAATTTGTTA GAAGCTGGCA TGGTTATTAC AATCGAACCA GGTATTTATG TACCTGGTGT 7 8 60 

TGCAGGTGTA AGAATTGAAG AT G AC AT AC T TGTCACTAAT GAAGGATATG AAGTATTAAC 7 92 0 

ACATTACGAA AAATAAGGAG TGGGATAAAA ATGAAAAGCT TGTTACAAGC GC AT T C T CAT 7 98 0 

TCAGTCAAAC ACTGCCAATA TAACATTGTA GCGCCTAAGA CATAAATTTT TATCCAAGTC 80 4 0 

TAAATGCAAT ATGTAACAAA CAAGCTAGAA AC AC AT AT GC AGGTATGTTC ATCAGTAACA 8100 

TGTAATGAAT CAAATCAATA TCATTCATGT TCGATGATTT CTTCGCATTG TTTCTAGCTT 8160 

TAATTTATCA TTATTTAATT TTAATAACCA AGGAGATGAT AAGTCATTC TTTAGTACGC 8220 

TGTAATCCAT TCCCTTTTCA TCAAATTCAA ATTATAATTG TAATGCTTCT TCTACAGATT 828 0 

TATATTCCAT TTCAAATGCC TCTGCAACGC CTTTATTGGT TACGTGACCT TTGTAAGTAT 8340 

TTAAACCTAA TGATAATGGT TGATTTGATT TAAATGCTTC TCTATACCCT TTATTAGCTA 8400 

GCATGAGCGC ATAAGGTAGC GTAgCATTAT TTAAAGCTAA CGTCGAAGTA CGCGGTACTG 84 60 

CACCTGGCAT ATTTGCAACT GCATAATGAA CCACACCATG CTTAATATAT GTAGGATCAT 8 52 0 

CATGTGTCGT AATTTTATCA GTTGtTTCAA AAATACCGCC TTGATCAATA GCAATGTCAA 8580 

TAATAACTGA CCCATTTTTC ATTTGTTTAA TCATGTCTTC TGTTACAAGT CTTGGCGCTT 8 64 0 

TAGCACCTGG AATTAAAACT GCACCTATTA CTAAATCACT TTGTTTAACA TACAACTCAA 87 00 

TATTCAACGG ATTTGACATA ATTGTATGTA CACGTCCACC GAATAAATCA TCTAATTGTT 87 60 

GTAAACGCTT TGGATTAACA TCTAAAATCG TAACATCTGC ACCTAGTCCTAGTGCAATTT 8 82 0 

TAGCTGCATT TGTTCCTGCT TGACCACCAC CGATAATAGT TACTTTACCC TTAGGTACTC 8880 

CTGGGACACC ACCTAGTAGA ATTCCCATAC CACCATTAAG TTTTTGTAGG AACTCTGCGC 8 94 0 

CAACTTGAGC TGACATTCTT CCTGCTACCT CACTCATTGG TGATAACAAT GGTAAAGATC 9000 

GGTCTGGTAA CTGCACAGTC TCATATGCAA TACTAATTAC TTTTCTATCT ATCAAAGCTT 9060 

GTGTTAATTT TTCTTCATTT GCTAAATGAa gatAaGTGAA TAATACAAGC CCTTCTTTAA 9120 
AATATGGATA TTCAGATTCA AGTGGTTCTT TAACTTTAAT AACCATATCC ACATCCCAAA 9180

CTTTTGCTTG TTCAGCAACA ATCTCAGCAC CECTTCTTT GTAATCTACA TCTTCAAAGA 924 0 

ATGATCCTGA ACCCGcATTT GTTTCCACTA AAACAGTATG 9280 
(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132:

CTGATTAATC TCTTGTTGTC GTGTATTTAC TAATTGAATC GTTGGTGTCT GAACACGTCC 60

CAGGGATAGC TGTGCATCAT ACTTTGTTGT TAGTGCACGC GTTGCATTAA TCCAACAAT 120 

CCAATCTGCC TCACTTCTCG CTAACGCTGC ATAATACAAA TCGTTATATT GACGACCGTC 180 

TTTTAAATTT TTAAAACCTT GTTGAATAGC TTTTTTAGTA ACTGAGCTAA TCCATAATCG 24 0 

ACGGATTGGC TTTTTGTTAC CAACTTTATC CAAAATCAAT CTTGCAACTA GTTCACCTTC 300 

TCGTCCaGCA TCTGTTGCAA TAATAATATC TTTCACTTTA TTATCTAAAA TTAACGCTTT 3 60 

TACTGTTTTA AATTGTTTGC TTGTTTTACC AATAACAACA GTTTTCATAT ATTTAGGTAT 4 20 

AATTGGAAGG TCTTCTAATC GCCATTCCTT TAAATTTTTA TCGTATTGTT CAGGTGTCGC 4 80 

ATTTGTCACT AGATGACCTA ACGCCCACGT GACAAATAT TGGTTATTTT CAAAGTAACC 54 0 

ATTACGCTTC TGATTTATTT GTAAAGCATC AGCAATATCT CTTGCGACTG ATGGTTTTTC 600 

AGCTAATATT AAAGATTTCA TAAATTATCC TTTCTCATAC GTTCTTTTAT TTCGAACGTG 660 

CTTCATCTAT TCCACTAATC TTTGATTTAA ATTCAATGAT TGCAAATGAT GTGTTAAATG 7 20 

TATTGTAACA TGTTAATATC ACTATTAACT TTCATTTCAG TTGAAATACT ATATAATAAA 780 

AGTAACAAAA AGTACGGAGG TAATGACATG AGCATAGTTC AG T TAT AT G A TATTACACAA 840 

ATAAAATCGT T CAT T G AAC A TTCGAATTAT GAATCAGCAT CATACTTATA TAAACTTCCT 900 

CAACAGTACA AT G AAA TAG A TGTATTAATA ACCGATGCGA TTGAATCACC TGGTGTATTT 960 

TCGATTAAAG AAAACGATTC AATCAAAGCA ATCATATTGT CTTTTGCATA CGATAAAAAT 1020 

AAATTCAAAG TCATAGGCCC TTTCGTGGCT GACAATTATG TATTATCTGT CGATACGTTT 1080 

GAAACGCTAT TTAAAGCAAT GACTTCGAAC CAACCTGACG AECCGTCTT TAACTTTTCT 114 0 

TTTGAAGAAG GCATTCAACA ATACAAACCA TTAATGAAAG TTATTCAAGC AAGTTATAAC 1200 

TTCACTGACT AT T AC AT AG A AGCCCGTACA AG AT T AG AAG AAGATATGCA CCAACCAAAT 12 60 

ATCATTCCTT ATCACAAAGG GTTTTATCGT GCTTTCAGCA AAT T AC AC AC AAC T AC AT T T 1320 
AAATATCAGG CACAGTCACC ACAAGATATC ATTGATAGTT TAGACGACCA TCATCATTTG 1380

TTTTTATTTG TTAGCGAAGG TTTACTTAAA GGTTATTTAT ACCTTGAAAT TGATTCACAA 14 4 0 

CAGTCAATCG CCGAGATTAA ATACTTCAGT TCTCATGTAG ATTACCGTTT GAAAGGTATC 1500 

GCTTTCGAGT TGCTTGCGTA TGCAtTGCAA TATGCTTTTG ATAATTTTGA TAT T AG AAAA 15 60 

GTTTATTTTA AAATTCGTAA TAAAAATAAT AAACTCATCG AACGATTTAA TGGTCTAGGT 162 0 

TTCCATATCA ACTATGAGTA CATTAAATTC AAATTCGAAT CACGTAACGT AAAAGATCAA 168 0 

ACAATCCCTG AATAAAACAC CAAGCAAATA CCCTACAGTA CATCATTAGCATGTATTGTG 17 4 0 

GGTTTTTCTA CTTTTTGTAA ATATTGAAAA TTATAAGTAG TTGTTTTTTA CTATTAGGGC 1800 

AGAATGCTTT ACAATAACAT GCAAGTGTCA ATTAAGGGGA GCACTTGCAT AAATAGTATA 18 60 

GGAGAGTGAG TAGTCTTGCA ATTTCTTGAT TTCTTAATCG CACTTTTACC TGCTTTATTC 192 0 

TGGGGAAGTG TCGTTCTTAT TAATGTGTTC GTCGGCGGTG GACCTTACAA CCAAATTCGT 198 0 

GGTACGACGC TAGGCGCACT CATTGTTGGA TTAGGTTTAC TTATTACTGG TTTCGCAAAA 2 04 0 

TTCAATAATC CTACTGTAAT TATTGTCGGT CTTATTTCTG GTGCATTATG GGCGTTTGGA 2100 

CAAGCGAATC AGCTTAAATC TATTAGTTTA ffCGGTGTAT CAAATACTAT GCCAGTTTCT 2160 

ACAGGTATGC AATTAGTTGG T AC AAC AT T A TTCAGCGTTA TCTTTTTAGG TGAATGGTCT 2220 

TCAATGACTC AAATTATCTT TGGTTTAATC G C CAT GAT AT TAT TAG T T AC TGGTGTAGCA 228 0 

CTTACTTCAC TTAAAGCTAA AAATGAACGT CAATCAGATA ATCCTGAATT TAAAAAfflCA 2340 

ATGGGTATTT TAATTGTATC TACAGTTGGA TATGTAGGTT TCGTTGTACT TGGTGACATC 2 4 00 

TTTGGTGTTG GTGGAACTGA TGCATTGTTC TTCCAATCTG TCGGTATGGC AATTGGTGGC 2 4 60 

TTTATCCTAT CCATGAATCA TAAAACATCA CTTAAATCAA CAGCACTTAA TCTATTGcCA 2 520 

GGTGTGATTT GGGGAATTGG TAACTTGTTC ATGTTCTATT C T C AAC C AAA AGTTGGTGTA 2580 

GCTACAAGTT TCTCATTATC ACAGTTACTT GTTATCGTTT CAACCTTAGG CGGTATTTTC 2 64 0 

ATTTTAGGAG AAAGAAAAGA TCGTCGTCAG ATGACGGGTA TTTGGGCAGG TATTATTATT 2700 

ATCGTGATAG CTGCTATAAT TCTAGGTAAT TTGAAATA(A AAGTTAAATA CTCATGTAAC 2 7 60 

GTAAAAATGT AATCACTTCT GAAAATAACC ATTCACTTAT AGAATGATTA AAATTAATTT 282 0 

TCGGGAATTT TACGTTGAAT GTTCCTCTAT ATGTCCTAGG AAATACGTGG CTCTAAAAAC 2 88 0 

AAAACGCAAT AACACATCAT GACATTAATC ATGCGTTTTA AGACTTTAAA ATTAGCGATA 2 94 0 

CTTTTAAAAT C T T GAT GAT A TTCATATATC AAGTATGCGC CAT AC AT AT G AAGTGGATAG 3 00 0 

CTGCATAACG CACTGCATTA TCAACTTGAA TGTATGAGTT G AAC AAC TAT GTCATAAATA 3060 

AAAGCCCCCT TTTCACAATA TACATTTACA TATTGTGGTA AAGGGGGCTC TCATTTTCTA 312 0 
CGAATACTAA AATGGATTTT ATTTTCAAAT GTGTAAACTA GACAAACACT GCCTGATACA 3180

CGTACAAAAT AATGATACTA ATAATGATTG TCAAATTGGT CGTCATACCT ATAAATGGCA 32 4 0 

GTGTTCGATA TTTAAACTGA ATACCATAAG AAATAATTGC AACACcTACC GGGAACATCC 3300 

AAGTGACCAA CAATGTCGTC TTAATCATAT CATCTGATAC TGGTAEAAC ACATATACTA 3 3 60 

ACAATCCCGC AACTAATGCT AATCCATAAT GCAAACATAA ATATTTAATA GTAGCAGGTA 34 20 

TATACTTTCT TTCCAGAGTA AAATTCAACA TGACACCTAG CAAAATCATT GATAACGGCA 34 80 

TATTTGCATG GGAAAGTATG CTAAAGAAAT CGATTGCCAC ATGTGGTAAA TGGATGTGAC 3 54 0 

TTATATTCAA TATAAACATT ACAATGTATG TAACGAGTGG CACTGATTGT AATAATTTCT 3 600 

TACCTAAATA TTTAAAATCG AAT TGATCAC TACCTTCACT AAAGTAGCTA CCTACAAAGT 3 660 

AAGTAATTCC AAAC AT C AC A AAGGCACCAC CTATATCAGC CATAACAAAA TAAATAAGTC 37 2 0 

CCGTTTTAGG CCATATCACT TCAAT TAGTG GATATGCAAA CAATCCAATA TTCATAGCAC 37 8 0 

CAGTCATCAT ACCTACTGAT CCACGCATGT GATTATCATA TTTTCTAAAA AAC C AG AC AG 38 4 0 

CAATCATTTT CGCCACAATA CCATATATAA T C AT TAAAAT TGGTAAAATG GAGAATGACA 3 900 

ATTTTAATTC TGCACTGTTT AAATTCACAA TAACTAAAGA TGGGAGTGTG ACATAAGAA 3 960 

CTAATGTAGC AATGACTTGA CTATCTGTTG CTTTTATAAA ATTAATGCGC TTCAAAAAGT 4 020 

AACCAAGCGC AAT T AAT AAA ATAATCATAG TAAATTGTTC TGTCACTGTT ATCCCTTCTT 4 080 

TCAAT AATCT TCATAATTTA T AAC T T T AAC ATACTCCACA GATATTTTAG AAGTCTACTG 414 0 

TTTCATGCTA TAATCTACAT TAAATGCACT TAATTATATT TCAAAGGAGT GTTATAGTAT 4 2 00 

GTCTTTAGAA AACCAACTAG CCGAACTTAA ATATGATTAT GTTCGTCTTC AAGGTGACAT 4 2 60 

AG AAAAAC GG GAATCTTTGA ATT TAG AT AC TTCCGCACTT GTTCGTCAAC TTAAAGATAT 4 320 

TGAAAATGAA AT T AGAAACG TTCGTGCTCA AATG(AAGAT TAATAATCTA TCATTCAAGC 4 380 

AATAAATGCT TTTTGTTACA TAAATTTGAC TAGCATTGCT CTGAATACGT TAT AT T GAT G 4 440 

AATTGCTTCA TTTTTCGCTC AAT T AC AT C T AG AAT C AC AA GATGTTGTCG TGTTATGATT 4 500 

TAGTGTTTCA T T AAC AAC AT ACACGCATAT CTATCCCAAC ACTGCTATTT ATGTTTTCTA 4 5 60 

CGCTGnTGTA C T AC AT GAAC CCTTTGAAAC GGAGAGGAAG TTATCATATG CAATTTTAnC 4 620 

TGATTTTACT AGCAATACTT TAACnAATTG nTAGTTTAAT AGAATTTTA 4 669 
(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2785 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133:

TTTGCACCCA TCTGaTACAA T GC AC CAT GC GGTTTAACAT GATTAATTTT AACTTGATGA 6 0 

AT G C G AC AAA ACCCTTGTAA TGCACCTAAT TGATAAATCA TCAAATTATA AATCTCGTCG 12 0 

T TAG AG AT AT CTATATTTCG TCTGCCAAAG CCTTTCAAAT CAGGTAAACC AGGATGTGCA 18 0 

CCTACTGCAA CATTATGTGC TTTGGCAAGT TTTACCGTTT CATTCATTAC ATTTTCATCA 24 0 

CCAGCGTGAA AACCACAAGC AACATTCGCA CTTGTAATA ACGGAATAAT TTGATGATCA 300 

CCACCAAAGG AATAATTTCC AAATGCTTCG CCTAAATCAC AATTCAAATC AACTCGCATT 3 60 

ATAATTCCAC CCCTTTAACA ATTTGATGTT TTTCTAAAAA TTTAATATCA ACATCTTTTG 42 0 

CATCTCCATC AC GAT AT AG T GGATAATTTA AAACTGCATA TAAAAAATCG GCAGTTGTAG 4 80 

AAAAT C CAT C TATCACCATT TCATCTAAGG TGACTTTCAA CTTATCAATT GCTGAAGCTC 54 0 

TATCATGAGA TTTTACAATT ACTTTAGCTA CTAAAGAATC ATAATATGGT GAAACTTGAT 600 

AACCGTGATA TAGTAAAGAA TCGACTCGCA CATTAAAGCC TTGAGGTAAA TGTAACGCTG 660 

TCACTTTACC TGGTGTTGGT TGAAATTTCT TTTCaGGATT TTCGGCATTT ATTCTCGCTT 720 

CTATCACATG ACCATTAAAT TGAATATCGC TTTGTGAAAA AGGTAAATGA TTATGTTCCA 78 0 

ATAAATACAG TTGTGCTGCA ACCAAATCAC GTTCTGCTCG CATCTCTGTA ACAGTATGTT 84 0 

CAACTTGTAT TCGAGCATTC ATTTCAATAA AGTAATGTGC GGTACAGTT ACTAAAAATT 900 

CAATCGTACC TGCACTTCTA TAATTTGCTG CACGTGCAAC TTTAACAGCA TCGTTACATA 960 

TTTGTTGTCG TCTTTCTTCA GTTAATGCTG CACAAGGAGA TTCTTCGATT AATTTTTGAT 1020

TTTTACGTTG TACAGAACAA TCACGTTCCC CTAAATGTAC ATAATTATCC TGCCCATCTC 1080

CCaTAACTTG AACTTCAACA TGTTTTGcAA CAGGTATAAA AGCCTCAACA TAAACACGAT 1140

CATCATCAAA GTATTTTTTT CCTTCACTTT TAGCTTCTTT AAATGCCTTT TCTAAATCTT 1200 

CAGCTTTCTT TACAATACGT ATACCTTTAC CACCACCGCC ACTGGCAGCT TTGATAACAA 1260

CTGGATAACC GATGTCTTTG GCAAGATTCT CAATTTCAGA CACATGATTC ACAGCACCAT 1320 

TTGATCCTGG AATCACAGGA ACACCTGCAT GATGAACTGT TTGTCTTGCT GTTATTTTAT 1380

CCCCCATCAT TTCCATCGTT TTTTTAGTAG GCCCTATAAA CGCTATGCCT TGTTCCTCAA 1440

CGGTTTGAGC AAATTTTGTT GATTCTGATA AAAAGCCATA TCCTGGGTGA ATGCATTAG 1500

CACCAGTGAT TTGTGCAGCA GATATGATGC GGTCAATATT TAAATAACTA TCTAAAgCAT 1560

TArcwTCCCC AATACATATA GCTTGATCTG CTAAATGTAC ATGCAAGCTT TGCTCGTCCC 1620

CTTTTGCATA AACTGCTACA GTTTCAATCC CATATTCTCT GCAAGCTCTT ATAATCCTTA 1680

CAGCAATTTC ACCTCTGTTC GCAATTAAAC AACGAAGCAT TTACTTACCC CCTTTACTTA 1740

ATACGTACCA AAACTTGGTC GTATTCAACA TTTGTGCCAT GATCAGCTAC TATTTCAGTA 1800

ATTTCTCCAG CAACATCTGT TGTTACCTCG TTTAATACTT TCATCGCTTC AACATATCCT 1860

ATAATATCTC CCTTGTTAAC TTTGTCACCG ACATTCACAA TTGGTTCAGT TAATTCTTTA 1920

CTATCTTGTA AAAAGAATGT ACCTATCATT GGTGATTTAA TGTCATGATA ATCATTTGTC 1980 

GAAACATCGG AGTTATCATT CGCTTTTGAA GCTGTCAAAT CATTATTGTT CATACTTTGA 2040

TTTGATTGAT TACTGTGTGC AGCCAAATGA TTCGAGTCAG TGAAGTCAAT TTCTATTTCA 2100 

TCTTCAAAAT TTTTATATTT AAATTTCTTA ACATCATTTT CCTTCACTAA TTTGATTATT 2160 

TGTTCGATTT nTTCAATATT CATTTTACAA ATCCCCTTTT AAAATTGTTG CTAATTTTTT 2220 

CGAAGTATGT CGCAAGCTAG ATGTATCAAA AATTGGAGTC TTTTGATGAC TCTTAAGAAT 2280

TTCATTAAAC AGAGACATTT GTTCCCGATT CTTATCTACA GCTTCTTGGA ATGATATCCA 2340

TTTAAATTGA ATTGTATCTT GTGGTTTCAT CTGTGCTAAC TTTGGCAGAT CAAATTTGCA 2400

TACAGTTGCA ATTTTGGTAT AACCACCTAT CGTTTGTTTA TCATTAAGCA GAATAATAGG 2460

TTGACCATCA TTTGGTACCT GAACACTACC AAGAGCAACC GTTCAGAAA TGATATCTGC 2520

TTGATTAAtT GGTGCAACGC TGTCACCTTC CAAACGATAG CCCATACGGT CTGATTGTTC 2580

AGTAATTAAA TATGGATGAT TTACAATTTT CGCTCTAGCC TCTTCAGAAA ATGCCTCGAA 2640

TTGAGGTCCT TGAAGAATGT GTATAATATT ATTTTCTGGC AATAAATCGT CCTGTAAATG 2700

AATCGTCTTT CCAATGTTTT CTTTAAAGTC ATTATTTATT TTCACTGTTA TTACATCATC 2760

AGCTAATAAC TTTCTACCTT TGAAT 2785
(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

AATGGAAACG GTTGAAACAG CAATTATTAC TATTTCTATG GGTGAAGGTA TTTCAGAGAT 60

ATTTAAATCA ATGGGTGCOA CACATATCAT TAGTGGTGGA CAAACGATGA ATCCTTCTAC 120

AGAAGATATC GTTAAAGTCA TTGAACAATC AAAATGTAAA CGTGCAATTA TTTTACCGAA 180 

TAATAAAAAT ATCTTAATGG CAAGTGAACA AGCAGCGAGT ATTGTTGATG CAGAAGCTGT 240

TGTTATTCCA ACGAAATCTA TTCCTCAAGG TATAAGCGCA CTATCCAAT ATGATGTGGA 300 

CGCAACACTT GAAGaAAATA AAGCGCAAAT GGCTGATTCA GTAAATAACG TTAAATCTGG 360

TTCATTAACG TACGCTGTTC GTGATACGAA AATTGATGGC GTTGAGATTA AAAAAGACGC 420

GTTTATGGGC TTGATTGAAG ATAAGATTGT AAGCAGCCAA AGTGATCAAT TAACAACGGT 480

TACTGAGTTG TTAAATGAGA TGTTAGCAGA AGATAGTGAA ATATTGACTG TGATTATTGG 540

TCAAGATGCA GAGCAAGCAG TTACAGATAA CATGATAAAC TGGATCGAAG AGCAATATCC 600

AGATGTAGAA GTGGAAGTTC ATGAAGGTGG ACAACCAATT TATCAATATT TCTTTTCAGT 660

AGAATAAAAA TTTAAAATAA AAAACTACCA ATGATAAATC ATCAGTTGGT AGTTTTTTAT 720 

TTTGCTATTT TAGTGATATT GCGGGTTAAA AGTATCGTTC TCGAGTTGGT AACAATGTCA 780

TGTTCAACTT AGTCATGATA AAATAAATAA CATACTAAAT GATACGTAAA ATCAAATAAA 840

ACATAGGTGA TTTATTTTGG CTAAAGTAAA CTTAATAGAA AGTCCATATT CCTTTTACA 900 

ATTAAAAGGT ATAGGTCCTA AGAAAATAGA AGTATTGCAA CAACTAAATA TTCATACAGT 960 

GGAAGATCTT GTTCTTTATT TGCCAACTAG ATATGAAGAT AATACAGTGA 1010 
(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

TGTAGTTGAA CATGAACAAC AAAAGAAAGA AAAGACAAAA AAGCAATACA AGCCATTTTG 60 

GATTGTCATG AGTTTTATAA TACTTATAGT TGTACTATTA CTCCCGGCAC CTTCAAGTCT 120 

GCCGATAATG GCTAAGGCAG TACTAGCTAT TTwAGCTTTT GCAGTTATTA TGTGGGTAAC 180 

GGAAGCTGTA TCATATCCGG TGTCAGCAAC TTTAATTATT GGCTTAATGA TATTACTTTT 240

AGGATTTAGC CCTGTTCAAA ATTTAGGGG\ GAAGCTAGGT AATCCGAAAA GTGGCAGTGC 300

TATTTTAGCT GGAAGTGACC TTCTAGGAAC TAATCATGCA TTATCATTAG CGTTTAGTGG 360

ATTTGCAACT TCAGCTGTAG CTCTCGTTGC AGCTGCATTA TTTTTGGCTG CTGCTATGCA 420

AGAAACGAAT TTGCATAAAA GACTAGCTCT TTTAGTGTTA TCAATTGTTG GTAAAAAAC 480

TAGAAATATA GTTATTGGAG CAATTATCGT TTCAATTGTA CTTGCATTTT TCGTTCCTTC 540

TGCAACAGCT AGAGCAGGGG CAGTTGTACC AATCTTGCTG GGTATGATTG CGGCATTTAA 600 

AGTTTCCAAA GATAGCAAGT TAGCGTCTTT ATTAATAATT ACTTCAGTAC AAGCTGTGTC 660 

AATTTGGAAT ATTGGTATCA AAACGGCGGC AGCACAAAAT ATCGTAGCGA TTAATTTTAT 720

AAACCATCAA TTAGGATTTG ATGTTTCATG GGGCGAGTGG TTCTTATATG CAGCGCCTTG 780

GTCCATAGTT ATGTCCGTAG CTTTATATTT CATCATGATT AAAGTGATGC CTCCAGAAAT 840

TAATACAATA GAAGGTGGTA AAGATTTAAT AAAAGffiGAA TTGCATAAAC TTGGCCCCGT 900

TAGCCCACGT GAATGGCGTT TAATTGTTAT ATCGATGTTA TTATTACTGT TTTGGTCAAC 960 

TGAAAAAGTA TTACATCCGA TTGACTCTGC ATCCATTACT ATTATTGCTT TAGGTGTTAT 1020 

GTTAATGCCG AAAATTGGTG TCATGACATG GAAACATGTT GAAAATAAAA TACCATGGGG 1080

AACAATTATC GTGTTTGGTG TAGGTATTTC ACTAGGTAAC GTTCTTTTGA AAACAGGTGC 1140

AGCTCAATGG TTAAGTGATC AAACTTTTGG TGTTTTAGGT TTAAAACATT TACCTATTAT 1200 

CGCGACAATT GCACTTATCA CGCTTTTTAA TATATTGATT CATTTGGGCT TTGCGAGTGC 1260

AACAAGTTTA TCATCAGCGT TAATACCTGT TTTTATTTCG CTAACCTCTA CGTTACACTT 1320

AGGAGACCAG TCTATAGGAT TTGTTTTAAT TCAACAATTT GTTATTAGTT TTGGTTTCTT 1380

ATTACCTGTT AGTGCACCTC AAAATATGTT GGCTTATGGC ACTGGTACTT TTACGGTTAA 1440

AGATTTCTTG AAGGCAGGTA TACCATTGAC AATTGTAGGG aAE.tctAgT GaTAGTTTTT 1500 

AGCATGACTT ATTGGAAATG GGTAAGGTTG CnTTAATTAA 1540
(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11823 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

ACTTCTCACA ATAAGAAATA TGAAATTGTT ATGTGTTAGT TGAGATTCAG TGATGAATTA 60 

CTTTTATCAT TTAAAATGTT GTTATCATTG TCATGCGTTA CCAAATCGCT TACGTATACA 120 

CGATTCCCAA TCTTAACATA GACGATTTGT ATATCAGAAT TTTCTGATTA CTAACAGTTT 180

ACCTAAGTTT AAATATCTGT TCAATGATTT TCAGTTATTT TTAAAAGAAA AATCGTAATG 240 

CTGCCATGAT AACAATCCCA CTAATAATTG TAATAGTTAA AtACGCGTGA TTATAGATAA 300

AATAACCGTC GGAATGAGCG CGATAATGTA AGGGATGTTT AATGTATACC CCTCACCATG 360

AGGCGTCTGT TGAATAATGC TGTCAATGAC AAGTGCCGTA AATAGTGTGA TTGGGATAAA 420

TGATAGCCAT CGAACCACGA CATCAGGCAA TTGCACTTTT GAAATCATGA TAAAAGGTAT 480

AATTCGAATT AATAGCGTTA CGATACCACA CAATAAAATA AGTATAACA TGTTCATATG 540

AGTTATCATT GTTCCATCAT CACTCCTAAC GCTGCTGAAA TTGTGGCTGC AATTAATATT 600

GCTAGATATG AAGGCATAAA CATACTTAGC GATAACATCA TTACTATGAC GGCAATAATG 660

AGTACTATGT AAATTCTTAA TCGCGATTTA GTAATTGATT CAAATTGCGC AATGGCCAAA 720

AAGATAAACA TAGCCGTGAT AGCAAAATCT AACCCTAGCG TTTGCGGATT TGAGATATAT 780

TCGCCAAATA AAGCCCCAGC TACACATGAA ATTGCCCAAA ATAAATATGC TGTGATGTTA 840

AGACCATGCA TCCAACGATC ATTGATAGCT TCTCCTTTTA AATAAGGTGT AATGGCGACG 900

CCAAACGTTT CGTCAGTTAC TAATGASCCT AATCCAACAC GGTTCCAAAA CCCATATGTC 960

TTGAAGTTTG GTGCAAGCGA CATACTTAAA AGGAACATTC TTGAATTTAC GATAAATACA 1020

GTTAGTACAA TCGCTGATAT AGGTGTACCT GCTATAAACA ACGCGCACAT AATAAATTGC 1080

GCAgcaCCGG CATATATAAC AAGACATAAC AAGACAATTT CTAAAATACT AAGTTTTGA 1140

GACGAAGCCA CAATACCAAA TGAAATACCA ACACCGGCAT AACCCAATAA TGTTGGGATA 1200 

CACTCTTGCA CGCCTTGTCT AAAACTTAAA TGTGTTGTCA TCTCAATTAC CTCCTTTGCC 1260 

ACTTTATATT TATATAATTA CGAAAAAATA ATATACCTAG TCTAATACAG TTCTATACCC 1320

TAAGCAATAA CATTAGACAT CAGTTTGTCT GAGGTTAGAC ATTCCGGAGT CTTTAGTCAG 1380

CTTCATATTA ACTTTTTATT TTTGAGAATT TTCAATTTTT TATTTAAGAC TACCTCCATA 1440

TTTTCTATGG aTTTGTAGTT GTTTTTAAGT ATCAATTTTA TAAATTTTTA TATCTGATGA 1500 

TGAGTCTGGG aTATTGaTTC ATGTACCACT CCCTTaTaAT CATCCCCTCC CCCTaCCCTA 1560

CTCCATCGAT ATAACTCATA CTACATATCA ACGAAATCAG TATTTTATCG CTTCCTTTCC 1620

TATATTAGTG ATGCTCAAAC TTGTTACGTT TTAGATTGTT TTAGTTCATC ATAATTATCC 1680

CGTATTGTTG CTATAATGAA ATGCGTTCAC CCCATTAAAC CACAAACTTA ATTTATTGTT 1740

GTTATGTGCA TTGGCTCACT ATTATATTTT TACAGCACAA AAAAAGTGGC GACAGTTCGT 1800 

CACCACTTTT TAAAATATTA TTTAAAGTAT CTTGCCCTTG CTTTAAGTAT ACGTAGATAT 1860

ATACTTTTTA AAGCTTGTAG CTAAAGCCTT TATTTAACTG GTTTTGAAAT TTGTGTTTTA 1920

CCACCCATAA ATGGTACTAA TGCTTCTGGA ATTGTTACTG TTCCATCTTC ATTTTGGTAA 1980

TTTTCAACAA TAGCAGCAAA TGTACGTCCA ACTGCTAAAC CACTACCATT TAATGTATGT 2040

GCTAATTCTG GTTTAGCTGC TTTGTCACGC TTGAAGCGGA TGTTAGCACG ACGCGCTTGG 2100 

AAATCCGTAC AGTTTGAGCA TGAACTAATT TCTTTATAAT STTGTAGCT TGGTAACCAA 2160 

ACTTCTAAAT CATATGTTTT GCTTGCACTA AATCCAATAT CACCTGTACA TAAAATAACA 2220

CGACGGTATG GTAAACCTAA CTCTTCTAGA ATTGCTTCTG CGTTTGTTGT CATTTCTTCT 2280 

AAAGCATTCC ATGAATCTTC AGGTTGTTCA AAACGTACCA TTTCCACTTT ATCGAATTGA 2340

TGTAAACGAA TTAATCCTCT TGTATCTCTA CCTGCTGATC CTGCTTCACT ACGGAAACAT 2400

GCAGATTGAC CAGTGAATTT TTCAGGAAGT ACACCTGGTT GAATAATTTC ATTACGGTAG 2460

AAATTCGTTA ATGGTACTTC AGCAGTTGGA ATTGTATATA ATCCTTCTTT TTCTACTTTA 2520

AATAAATCTT CTTCAAATTT AGGTAATTGA CCTGTACCAT ACATTGTATC TGCGTTCACA 2580